Compare rule set difficulty for human players

Choose experiment plans

Note: this tool also has a command-line interface, scripts/analyze-transcripts-mwh.sh, which offers more modes and options than the web interface. Documentation is here.


Choose one or several experiment plans from the list below. All rule sets included in those plans will be compared.

2PG/adve.test_1(7 players, 12 episodes)
2PG/adve.test_1/(13 players, 26 episodes)
2PG/adve.test_2a(7 players, 45 episodes)
2PG/adve.test_2b(6 players, 29 episodes)
2PG/coop.test_1(2 players, 2 episodes)
2PG/coop.test_1/(1 players, 2 episodes)
2PG/coop.test_2(7 players, 20 episodes)
2PG/test_adve(1 players, 1 episodes)
ad/adve.immovable(3 players, 3 episodes)
ad/coop(2 players, 4 episodes)
ad/coop.immovable(1 players, 1 episodes)
ad/trialFixed(3 players, 7 episodes)
APP/vm/image_test_01(2 players, 4 episodes)
FDCL/basic(223 players, 2343 episodes)
FDCL/trnsfr_1(8 players, 42 episodes)
FDCL/trnsfr_1/tools(1 players, 6 episodes)
pilot04(1 players, 3 episodes)
pilot06(1 players, 1 episodes)
pk/2pilot_cntxt(4 players, 5 episodes)
pk/cntxt2024JUN25(1 players, 3 episodes)
pk/colorVshape(1 players, 1 episodes)
pk/dev2024APR19(1 players, 1 episodes)
pk/dev2024MAY27(1 players, 6 episodes)
pk/movie(2 players, 2 episodes)
pk/pilot_cntxt(1 players, 4 episodes)
pk/position_A(2 players, 3 episodes)
pk/position_A/one_shape_one_color(1 players, 6 episodes)
pk/prod2024JUN15(1 players, 7 episodes)
R:/home/vmenkov/test/test-01.txt:FDCL/basic(5 players, 8 episodes)
R:CGS/1_sm.txt:APP/APP-no-feedback(4 players, 4 episodes)
R:CGS/3_bltr.txt:APP/APP-no-feedback(1 players, 5 episodes)
R:FDCL/basic/colOrdL1_BRKY:FDCL/basic(4 players, 9 episodes)
R:FDCL/basic/colOrdL1_KBYR:APP/APP-no-feedback(1 players, 1 episodes)
RU/JF/tht/exp1(46 players, 204 episodes)
RU/JF/tht/exp2(10 players, 44 episodes)
RU/JF/tht/exp2a(6 players, 36 episodes)
RU/JF/tht/exp2a_rev1(9 players, 50 episodes)
RU/JF/tht/exp2_rev1(8 players, 39 episodes)
RU/JF/tht/exp4(22 players, 43 episodes)
RU/JF/tht/exp4a(12 players, 24 episodes)
RU/JF/tht/exp4a_rev1(1 players, 2 episodes)
RU/JF/tht/exp4_rev1(1 players, 2 episodes)
RU/JF/transfer2(21 players, 114 episodes)
RU/JF/transfer2_patch(5 players, 23 episodes)
RU/JF/transfer_sala(35 players, 177 episodes)
RU/JF/transfer_sala2(122 players, 654 episodes)
vm/adve.colorVshape(34 players, 137 episodes)
vm/colorVshape(12 players, 16 episodes)
vm/composite-05(2 players, 2 episodes)
vm/coop.colorVshape(14 players, 67 episodes)
vm/exp4(1 players, 2 episodes)
vm/pilot06_doubling(3 players, 8 episodes)
vmColorTest(1 players, 1 episodes)

Stage 1: Transcript processing parameters


Learning attainment criterion (enter one):
consecutive error-free moves, or
R ≥
Default mStar = (You can also enter the value Infinity)
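As an illustration of the first criterion (a streak of consecutive error-free moves), here is a minimal sketch. The function name `moves_to_learn` and its arguments are hypothetical, and the choice to report the move at which the streak completes is an assumption, not necessarily what the tool does. The sketch does not model the R criterion or the default mStar value; it simply returns None when the streak is never reached.

```python
from typing import Optional, Sequence


def moves_to_learn(move_ok: Sequence[bool], streak_needed: int) -> Optional[int]:
    """Return the 1-based number of the move that completes the first run of
    `streak_needed` consecutive error-free moves, or None if there is no such run.

    `move_ok[i]` is True when move i+1 was error-free (hypothetical input format).
    """
    if streak_needed <= 0:
        raise ValueError("streak_needed must be positive")
    streak = 0
    for move_number, ok in enumerate(move_ok, start=1):
        # An error resets the streak; a correct move extends it.
        streak = streak + 1 if ok else 0
        if streak >= streak_needed:
            return move_number
    return None


# Example: one error on move 2, then three correct moves in a row.
print(moves_to_learn([True, False, True, True, True], 3))
```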

Stage 2: Data interpretation parameters for M-W

Naive: For each rule set, include only "naive" players (those who played this rule set as their first rule set).
Every: Consider each (rule set + preceding rule sets) combination as a separate experience to be ranked. (That is, the R1 data from R1:R2:R3, R2:R1:R3, and R2:R3:R1 are viewed as belonging to three distinct experiences: "R1", "R2:R1", and "R2:R3:R1".)
Ignore: When viewing a rule set's series, ignore preceding rule sets. (In other words, the R1 data from R1:R2:R3, R2:R1:R3, and R2:R3:R1 are merged and viewed as the same "R1" experience.)
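The three modes above differ only in how each player's data is labeled before ranking. The following minimal sketch shows that labeling, assuming each player's series is available as a tuple of rule set names in play order; the function `group_experiences` and its data layout are hypothetical and are not taken from the tool's code.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple


def group_experiences(
    series_list: Sequence[Sequence[str]], mode: str
) -> Dict[str, List[Tuple[int, int]]]:
    """Map each experience label to a list of (player index, position in series).

    mode is one of "naive", "every", "ignore" (names chosen for this sketch).
    """
    groups: Dict[str, List[Tuple[int, int]]] = defaultdict(list)
    for player, series in enumerate(series_list):
        for pos, rule_set in enumerate(series):
            if mode == "naive":
                # Keep only the first rule set each player saw.
                if pos != 0:
                    continue
                label = rule_set
            elif mode == "every":
                # The label records the rule set and everything that preceded it.
                label = ":".join(series[: pos + 1])
            elif mode == "ignore":
                # Merge all data for a rule set regardless of what preceded it.
                label = rule_set
            else:
                raise ValueError(f"unknown mode: {mode}")
            groups[label].append((player, pos))
    return dict(groups)


series = [("R1", "R2", "R3"), ("R2", "R1", "R3"), ("R2", "R3", "R1")]
for mode in ("naive", "every", "ignore"):
    print(mode, group_experiences(series, mode))
```

With the three series from the example in the text, the R1 data falls under the labels "R1", "R2:R1", and "R2:R3:R1" in "every" mode, under a single "R1" label in "ignore" mode, and only the first player's R1 data is kept in "naive" mode.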

Check to use mDagger instead of mStar
(only click once)