Gemini performance on FDCL/basic, broken down into groups

What this table contains:

Play mode: mStar (shown as m* in the table) is the number of move attempts until “mastery was demonstrated”, i.e. until the bot makes 10 correct moves in a row. The smaller the number, the faster the system learns. “Infinity” means that the run had to be interrupted (usually after ca. 300 move attempts) before such mastery could be demonstrated.
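
As an illustration of the mastery criterion, here is a minimal Python sketch of how such a number could be computed from a log of move outcomes. The function name `moves_until_mastery`, its parameters, and the list-of-booleans input are hypothetical and are not taken from the actual experiment code. The text does not say whether the 10 moves of the winning streak are themselves counted, so the sketch exposes that choice as the `include_streak` flag rather than assuming one convention.

```python
from typing import Iterable


def moves_until_mastery(outcomes: Iterable[bool],
                        streak: int = 10,
                        include_streak: bool = True) -> float:
    """Return the number of move attempts until `streak` correct moves in a row.

    `outcomes` is a hypothetical log: True for a correct move, False otherwise.
    Returns float("inf") if the streak never occurs, which mirrors the
    "Infinity" entries used for interrupted runs.
    """
    run = 0  # length of the current run of consecutive correct moves
    for attempt, correct in enumerate(outcomes, start=1):
        run = run + 1 if correct else 0
        if run == streak:
            # Assumption: either count through the end of the streak,
            # or only the attempts made before the streak began.
            return attempt if include_streak else attempt - streak
    return float("inf")


# Example: two failed attempts, one correct move in between, then 10 correct moves.
example_log = [False, True, False] + [True] * 10
m_star = moves_until_mastery(example_log)  # 13 under the default convention
```

With `include_streak=False` the same log gives 3, so the two conventions differ only by the streak length.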

Prepared-episodes mode: the number of future boards, out of 15, for which the bot could propose an error-free clearing plan (15 = perfect result, 0 = no successes).
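
Table columns:
Rule set
Play-mode runs: m* (moves until mastery)
Prepared-episodes runs: good future episodes (out of 15), in these sub-columns:
    Instructions w/o examples
    Directory prepared-1-b (1 prepared episode)
    Directory prepared-2-b (2 prepared episodes)
    Directory prepared-3-b (3 prepared episodes)
    Directory prepared-5-b (5 prepared episodes)
    Directory prepared-10-b (10 prepared episodes)
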
Group 1: all pieces are movable. Static bucket assignment
cm_KRBY2210 15 10 10 9
cm_RBKY2010 15 10 7 15
quadMixed12710 15 15 10 10
quadNearby115 15 11 9 14
sm_csqt10 0 15 2 10 0
sm_qcts145 15 15 5 4
Group 2: all pieces are movable. Bucket determined by some sequence independent of game piece
buckets_21301715 10 10 10 15
ccw60 15 15 15 10 15
cw3715 15 15 15 10
cw_0123615 10 2 12 10
Group 3: only some pieces are movable, according to some rule. Bucket does not matter
allOfColOrd_BRKY392 0 0 0 0
allOfColOrd_KRBY373 1 1 0 0
allOfShaOrd_csqt1050 0 0 0 0
allOfShaOrd_qcts300 0 0 0 0
col1Ord_BRKYInfinity0 0 0 0 0
col1Ord_KRBY0 0 0 0 0
colOrdL1_BRKYInfinity0 0 0 0 0
colOrdL1_KBYR0 0 0 0 0
ordL1160 0 0 0 0
ordRevOfL1200 2 0 0 0
sha1Ord_csqt770 0 0 0 0
sha1Ord_qcts0 0 0 0 0
shaOrdL1_csqtInfinity0 0 0 0 0
shaOrdL1_qcts0 0 0 0 0
Group 4: only some pieces are movable, according to some rule. Static bucket assignment
cm_RBKY_cw_0123710 0 1 1 0
col1OrdBuck_BRKY0213951 0 1 0 0
col1OrdBuck_BRKY31200 0 2 0 0
ordL1_Nearby60 0 5 0 5
ordRevOfL1_Nearby860 0 0 0 0
ordRevOfL1_Remotest1480 5 0 0 0
sha1OrdBuck_qcts02131630 0 0 0 0
sha1OrdBuck_tqsc02130 0 0 1 0

Performance observations: