Fischer 960 (FRC) Engine Gauntlets

Komodo 11.2 - Stockfish 8

First round: Hyperbullet 10+0.1, 1 thread, AMD Opteron 2,4Ghz, 128MB Hash

(komodo depth in start pos: 12-14 sf:12-16)
Score of Komodo-11.2 vs Stockfish-8: 266 - 454 - 424  [0.418] 1144
Elo difference: -57.62 +/- 16.05
White Score  : 52.4 %
     Program            Elo      +      -   Games   Score   Av.Op.  Draws

   1 Stockfish-8    : 2329.1    0.0    0.0  1144    58.2 %   2271   37.1 %
   2 Komodo-11.2    : 2270.9    0.0    0.0  1144    41.8 %   2329   37.1 %
On my machine with longer timecontrol (eg 180+2), faster machine and 9 threads it looks to me, that komodo plays better.
So I tried to increase tc and added an cpu, at least the elo different should get smaller...

Second round: Bullet 60+1, 2 threads, AMD Opteron 2,4GHz, 512MB Hash

(komodo depth in start pos: 16-18 sf:16-21)
  White Score  : 53.4 %
  Program            Elo       +      -    Games   Score   Av.Op.  Draws
  1 Stockfish-8    : 2315.3    0.0    0.0  1000    54.4 %   2285   50.5 %
  2 Komodo-11.2    : 2284.7    0.0    0.0  1000    45.6 %   2315   50.5 %
Ok, much better, the diffenece is getting smaller, but still not what I like to see ;) So more time and threads:

Third round: Bullet 120+1, 4 threads, AMD Opteron 2,4GHz, 2048MB Hash

(komodo depth in start pos: 16-20 sf:19-21)
Score of Komodo-11.2 vs Stockfish-8: 210 - 251 - 539  [0.479] 1000
Elo difference: -14.25 +/- 14.61
White Score  : 55.1 %
   Program            Elo       +      -    Games   Score   Av.Op.  Draws
   1 Stockfish-8    : 2307.2    0.0    0.0  1000    52.0 %   2293   53.9 %
   2 Komodo-11.2    : 2292.8    0.0    0.0  1000    48.0 %   2307   53.9 %
Ok, its getting closer!
Lets see, how much komodo gains with Contempt=0. I expect them to be pretty equal, maybe the draw rate goes up?

Komodo 11.2 Contempt=0, Bullet 120+1, 4 threads, AMD Opteron 2,4GHz, 2048MB Hash

Score of Komodo-11.2 vs Stockfish-8: 184 - 223 - 593  [0.480] 1000
Elo difference: -13.56 +/- 13.72
White Score  : 56.0 %
  1 Stockfish-8    : 2306.8    0.0    0.0  1000    52.0 %   2293   59.3 %
  2 Komodo-11.2    : 2293.2    0.0    0.0  1000    48.0 %   2307   59.3 %
Hmm, Contempt=0 doesnt gain anything, it only raises the draw rate!

Komodo 11.2 Contempt=0, Blitz 180+2, 4 threads, AMD Opteron 2,4GHz, 2048MB Hash

Score of Komodo-11.2 vs Stockfish-8: 170 - 226 - 604  [0.472] 1000
Elo difference: -19.48 +/- 13.53
   White Score  : 54.1 %
  1 Stockfish-8    : 2309.8    0.0    0.0  1000    52.8 %   2290   60.4 %
  2 Komodo-11.2    : 2290.2    0.0    0.0  1000    47.2 %   2310   60.4 %
Hmm, komodo doesnt get better, even with longer tc
For fun, I just started an round of:

McBrain v2.2 - SF 8, Hyperbullet 10+0.1, 1 thread, on my i7-4930K CPU @ 3.40GHz, 128MB Hash

(depth 14-17 for both)
Score of McBrain-22 vs Stockfish-8: 234 - 134 - 632  [0.550] 1000
Elo difference: 34.86 +/- 13.00

White Score  : 54.8 %
Num. Name            games   score 
   0 McBrain-22       1000     550 
   1 Stockfish-8      1000     450 
Rank Name          Elo    +    - games score oppo. draws 
   1 McBrain-22     14   16   16  1000   55%   -14   63% 
   2 Stockfish-8   -14   16   16  1000   45%    14   63% 
Impressive draw rate ^^ (but its a SF clone...)
But that means, mcbrain is the one to catch...

McBrain v2.2 - Komodo 11.2 Contempt=0, Hyperbullet 10+0.1, 1 thread, on my i7-4930K CPU @ 3.40GHz, 128MB Hash

(depth komodo 12-15, mcbrain 12-17)
should be around 70 difference...
  Score of McBrain-22 vs Komodo-11.2: 373 - 194 - 433  [0.590] 1000
  Elo difference: 62.87 +/- 16.26

  White Score  : 51.6 %
  Num. Name            games   score 
   0 McBrain-22       1000   589.5 
   1 Komodo-11.2      1000   410.5 
  Rank Name          Elo    +    - games score oppo. draws 
   1 McBrain-22     28   17   17  1000   59%   -28   43% 
   2 Komodo-11.2   -28   17   17  1000   41%    28   43% 

lets see how this is with dynamism=80. I expect the draw rate to go up, if komodo would win more... i dont know

McBrain v2.2 - Komodo 11.2 Contempt=0, Dynamism=80, Hyperbullet 10+0.1, 1 thread, on my i7-4930K CPU @ 3.40GHz, 128MB Hash

(depth komodo 12-15, mcbrain 12-17)
Score of McBrain-22 vs Komodo-11.2: 465 - 173 - 362  [0.646] 1000
Elo difference: 104.49 +/- 17.50
Num. Name            games   score 
   0 McBrain-22       1000     646 
   1 Komodo-11.2      1000     354 
Rank Name          Elo    +    - games score oppo. draws 
   1 McBrain-22     48   18   18  1000   65%   -48   36% 
   2 Komodo-11.2   -48   18   18  1000   35%    48   36% 

Oh, dynamism 80 seems like a bad idea...
The difference between a fresh CFish and mcbrain isnt realy noticable:

CFish-120817 - McBrain v2.2, Hyperbullet 10+0.1, 1 thread, on my i7-4930K CPU @ 3.40GHz, 128MB Hash

Score of McBrain-22 vs CFish-120817: 152 - 185 - 663  [0.483] 1000
Elo difference: -11.47 +/- 12.48
This all (except the dynamism experiement) piped to ordo:
   # PLAYER          :  RATING  POINTS  PLAYED   (%)
   1 CFish-120817    :  2333.3   516.5    1000    52
   2 McBrain-22      :  2321.8  1623.0    3000    54
   3 Stockfish-8     :  2286.6   450.0    1000    45
   4 Komodo-11.2     :  2258.3   410.5    1000    41