Stefan Pohl Computer Chess

Private website for chess engine tests


Latest website news (2020/06/03): The test run of Komodo 14 MCTS has finished: +8 Elo over Komodo 13.2 MCTS. Next AB test run: Stockfish 200601. New MEA results for all 3 training runs are online (along with the result of the S.Vieri t60-3972 (30x384) net).

The NN test run of Lc0 0.25.1 63651 has finished. See the result and download the games in the "NN vs SF testing" section. Next NN test run: Lc0 0.25.1 t60-3972.

 

 

I released the new NBSC Advanced Armageddon openings (NBSC = No Black Short Castling), including a new Armageddon scoring system that is better suited to engine chess. The testing results are just mind-blowing! Learn more in the "NBSC Armageddon openings" section or download the NBSC Armageddon openings right here

 

Stay tuned. And please stay at home, if possible. Fight Covid-19 and #FlattenTheCurve

 


Stockfish testing

 

Playing conditions:

 

Hardware: i7-6700HQ 2.6GHz Notebook (Skylake CPU), Windows 10 64bit, 8GB RAM

Fritzmark (single thread): 5.3 / 2521 (all engines run on one thread only). Average meganodes/s displayed by LittleBlitzerGUI: Houdini: 2.6 mn/s, Stockfish: 2.2 mn/s, Komodo: 2.0 mn/s

Hash: 512MB per engine

GUI: Since 19/09/11: Cutechess-cli (the GUI ends a game as soon as a 5-piece endgame is on the board); before that: LittleBlitzerGUI (draw at 170 moves, resign at -700cp)

Tablebases: None for engines, 5 Syzygy for cutechess-cli

Openings: HERT_500 testset (by Thomas Zipproth) (download the file at the "Download & Links"-section or here)

Ponder, Large Memory Pages & learning: Off

Thinking time: 180''+1000ms (= 3'+1'') per game/engine (average game duration: around 7.5 minutes). One 5000-game test run takes about 7 days.

The version numbers of the Stockfish engines are the date of the latest patch included in the Stockfish source code, written backwards (year, month, day), not the release date of the engine file (example: 170526 = May 26, 2017).

Since July 2018 I use the abrok compiles of Stockfish again (http://abrok.eu/stockfish), because they are now much faster than before: only 1.3% slower than the BrainFish compiles. So there is no longer any reason not to use these "official" development compiles.
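The version-number scheme (date of the latest patch, written as year, month, day) can be decoded mechanically. The following is only a minimal sketch: the function name `parse_version_date` is made up for this illustration, and it assumes that every version number has exactly six digits and belongs to the years 20xx.

```python
from datetime import date


def parse_version_date(version: str) -> date:
    """Decode a six-digit YYMMDD version number such as '170526'.

    Hypothetical helper for illustration; it is not part of any engine or GUI.
    """
    if len(version) != 6 or not version.isdigit():
        raise ValueError(f"expected six digits (YYMMDD), got {version!r}")
    year = 2000 + int(version[0:2])  # assumption: all versions are from 20xx
    month = int(version[2:4])
    day = int(version[4:6])
    return date(year, month, day)  # raises ValueError for impossible dates


print(parse_version_date("170526").isoformat())  # 2017-05-26
```

Because the digits run from year to day, sorting the version numbers as plain strings also sorts them chronologically.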

Download BrainFish (and the Cerebellum-Libraries) here

 

Each Stockfish version plays 1000 games against each of Komodo 14, Houdini 6, Fire 7.1, Xiphos 0.6 and Ethereal 12.00 (5000 games in total). All engines run with default settings.
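The stated figures (five opponents, 1000 games each, about 7.5 minutes per game, about 7 days per test run) can be checked against each other with a little arithmetic. This is a rough sketch only: the page does not say how many games run at the same time, so the last value is merely what the stated numbers imply, not a documented setting.

```python
# Figures taken from the playing conditions above.
OPPONENTS = ["Komodo 14", "Houdini 6", "Fire 7.1", "Xiphos 0.6", "Ethereal 12.00"]
GAMES_PER_OPPONENT = 1000
AVG_GAME_MINUTES = 7.5
STATED_RUN_DAYS = 7

total_games = len(OPPONENTS) * GAMES_PER_OPPONENT

# Duration if the games were played strictly one after another.
sequential_days = total_games * AVG_GAME_MINUTES / (60 * 24)

# Number of simultaneous games implied by the stated 7 days (an inference).
implied_parallel_games = sequential_days / STATED_RUN_DAYS

print(f"total games:            {total_games}")
print(f"sequential duration:    {sequential_days:.1f} days")
print(f"implied parallel games: {implied_parallel_games:.1f}")
```

Played one at a time, 5000 games would take about 26 days, so a 7-day test run implies roughly four games running in parallel.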

To avoid distortions in the Ordo Elo calculation, from now on only 2 Stockfish versions (the latest official release and the latest tested version), 1 asmFish and 1 BrainFish are kept in the gamebase; the games of all older engine versions are deleted each time a new version has been tested. The older Elo results of Stockfish, asmFish and BrainFish can still be seen in the Elo diagrams below. BrainFish always plays with the latest Cerebellum libraries, of course, because without them BrainFish = Stockfish.

 

Latest update: 2020/06/03: Komodo 14 MCTS (+8 Elo to Komodo 13.2 MCTS)

 

(Ordo-calculation fixed to Stockfish 11 = 3554 Elo)

 

See the individual statistics of engine-results here

See the ORDO-rating of the archive-gamebase since 2020 here

Download the current gamebase here

Download the archive-gamebase since 2020 here

 

     Program                      Elo    +    -   Games   Score   Av.Op.  Draws

   1 BrainFish-2 200329 bmi2    : 3620    9    9  5000    82.4 %   3341   33.6 %
   2 Stockfish 200519 bmi2      : 3569    9    9  5000    77.6 %   3344   41.5 %
   3 Stockfish 11 200118        : 3554    6    6 12000    78.2 %   3320   39.6 %
   4 Stockfish 10 181129        : 3508    5    5 16000    78.4 %   3272   37.9 %
   5 Stockfish 9 180201         : 3457    9    9  5000    74.9 %   3255   41.7 %
   6 Komodo 14 bmi2             : 3433    7    7  6000    54.2 %   3401   56.1 %
   7 Houdini 6 pext             : 3426    3    3 29000    64.0 %   3316   47.8 %
   8 Komodo 13.3 bmi2           : 3420    6    6  9000    58.5 %   3357   49.2 %
   9 Komodo 13.1 bmi2           : 3407    5    5 12000    62.0 %   3316   49.9 %
  10 Komodo 12.3 bmi2           : 3395    7    7  7000    62.7 %   3296   49.4 %
  11 Komodo 14 MCTS             : 3322    7    7  5000    44.4 %   3368   53.4 %
  12 Komodo 13.2 MCTS           : 3314    7    7  6000    44.1 %   3359   54.2 %
  13 Ethereal 12.00 pext        : 3300    5    5 11000    38.2 %   3397   47.1 %
  14 Ethereal 11.75 pext        : 3290    6    6  9000    39.3 %   3374   53.2 %
  15 Xiphos 0.6 bmi2            : 3280    5    5 15000    36.9 %   3388   50.8 %
  16 Fire 7.1 popc              : 3280    3    3 29000    45.2 %   3322   52.7 %
  17 Xiphos 0.5.6 bmi2          : 3269    6    6  8000    41.5 %   3335   54.7 %
  18 Ethereal 11.53 pext        : 3262    6    6  8000    42.1 %   3323   53.3 %
  19 Komodo 12.3 MCTS           : 3258    7    7  7000    42.7 %   3316   46.3 %
  20 Ethereal 11.25 pext        : 3254    7    7  6000    38.4 %   3344   51.0 %
  21 rofChade 2.3 bmi2          : 3239    7    7  6000    32.6 %   3377   47.4 %
  22 Booot 6.4 popc             : 3226    7    7  6000    31.1 %   3377   46.5 %
  23 Schooner 2.2 popc          : 3223    7    7  6000    31.3 %   3373   50.3 %
  24 Laser 1.7 bmi2             : 3201    7    7  6000    30.8 %   3353   45.8 %
  25 Fizbo 2 bmi2               : 3195    8    8  5000    36.0 %   3307   39.0 %
  26 Fritz 17                   : 3194    7    7  6000    29.4 %   3359   44.2 %
  27 Shredder 13 x64            : 3192    7    7  6000    31.9 %   3341   42.6 %
  28 Defenchess 2.2 popc        : 3187    8    8  5000    26.6 %   3376   41.8 %
  29 Booot 6.3.1 popc           : 3181    8    8  5000    34.0 %   3310   44.1 %
  30 Andscacs 0.95 popc         : 3150    9    9  5000    23.1 %   3373   35.4 %
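The Elo, Score and Av.Op. columns are linked by the usual Elo expectation: the larger the rating gap to the average opponent, the higher the expected score. The sketch below assumes the standard logistic Elo curve with a 400-point scale. Ordo fits all ratings simultaneously from the individual games, so a comparison against the average opponent is only an approximation and small deviations from the table are to be expected. The function name `expected_score` is made up for this illustration.

```python
def expected_score(elo: float, opponent_elo: float) -> float:
    """Expected score (0..1) under the standard logistic Elo model."""
    return 1.0 / (1.0 + 10.0 ** ((opponent_elo - elo) / 400.0))


# (name, Elo, Av.Op., Score in %) copied from three rows of the table above.
rows = [
    ("Stockfish 200519 bmi2", 3569, 3344, 77.6),
    ("Komodo 14 bmi2", 3433, 3401, 54.2),
    ("Andscacs 0.95 popc", 3150, 3373, 23.1),
]

for name, elo, av_op, table_score in rows:
    estimate = 100.0 * expected_score(elo, av_op)
    print(f"{name:<24} estimate {estimate:5.1f} %   table {table_score:5.1f} %")
```

Anchoring the list to Stockfish 11 = 3554 Elo only shifts all ratings by the same constant; the rating differences, and therefore the expected scores, stay the same.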

The version numbers of the engines (180622, for example) are the date of the latest patch included in the Stockfish source code, not the release date of the engine file. The asmFish engines in particular are often released much later!

Below is a diagram of the progress of Stockfish in my tests since the end of 2018, followed by the older diagrams.

 

You can save the diagrams (as JPG pictures in original size) on your PC by right-clicking them and choosing "save image"...

The Elo ratings of older Stockfish dev versions in the Ordo calculation can differ a little from the Elo "dots" in the diagram. The games of each new Stockfish dev version, once they are part of the Ordo calculation, can change the Elo ratings of the opponent engines, and that in turn can change the ratings of older Stockfish dev versions in the Ordo rating list. The diagram is not affected: each Elo "dot" is the rating of one Stockfish dev version at the moment its test run was finished.

 

 

 

