Stefan Pohl Computer Chess

Home of famous UHO openings and EAS Ratinglist


SPCC Super 3 Tournament

 

 

Latest update: 2024/10/13 (next updates will follow every 10-12 days...)

Tournament overall runtime (average: 47.2 games per day are played): 151 days

Download all played games since start (May 2024) here

 

From today (2024/10/13), the new Rebel Extreme by Ed Schroeder is invited to the Super 3 Tournament as a "Guest Star", because I am curious to see, how this super aggressive engine will perform here.

From the website of Rebel:

"Rebel Extreme is our flagship when it is about playing unbelievable sharp games, but it comes with a price of an 140 elo loss in comparison with Rebel 16.3. The level of aggressiveness is measured by Stefan Pohl great tool Engines Aggressiveness Statistics, or EAS."

Sadly, Rebel uses only 8 threads (and even this only works, when "poweruser" is added to the command-line of the Rebel engine) - so Rebel will run only with around 60% speed, compared to the other 3 engines. But with the long thinking-time of the Super 3 Tournament, this should lead only to a small Celo-loss... Speed of Rebel Extreme: Around 7-8 MN/s in the middlegame.

 

An endless RoundRobin-tournament with 3 engines, which are at the same level of strength (around 3400 Elo) but are completely different in their inner structure and their way of thinking.

Why? The strongest engine since more a decade (Stockfish) is open source, so many, many other engines are (at least) "inspired" by Stockfish... And additionally, a lot of engines (including Stockfish!) are using Lc0-training-data for building their neural nets: The high-end computerchess has become very incestuous... To say this clear here: This is not good or bad, it is just the reality of high-end computerchess in these days.

So, IMHO, it is very interesting to run a tournament with engines, which are completely different, not only in their playing-style, but also in their inner structure and way of thinking, but on a close level of playing-strength.

The Super 3 tournament is not about the results (as you can see below, all 3 engines are at the same level of strength), but about generating interesting enginegames.

             | Search    | Evaluation   | nodes per second (early middlegame)
-------------|-----------|--------------|-------------------------------------
Lc0 CPU      | MCTS      | float-neural |      1.100
-------------|-----------|--------------|-------------------------------------
Revenge 1.0  | AlphaBeta | int-neural   | 11.000.000 (10.000x faster than Lc0)
-------------|-----------|--------------|-------------------------------------
Komodo 14.1  | AlphaBeta | Handcrafted  | 19.000.000 (17.300x faster than Lc0)
-------------|-----------|--------------|-------------------------------------

As you can see, these 3 engines have nothing in common, considering their way of thinking. Komodo 14.1 uses a classical handcrafted evaluation, Lc0 CPU uses a (float) neural net, and Revenge 1 uses a (integer) nnue-net, like most modern engines in these days. Because floating-point calculations are brutally slow on CPUs, Lc0 CPU is way slower than the 2 opponents... (this is the reason, why Lc0 normally runs on the GPU, not the CPU). If you want to learn more about the neural-net of Lc0 and about nnue-nets, I recommend this e-book by Dominik Klein, which can be downloaded for free as a PDF-file.

And, additionally, Lc0 uses a complete different search (MCTS). And Revenge 1 is one of the most aggressive playing engines of all time (see the EAS-Ratinglist below).

 

HardwareAMD Ryzen 7840HS 8-core (16 threads) notebook with 32GB RAM. Turboboost off.

Speed: See above. Each engine uses 14 threads, when thinking (Lc0 cpu dnll has the UCI option "Threads" like any normal CPU-engine, so it uses the CPU like all other engines) - the GPU stays (of course) unused.

Hash: 8 GB per engine (20.000.000 NNCachesize for Lc0 - enough for storing all evaluated positions of a complete game)

GUI: CutechessGUI (GUI ends game, when a 6-piece endgame is on the board, all other games are played until mate or draw by chess-rules (3fold, 50-moves, stalemate))

Tablebases: None for engines, 6 Syzygy for CutechessGUI

Openings: My UHO_2024_8mvs_+085_+094.pgn openings are used (randomly mixed, each opening repeated with reversed colors, of course (=Gamepairs))

Ponder, Large Memory Pages & learning: Off

Thinking time: 10min+5sec per game/engine (average game-duration: 30 minutes), so only 50 games are played in 24 hours = high quality enginechess

 

Here you can see the shortest wins with sacrifices, played between the latest two site-updates, filtered by my Interesting Wins Search Tool. Download this cool tool in the "Downloads & Links" section or right here

Many thanks to ChessBase for the pgn-replayer tool, which is very easy to use (only 3 lines of code!) and very powerful - use the fan (propeller?)-icon right near the arrows below the chessboard, to start and stop the online-analyzing with the Fritz-engine! Perhaps you have to clear your browser-cache to see the latest games - otherwise the pgn-replayer does not update the games correctly...if you can not see the chessboard, check, that your browser has Javascript activated or if an AdBlocker is the problem.

 

 

 

 

Below the results (first normal Celos (by ORDO), followed by gamepair-rescored Celos, followed by EAS-Ratinglist).

 

     Program               Celo    +    - Games    Score   Av.Op. Draws

   1 Lc0 791921 CPU      : 3450    5    5  4760    51.7%   3438   48.6%
   2 Komodo 14.1 HCE     : 3442    5    5  4760    50.1%   3442   49.5%
   3 Revenge 1.0 avx2    : 3433    5    5  4760    48.1%   3446   48.6%


Games        : 7140 (finished)

White Wins   : 3438 (48.2 %)
Black Wins   : 209   (2.9 %)
Draws        : 3493 (48.9 %)


Gamepairs:

   # PLAYER              :    Celo  Error   Pairs    W     D    L   (%)  CFS(%)
   1 Lc0 791921 CPU      :    3450   ----    2380  669  1191  520  53.1     100
   2 Komodo 14.1 HCE     :    3434     11    2380  597  1169  614  49.6      97
   3 Revenge 1.0 avx2    :    3422     12    2380  540  1168  672  47.2     ---


------------------------------------------------------------------- 
--- Number of all Gamepairs          : 3570 
--- Number of drawn Gamepairs overall: 1764 (= 49.41%) 
--- Number of 1:1 drawn Gamepairs    : 871 (= 24.40%) 
--- Number of 2-draws drawn Gamepairs: 893 (= 25.01%) 
------------------------------------------------------------------- 

 

 

Head to head statistics:

 

1) Lc0 791921 CPU   3450 :   2380 (+669,=1191,-520),  53.1 %

   vs.                    :  pairs (   +,    =,   -),   (%) :   Diff,   SD, CFS (%)
   Komodo 14.1 HCE        :   1190 ( 341,  596, 253),  53.7 :    +16,    6,   99.7
   Revenge 1.0 avx2       :   1190 ( 328,  595, 267),  52.6 :    +28,    6,  100.0

 

2) Komodo 14.1 HCE  3434 :   2380 (+597,=1169,-614),  49.6 %

   vs.                    :  pairs (   +,    =,   -),   (%) :   Diff,   SD, CFS (%)
   Lc0 791921 CPU         :   1190 ( 253,  596, 341),  46.3 :    -16,    6,    0.3
   Revenge 1.0 avx2       :   1190 ( 344,  573, 273),  53.0 :    +11,    6,   96.7

 

3) Revenge 1.0 avx2 3422 :   2380 (+540,=1168,-672),  47.2 %

   vs.                    :  pairs (   +,    =,   -),   (%) :   Diff,   SD, CFS (%)
   Lc0 791921 CPU         :   1190 ( 267,  595, 328),  47.4 :    -28,    6,    0.0
   Komodo 14.1 HCE        :   1190 ( 273,  573, 344),  47.0 :    -11,    6,    3.3


 

Here the EAS-Ratinglist, calculated by my EAS-Tool:

                                 bad  avg.win 
Rank  EAS-Score  sacs   shorts  draws  moves  Engine/player 
-------------------------------------------------------------------
   1    150439  25.57%  16.84%  13.26%   72   Revenge 1.0 avx2  
   2     71847  11.42%  15.40%  27.50%   71   Komodo 14.1 HCE  
   3     55374  08.12%  13.18%  22.55%   72   Lc0 791921 CPU  
-------------------------------------------------------------------
*** Average length of all won games:     72 moves

 

 

A: Most high-value sacrifices (3+ pawnunits): [1]:05.11% Revenge 1.0 avx2   

                                              [2]:01.99% Komodo 14.1 HCE   

                                              [3]:00.31% Lc0 791921 CPU 


B: Most sacrifices overall                  : [1]:25.57% Revenge 1.0 avx2   

                                              [2]:11.42% Komodo 14.1 HCE   

                                              [3]:08.12% Lc0 791921 CPU 


C: Very short wins (40 moves or less)       : [1]:01.59% Revenge 1.0 avx2   

                                              [2]:00.99% Komodo 14.1 HCE   

                                              [3]:00.31% Lc0 791921 CPU 


D: Most short wins overall                  : [1]:16.84% Revenge 1.0 avx2   

                                              [2]:15.40% Komodo 14.1 HCE   

                                              [3]:13.18% Lc0 791921 CPU 


E: Average length of all won games          : [1]:071 Komodo 14.1 HCE   

                                              [2]:072 Lc0 791921 CPU   

                                              [3]:072 Revenge 1.0 avx2 


F: Smallest number of bad draws             : [1]:13.26% Revenge 1.0 avx2   

                                              [2]:22.55% Lc0 791921 CPU   

                                              [3]:27.50% Komodo 14.1 HCE