Revolution - Revolution is a free, open-source UCI chess engine derived from Stockfish. Jorge Ruiz Centelles, with credit to ChatGPT, modifies and extends the code to explore new concepts. The engine implements cutting-edge search algorithms combined with neural network evaluation. Derived from fundamental chess programming principles, Revolution analyzes positions through parallelized alpha-beta search enhanced with null-move pruning and late move reductions.
As a UCI-compliant engine, Revolution operates through standard chess interfaces without an integrated graphical interface. Users must employ compatible chess GUIs (Arena, Scid vs PC, etc.) for board visualization and move input. Consult your GUI documentation for implementation details.
Revolution 2.0.0 what's new?
Time control: 10+0.1 (rapid with small increment)
Engines tested: Revolution (baseline) vs Revolution1 (new version)
Games played: 2178
Book: UHO_2024_8mvs_+085_+094.pgn (balanced/unbiased openings, up to 8 moves)
Hash: 32 MB (kept deliberately small for throughput)
Threads: 1
Interpreting Results
Score %: 47.54% for Revolution (baseline) → means Revolution1 scored higher (52.46%).
What difference:
-17.08 ± 7.97 → Revolution1 is about +17 Elo stronger (statistically).
nElo -31.33 ± 14.59 → normalized Elo suggests an even wider gap, ~+31 Elo for Revolution1.
LOS (Likelihood of Superiority): 0.00% → from the chosen perspective, Revolution did not outperform; Revolution1 was consistently better.
Draw ratio: 49.59% — fairly typical for this TC and book.
Win/Loss ratio: 483W / 590L → Revolution1 wins ~22% more decisive games than it loses.
Pentanomial vector: [26, 294, 540, 219, 10] → distribution of outcomes by colour and result.
LLR (Log-Likelihood Ratio): -1.16 → indicates the SPRT test would reject the hypothesis that Revolution is stronger; in fact, Revolution1 is favoured.
Bottom Line
Revolution (revolution 2.0) gained around +20 to +30 Elo over Revolution in this test set.
With 2178 games, the confidence interval is already quite tight (±8 Elo).
The LOS of 0% means we can confidently say Revolution1 is not weaker—it is stronger.
Comments
Post a Comment