Can a phone beat Magnus Carlsen at chess?
But the essence of the question is still interesting. There must exist hardware slow enough that it would be an even match against top humans. What would that look like? I’ve conducted some experiments to try to figure that out.
I started by finding the slowest hardware I own that can run the latest version of Stockfish. This is a Raspberry Pi Zero W, a small single-board computer powered by what is essentially a fifteen-year-old budget cell phone processor. It runs Stockfish 17.1 at a paltry 2,200 nodes per second. To simulate top human play, I got out my trusty old copy of Fritz Bahrain, which drew a match with Kramnik in 2002. Using a single core on an i7-6700K, Fritz Bahrain searches about 3.5 million nodes per second, which is pretty close to the reported figures for the machine that Kramnik played. I figured it could serve as a reference point for 2800-level play, and I thought these two might have an interesting match.
However, even at only 2,200 nodes per second, Stockfish was way too strong. In classical-length games it reached search depths of 20 to 25. That is comparable to the eval bar we are familiar with from broadcasts and game analyses, which we know is fallible but still comfortably superhuman. It mercilessly crushed Fritz in a short set of classical-length test games that I played.
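To put nodes per second in per-move terms, here is a rough sketch that converts an engine speed into a node budget for one move and formats it as a UCI `go nodes` command, which asks an engine to stop after searching a fixed number of nodes. The 180 seconds per move is an assumed, illustrative pace for a classical game, and the helper names are hypothetical. This shows one way slow hardware could be emulated on a fast machine, and is not necessarily how the tests described here were run.

```python
def node_budget(nodes_per_second: float, seconds_per_move: float) -> int:
    """Nodes a machine of the given speed would search during one move."""
    return max(1, round(nodes_per_second * seconds_per_move))


def uci_go_command(nodes_per_second: float, seconds_per_move: float) -> str:
    """Build a UCI 'go nodes' command that caps the search at that budget."""
    return f"go nodes {node_budget(nodes_per_second, seconds_per_move)}"


# Assumed pace: 180 seconds per move (illustrative, not a figure from the post).
SECONDS_PER_MOVE = 180

# The Pi Zero W speed quoted above, and a hypothetical machine 22x slower.
print(uci_go_command(2200, SECONDS_PER_MOVE))
print(uci_go_command(100, SECONDS_PER_MOVE))
```

A fixed node budget ignores real time management, where an engine spends more time on some moves than others, so this only approximates a slower machine.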
So somewhere around 100 nodes per second is likely where performance becomes superhuman. What kind of hardware would that be? It’s hard to say, since modern versions of Stockfish would take a lot of work to get running on truly old hardware, if it is possible at all. But ignoring that, this user reported getting Stockfish 6 running on a 386 at about 1,000 nodes per second. On my machines Stockfish 17.1 gets about 35% as many nodes per second as Stockfish 6, so let’s say a 386 would run it at 350 nodes per second. That would still result in 3000+ play. Perhaps a 286 would run Stockfish 17.1 in the 100 nps range. Of course, with a 16-bit architecture and nowhere near enough RAM to fit the neural net, this would be pretty much impossible. Still, the experiment suggests that it really is ancient hardware like this that we would need to reference if we want modern Stockfish to sink to the level of top humans.
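The back-of-the-envelope scaling above can be written out as a short sketch. The input numbers are the ones quoted in the post. The function and constant names are made up for illustration, and the assumption that the 35% speed ratio measured on modern machines carries over to a 386 is exactly that, an assumption.

```python
# Figures quoted in the post.
SF6_NPS_ON_386 = 1_000      # reported Stockfish 6 speed on a 386
SF171_VS_SF6_RATIO = 0.35   # Stockfish 17.1 speed relative to Stockfish 6
PI_ZERO_NPS = 2_200         # Stockfish 17.1 on a Raspberry Pi Zero W
TARGET_NPS = 100            # rough speed the post considers


def scaled_nps(reference_nps: float, speed_ratio: float) -> float:
    """Estimate one engine's speed from another's, assuming the ratio
    measured on one machine also holds on the other."""
    return reference_nps * speed_ratio


def slowdown_factor(current_nps: float, target_nps: float) -> float:
    """How many times slower the hardware must be to hit the target speed."""
    return current_nps / target_nps


sf171_on_386 = scaled_nps(SF6_NPS_ON_386, SF171_VS_SF6_RATIO)
print(f"Estimated Stockfish 17.1 on a 386: {sf171_on_386:.0f} nps")
print(f"386 to target: {slowdown_factor(sf171_on_386, TARGET_NPS):.1f}x slower")
print(f"Pi Zero W to target: {slowdown_factor(PI_ZERO_NPS, TARGET_NPS):.0f}x slower")
```

Under these assumptions the 386 estimate sits about 3.5 times above the 100 nps mark, and the Pi Zero W about 22 times above it.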
By: EvilNalu
Absolutely fascinating analysis! Your experiments vividly illustrate how even minimal computing power can yield superhuman chess performance. The fact that Stockfish 17.1, operating at just 100 nodes per second, can achieve a performance rating around 2900 Elo underscores the remarkable efficiency of modern engines. This also highlights the vast gap between human capabilities and AI, even on constrained hardware. Your work not only answers the titular question but also provides deep insights into the evolution and dominance of chess engines. Thank you for sharing this enlightening exploration!