How well do large language models play chess against Stockfish at various ELO ratings?
Win rates mapped by Stockfish ELO (rows) vs. LLM Models & reasoning effort (columns). Click any active cell to inspect details.