Maia-2: A Unified Model for Human-AI Alignment in Chess

Ashton Anderson; Difan Jiao; Jon Kleinberg; Reid McIlroy-Young; Siddhartha Sen; Zhenwei Tang

arxiv: 2409.20553 · v2 · pith:YV6AMWFAnew · submitted 2024-09-30 · 💻 cs.AI

Maia-2: A Unified Model for Human-AI Alignment in Chess

Zhenwei Tang , Difan Jiao , Reid McIlroy-Young , Jon Kleinberg , Siddhartha Sen , Ashton Anderson This is my paper

classification 💻 cs.AI

keywords humanchessskillalignmentlevelsmodeldecision-makinghuman-ai

0 comments

read the original abstract

There are an increasing number of domains in which artificial intelligence (AI) systems both surpass human ability and accurately model human behavior. This introduces the possibility of algorithmically-informed teaching in these domains through more relatable AI partners and deeper insights into human decision-making. Critical to achieving this goal, however, is coherently modeling human behavior at various skill levels. Chess is an ideal model system for conducting research into this kind of human-AI alignment, with its rich history as a pivotal testbed for AI research, mature superhuman AI systems like AlphaZero, and precise measurements of skill via chess rating systems. Previous work in modeling human decision-making in chess uses completely independent models to capture human style at different skill levels, meaning they lack coherence in their ability to adapt to the full spectrum of human improvement and are ultimately limited in their effectiveness as AI partners and teaching tools. In this work, we propose a unified modeling approach for human-AI alignment in chess that coherently captures human style across different skill levels and directly captures how people improve. Recognizing the complex, non-linear nature of human learning, we introduce a skill-aware attention mechanism to dynamically integrate players' strengths with encoded chess positions, enabling our model to be sensitive to evolving player skill. Our experimental results demonstrate that this unified framework significantly enhances the alignment between AI and human players across a diverse range of expertise levels, paving the way for deeper insights into human decision-making and AI-guided teaching tools.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Toward Modeling Player-Specific Chess Behaviors
cs.AI 2026-05 unverdicted novelty 6.0

Champion-specific embeddings and limited MCTS in Maia-2 reduce average Jensen-Shannon divergence to 16 historical chess champions' move distributions in a new latent-space metric, even as standard move accuracy falls.
ChessMimic: Per-Rating Transformer Models for Human Move, Clock, and Outcome Prediction in Online Blitz Chess
cs.LG 2026-06 unverdicted novelty 5.0

Per-100-Elo-band transformers outperform Maia-2 in move prediction accuracy across all bands and reach 0.78 AUC on outcome prediction using held-out Lichess data.