The only aggregation rule satisfying same-scale normalization, recursive consistency, and marginal Elo-strength consistency converts ratings to strengths, takes their weighted arithmetic mean, and converts back.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
citation-role summary
method 1
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
method 1polarities
use method 1representative citing papers
CivBench trains models on turn-level states in Civilization V to predict victory probabilities, providing a progress-based evaluation of LLM strategic capabilities across 307 games with 7 models.
citing papers explorer
-
Aggregating Elo Ratings: An Axiomatization
The only aggregation rule satisfying same-scale normalization, recursive consistency, and marginal Elo-strength consistency converts ratings to strengths, takes their weighted arithmetic mean, and converts back.
-
CivBench: Progress-Based Evaluation for LLMs' Strategic Decision-Making in Civilization V
CivBench trains models on turn-level states in Civilization V to predict victory probabilities, providing a progress-based evaluation of LLM strategic capabilities across 307 games with 7 models.