Regularized last-iterate solvers select the maximum-entropy Nash equilibrium while regret-averaging methods select lower-entropy faces on zero-sum Nash polytopes, verified on analytic testbeds and a 180-game ensemble.
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.GT (1)
years: 2026 (1)
verdicts: ACCEPT (1)
representative citing papers
- Which Nash Equilibrium? Solver-Dependent Selection on Zero-Sum Nash Polytopes