pink_action

A Environment Implementation Details: The Unity scene contains two AgentController components, GridManager, GameManager, PythonBridge · 2022

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

cs.LG · 2026-04-04 · conditional · novelty 7.0

PPO in a new competitive game fails first due to five implementation bugs and then due to competitive overfitting, where self-play stays near 50% but generalization drops to 21.6%; mixing in 20% random opponents restores generalization to 77.1%.

citing papers explorer

Showing 1 of 1 citing paper.

Territory Paint Wars: Diagnosing and Mitigating Failure Modes in Competitive Multi-Agent PPO

cs.LG · 2026-04-04 · conditional · none · ref 5
