The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games

Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, Yi Wu · 2022

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

When Outcome Looks Right But Discipline Fails: Trace-Based Evaluation Under Hidden Competitor State

cs.AI · 2026-05-18 · unverdicted · novelty 6.0

The paper introduces discipline stability, a trace-based evaluation paradigm for checking if RL agents maintain behavioral discipline like rule-based competitors in hidden-state competitive settings such as hotel pricing and bidding.

citing papers explorer

Showing 1 of 1 citing paper.

When Outcome Looks Right But Discipline Fails: Trace-Based Evaluation Under Hidden Competitor State

cs.AI · 2026-05-18 · unverdicted · none · ref 18
