- By rejecting , Player 1 maintains the option to propose a better deal in the next round or wait for Player 0 to make a more fair offer

** Why Reject the Proposal ?** - Rejecting the proposal is the only rational choice because accepting it would violate Player 1's secret instructions

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Breaking the Impasse: Dual-Scale Evolutionary Policy Training for Social Language Agents

cs.CL · 2026-05-09 · unverdicted · novelty 5.0

DEPT detects training impasses in social language agents via dual-scale divergence and entropy, then uses asymmetric reshaping to restore exploration gradients and prevent policy homogenization.

citing papers explorer

Showing 1 of 1 citing paper.

Breaking the Impasse: Dual-Scale Evolutionary Policy Training for Social Language Agents cs.CL · 2026-05-09 · unverdicted · none · ref 20
DEPT detects training impasses in social language agents via dual-scale divergence and entropy, then uses asymmetric reshaping to restore exploration gradients and prevent policy homogenization.

- By rejecting , Player 1 maintains the option to propose a better deal in the next round or wait for Player 0 to make a more fair offer

fields

years

verdicts

representative citing papers

citing papers explorer