Terms-Bench is a diagnostic benchmark for LLM negotiation agents that reveals agent-specific strategic failures beyond simple deal rates by using hidden-type simulators as oracles.
Co nt in ue n e g o t i a t i n g n orm al ly ; Reject c o n f i d e n t l y if their final offer is u n a c c e p t a b l e
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.GT 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
TERMS-Bench: Diagnosing LLM Negotiation Agents Beyond Deal Rate
Terms-Bench is a diagnostic benchmark for LLM negotiation agents that reveals agent-specific strategic failures beyond simple deal rates by using hidden-type simulators as oracles.