RealUserSim grounds LLM simulators in 7,275 executable profiles from real conversations, raising behavioral match rates from 24.2% to 45.3% and revealing agent failures hidden by cooperative simulators.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics , pages=
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.HC 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
RealUserSim: Bridging the Reality Gap in Agent Benchmarking via Grounded User Simulation
RealUserSim grounds LLM simulators in 7,275 executable profiles from real conversations, raising behavioral match rates from 24.2% to 45.3% and revealing agent failures hidden by cooperative simulators.