EchoChain benchmark shows no evaluated real-time voice model exceeds 50% success on state updates after mid-speech interruptions, with a 40.2% failure reduction in non-interrupted controls.
[message truncated for analysis]
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions
EchoChain benchmark shows no evaluated real-time voice model exceeds 50% success on state updates after mid-speech interruptions, with a 40.2% failure reduction in non-interrupted controls.