EchoChain benchmark shows no evaluated real-time voice model exceeds 50% success on state updates after mid-speech interruptions, with a 40.2% failure reduction in non-interrupted controls.
DUAL-STREAM REASONING - Test Memory, Integration & Reasoning Continuation: DSR CORE OBJECTIVE: Test if the model can
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions