DUAL-STREAM REASONING - Test Memory, Integration & Reasoning Continuation: DSR CORE OBJECTIVE: Test if the model can

CONVERSATION COHERENCE: Ensure each turn serves overall conversation purpose while naturally testing target capability

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions

cs.CL · 2026-04-08 · unverdicted · novelty 7.0

EchoChain benchmark shows no evaluated real-time voice model exceeds 50% success on state updates after mid-speech interruptions, with a 40.2% failure reduction in non-interrupted controls.

citing papers explorer

Showing 1 of 1 citing paper.

EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions cs.CL · 2026-04-08 · unverdicted · none · ref 5
EchoChain benchmark shows no evaluated real-time voice model exceeds 50% success on state updates after mid-speech interruptions, with a 40.2% failure reduction in non-interrupted controls.

DUAL-STREAM REASONING - Test Memory, Integration & Reasoning Continuation: DSR CORE OBJECTIVE: Test if the model can

fields

years

verdicts

representative citing papers

citing papers explorer