Title resolution pending

URL https://doi · 2012 · DOI 10.1016/j.csl

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox

cs.SD · 2026-05-26 · unverdicted · novelty 7.0

Audio LLMs fail to use paralinguistic audio information and default to transcript content; a new adversarial benchmark plus PCLM and DPO training raise accuracy on VoxParadox from 17.4% to 65.2%.

EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions

cs.CL · 2026-04-08 · unverdicted · novelty 7.0

EchoChain benchmark shows no evaluated real-time voice model exceeds 50% success on state updates after mid-speech interruptions, with a 40.2% failure reduction in non-interrupted controls.

Layer-wise Probing of wav2vec 2.0 and Whisper for Consonant Cluster Reduction in African American English

cs.CL · 2026-06-22 · unverdicted · novelty 6.0

Layer-wise probing of wav2vec2-base and Whisper-small shows both models distinguish reduced vs. canonical consonant clusters in AAE with high accuracy and retain cues to underlying stops, encoding CCR as gradient variation.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox cs.SD · 2026-05-26 · unverdicted · none · ref 3
Audio LLMs fail to use paralinguistic audio information and default to transcript content; a new adversarial benchmark plus PCLM and DPO training raise accuracy on VoxParadox from 17.4% to 65.2%.
EchoChain: A Full-Duplex Benchmark for State-Update Reasoning Under Interruptions cs.CL · 2026-04-08 · unverdicted · none · ref 1
EchoChain benchmark shows no evaluated real-time voice model exceeds 50% success on state updates after mid-speech interruptions, with a 40.2% failure reduction in non-interrupted controls.
Layer-wise Probing of wav2vec 2.0 and Whisper for Consonant Cluster Reduction in African American English cs.CL · 2026-06-22 · unverdicted · none · ref 30
Layer-wise probing of wav2vec2-base and Whisper-small shows both models distinguish reduced vs. canonical consonant clusters in AAE with high accuracy and retain cues to underlying stops, encoding CCR as gradient variation.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer