When audio-llms don’t listen: A cross-linguistic study of modality arbitration,

· 2026 · arXiv 2602.11488

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Who Wins the Conflict? Mechanistic Interpretability of Text Bias in Audio LLMs

cs.SD · 2026-06-17 · unverdicted · novelty 7.0

Mechanistic tracing shows text suppresses but does not erase audio representations in late layers of Audio LLMs; back-patching reduces text dominance.

Beyond Text Following: Repairable Arbitration Reversals in Audio-Language Models

cs.SD · 2026-06-03 · unverdicted · novelty 6.0

ALMs encode audio evidence but override it with text in conflicts; GACL interpolates joint and same-audio scores to repair reversals, gaining 17.8 nAUC points under a 5pp faithfulness budget.

CAAD: Contrastive Audio-Aware Distillation for Efficient Speech Language Models

eess.AS · 2026-06-22 · unverdicted · novelty 5.0

CAAD internalizes contrastive audio-aware decoding into student SLM weights via synchronized teacher-forcing, delivering an 8% relative gain over standard knowledge distillation on Dynamic-SUPERB while reducing linguistic bias on MCR-BENCH.

citing papers explorer

Showing 1 of 1 citing paper after filters.

CAAD: Contrastive Audio-Aware Distillation for Efficient Speech Language Models eess.AS · 2026-06-22 · unverdicted · none · ref 31
CAAD internalizes contrastive audio-aware decoding into student SLM weights via synchronized teacher-forcing, delivering an 8% relative gain over standard knowledge distillation on Dynamic-SUPERB while reducing linguistic bias on MCR-BENCH.

When audio-llms don’t listen: A cross-linguistic study of modality arbitration,

fields

years

verdicts

representative citing papers

citing papers explorer