Can large audio-language models truly hear? Tackling hallucinations with multi-task assessment and stepwise audio reasoning. arXiv preprint arXiv:2410.16130

2025 · arXiv 2410.16130

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Adaptive Perturbation Selection for Contrastive Audio Decoding

cs.SD · 2026-06-30 · unverdicted · novelty 5.0

Adaptive selection among a library of audio perturbations in contrastive decoding produces task-dependent accuracy gains, including +4.3% on an existence task via a hidden-state selector.

On The Landscape of Spoken Language Models: A Comprehensive Survey

cs.CL · 2025-04-11 · unverdicted · novelty 3.0

A literature survey that organizes spoken language models by architecture, training, and evaluation choices and identifies key challenges and future directions.

citing papers explorer

Adaptive Perturbation Selection for Contrastive Audio Decoding · cs.SD · 2026-06-30 · unverdicted · none · ref 7
On The Landscape of Spoken Language Models: A Comprehensive Survey · cs.CL · 2025-04-11 · unverdicted · none · ref 27
