SpectCount fine-tunes LALMs using on-the-fly synthetic signals to fix identified spectrotemporal weaknesses and boost performance on unseen auditory benchmarks.
Each signal $x_j(t)$ consists of $N$ superposed pulses ($N \sim \mathcal{U}\{1, N_{\max}\}$), mapped to a textual count label $y_j$.
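The sampling step described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' generator: the function name `make_counting_example`, the sample rate, the pulse shape (a Hann-windowed sine burst), the frequency range, and the label format (the count as a decimal string) are all assumptions that do not come from the source. Only the draw of $N$ from a discrete uniform distribution on $\{1, \dots, N_{\max}\}$, the superposition of $N$ pulses, and the mapping to a textual count label are taken from the excerpt.

```python
import math
import random


def make_counting_example(n_max=5, sample_rate=16000, duration=1.0,
                          pulse_len=0.05, rng=None):
    """Return (signal, label) for one hypothetical synthetic counting example.

    signal: list of floats of length sample_rate * duration
    label:  the pulse count as text, e.g. "3" (label format is an assumption)
    """
    rng = rng or random.Random()
    n_samples = int(sample_rate * duration)
    pulse_samples = int(sample_rate * pulse_len)

    # N ~ U{1, N_max}: randint is inclusive on both ends.
    n = rng.randint(1, n_max)

    signal = [0.0] * n_samples
    for _ in range(n):
        # Assumed pulse placement: uniform start so the pulse fits in the signal.
        start = rng.randint(0, n_samples - pulse_samples)
        # Assumed pulse content: a sine burst at a random frequency.
        freq = rng.uniform(200.0, 4000.0)
        for k in range(pulse_samples):
            # Hann window to avoid clicks at the pulse edges.
            window = 0.5 * (1.0 - math.cos(2.0 * math.pi * k / (pulse_samples - 1)))
            # Superposition: pulses are summed into the same buffer.
            signal[start + k] += window * math.sin(2.0 * math.pi * freq * k / sample_rate)

    return signal, str(n)


if __name__ == "__main__":
    x, y = make_counting_example(n_max=5, rng=random.Random(0))
    print(len(x), y)
```

Because each call draws a fresh count and fresh pulse parameters, examples can be generated on the fly during fine-tuning rather than stored. Note that in this sketch pulses may overlap in time; whether the original method allows that is not stated in the excerpt.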
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: eess.AS (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
SpectCount: Spectrotemporal Counting via Synthetic Signals Improves Large Audio Language Models