Each signalx j(t)consists of Nsuperposed pulses (N∼ U {1, N max}), mapped to a textual count labely j

SpectCount SpectCount synthesizes training dataD={(x j(t), yj)}M j=1, generated on-the-fly, where the model learns to count pulses representing fine-grained acoustic events scattered across the time–frequency space, requiring detailed spect · 2073 · arXiv 8374.4766

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

SpectCount: Spectrotemporal Counting via Synthetic Signals Improves Large Audio Language Models

eess.AS · 2026-06-05 · unverdicted · novelty 6.0

SpectCount fine-tunes LALMs using on-the-fly synthetic signals to fix identified spectrotemporal weaknesses and boost performance on unseen auditory benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

SpectCount: Spectrotemporal Counting via Synthetic Signals Improves Large Audio Language Models eess.AS · 2026-06-05 · unverdicted · none · ref 2
SpectCount fine-tunes LALMs using on-the-fly synthetic signals to fix identified spectrotemporal weaknesses and boost performance on unseen auditory benchmarks.

Each signalx j(t)consists of Nsuperposed pulses (N∼ U {1, N max}), mapped to a textual count labely j

fields

years

verdicts

representative citing papers

citing papers explorer