Audio-Maestro: Enhancing large audio-language models with tool-augmented reasoning

2025 · arXiv 2510.11454

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

SpectCount: Spectrotemporal Counting via Synthetic Signals Improves Large Audio Language Models

eess.AS · 2026-06-05 · unverdicted · novelty 6.0

SpectCount fine-tunes LALMs using on-the-fly synthetic signals to fix identified spectrotemporal weaknesses and boost performance on unseen auditory benchmarks.

Audio-Mind: An Auditable Agentic Framework for Audio Understanding

eess.AS · 2026-05-27 · unverdicted · novelty 4.0

Audio-Mind introduces a conditional, auditable agentic framework for audio understanding that preserves frontend judgment and acquires bounded external evidence only when needed, reporting 80.4% on MMAR and 82.8% on MSU-Bench.

citing papers explorer


SpectCount: Spectrotemporal Counting via Synthetic Signals Improves Large Audio Language Models

eess.AS · 2026-06-05 · unverdicted · none · ref 25

SpectCount fine-tunes LALMs using on-the-fly synthetic signals to fix identified spectrotemporal weaknesses and boost performance on unseen auditory benchmarks.

Audio-Mind: An Auditable Agentic Framework for Audio Understanding

eess.AS · 2026-05-27 · unverdicted · none · ref 21

Audio-Mind introduces a conditional, auditable agentic framework for audio understanding that preserves frontend judgment and acquires bounded external evidence only when needed, reporting 80.4% on MMAR and 82.8% on MSU-Bench.
