TinyGiantALM: A Compact Audio-Language Model for Intent-Aware Reasoning under Resource Constraints

· 2026 · cs.SD · arXiv 2606.08425

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Current advancements in Audio Reasoning rely on massive Large Audio-Language Models (LALMs), hindering deployment in resource-constrained environments. We introduce TinyGiantALM, a compact 1.5B efficiency-oriented alternative. Instead of brute-force scaling, we propose an Instruction-Aware Feature Refinement framework using a Query-guided Projector and Semantic Gating to filter acoustic signals based on user intent. On the MMAR benchmark, TinyGiantALM achieves 46.4% zero-shot accuracy, significantly outperforming 7B-13B baselines. While a reasoning gap in logical narrative remains versus 30B+ models and certain trade-offs exist in overly dense or spatial scenes, our approach notably surpasses models up to 8x larger in disentangling mixed-modality environments. These findings demonstrate that architectural precision offers a tangible pathway to secure robust perception capabilities on edge-friendly scales.

representative citing papers

TinyGiantALM: A Compact Audio-Language Model for Intent-Aware Reasoning under Resource Constraints

cs.SD · 2026-06-07 · unverdicted · novelty 4.0

TinyGiantALM, a compact 1.5B audio-language model with instruction-aware refinement, achieves 46.4% zero-shot accuracy on MMAR and outperforms models up to 8x larger in mixed-modality tasks.

citing papers explorer

Showing 1 of 1 citing paper.

TinyGiantALM: A Compact Audio-Language Model for Intent-Aware Reasoning under Resource Constraints cs.SD · 2026-06-07 · unverdicted · none · ref 3 · internal anchor
TinyGiantALM, a compact 1.5B audio-language model with instruction-aware refinement, achieves 46.4% zero-shot accuracy on MMAR and outperforms models up to 8x larger in mixed-modality tasks.

TinyGiantALM: A Compact Audio-Language Model for Intent-Aware Reasoning under Resource Constraints

fields

years

verdicts

representative citing papers

citing papers explorer