Mellow: a small audio language model for reasoning

· 2025 · arXiv 2503.08540

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

TinyMU: A Compact Audio-Language Model for Music Understanding

cs.SD · 2026-04-17 · unverdicted · novelty 5.0

TinyMU is a 229M-parameter compact music understanding model that achieves 82% of state-of-the-art large audio-language model performance on the MuChoMusic benchmark while being 35 times smaller.

Audio-Cogito: Towards Deep Audio Reasoning in Large Audio Language Models

eess.AS · 2026-04-14 · unverdicted · novelty 5.0

Audio-Cogito is an open-source LALM using Cogito-pipe data curation and self-distillation to achieve leading open-source performance on audio reasoning benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

TinyMU: A Compact Audio-Language Model for Music Understanding cs.SD · 2026-04-17 · unverdicted · none · ref 36
TinyMU is a 229M-parameter compact music understanding model that achieves 82% of state-of-the-art large audio-language model performance on the MuChoMusic benchmark while being 35 times smaller.
Audio-Cogito: Towards Deep Audio Reasoning in Large Audio Language Models eess.AS · 2026-04-14 · unverdicted · none · ref 51
Audio-Cogito is an open-source LALM using Cogito-pipe data curation and self-distillation to achieve leading open-source performance on audio reasoning benchmarks.

Mellow: a small audio language model for reasoning

fields

years

verdicts

representative citing papers

citing papers explorer