Audio Flamingo 2: An audio-language model with long-audio understanding and expert reasoning abilities

16 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 3

citation-polarity summary

years

2026: 15 · 2025: 1

verdicts

UNVERDICTED 16

roles

background 3

polarities

background 3

representative citing papers

Continuous Audio Thinking for Large Audio Language Models

cs.CL · 2026-06-05 · unverdicted · novelty 6.0

CoAT adds a continuous latent thinking space to LALMs via expert distillation to retain acoustic information, yielding gains on audio reasoning, understanding, music, emotion, and transcription benchmarks across three models.

Audio Interaction Model

cs.SD · 2026-06-03 · unverdicted · novelty 6.0

Audio-Interaction unifies offline and online audio tasks into one streaming model via the SoundFlow framework and a new 2.6M-item streaming corpus, enabling real-time instruction following and proactive responses.

Step-Audio 2 Technical Report

cs.CL · 2025-07-22 · unverdicted · novelty 6.0

Step-Audio 2 integrates a latent audio encoder, reasoning-centric reinforcement learning, and discrete audio token generation into language modeling to deliver state-of-the-art performance on audio understanding and conversational benchmarks.

citing papers explorer

Showing 16 of 16 citing papers after filters.