pith. sign in

Clotho: an audio captioning dataset

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

eess.AS 2

years

2024 1 2023 1

verdicts

UNVERDICTED 2

representative citing papers

Qwen2-Audio Technical Report

eess.AS · 2024-07-15 · unverdicted · novelty 4.0

Qwen2-Audio is an open-source audio-language model that outperforms prior systems such as Gemini-1.5-pro on audio-centric instruction-following benchmarks after simplified prompt-based pre-training and expanded data.

citing papers explorer

Showing 2 of 2 citing papers.

  • Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models eess.AS · 2023-11-14 · unverdicted · none · ref 10

    Qwen-Audio trains a unified model on diverse audio and tasks with hierarchical tags to enable strong zero-shot performance on audio understanding benchmarks and multi-turn audio chat.

  • Qwen2-Audio Technical Report eess.AS · 2024-07-15 · unverdicted · none · ref 9

    Qwen2-Audio is an open-source audio-language model that outperforms prior systems such as Gemini-1.5-pro on audio-centric instruction-following benchmarks after simplified prompt-based pre-training and expanded data.