2506.11350 , archivePrefix =

Dinkel, Heinrich, Yan, Zhiyong, Wang, Tianzi, Wang, Yongqing, Sun, Xingwei, Niu, Yadong · 2025 · arXiv 2506.11350

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

MECAT: A Multi-Experts Constructed Benchmark for Fine-Grained Audio Understanding Tasks

eess.AS · 2025-07-31 · unverdicted · novelty 7.0

MECAT is a multi-expert benchmark for audio AI offering fine-grained captions and QA pairs generated via expert models and LLM reasoning, paired with the DATE metric that combines semantic similarity and cross-sample discriminability to favor detailed outputs.

Drift-Augmented Scoring: Text-Derived Noise Robustness for Zero-Shot Audio-Language Classification

cs.SD · 2026-06-03 · unverdicted · novelty 6.0

DAS adds a cached per-class bonus derived from noise-conditioned text prompts to cosine scores, improving accuracy by 2.60-5.75 points on UrbanSound8K and mAP by 1.50-1.74 points on FSD50K under urban noise.

Dasheng AudioGen: A Unified Model for Generating Coherent Audio Scenes from Text

cs.SD · 2026-05-27 · unverdicted · novelty 6.0

Dasheng AudioGen uses multi-view captions and a unified semantic-acoustic representation to enable end-to-end generation of mixed audio scenes from text descriptions.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Drift-Augmented Scoring: Text-Derived Noise Robustness for Zero-Shot Audio-Language Classification cs.SD · 2026-06-03 · unverdicted · none · ref 25
DAS adds a cached per-class bonus derived from noise-conditioned text prompts to cosine scores, improving accuracy by 2.60-5.75 points on UrbanSound8K and mAP by 1.50-1.74 points on FSD50K under urban noise.
Dasheng AudioGen: A Unified Model for Generating Coherent Audio Scenes from Text cs.SD · 2026-05-27 · unverdicted · none · ref 25
Dasheng AudioGen uses multi-view captions and a unified semantic-acoustic representation to enable end-to-end generation of mixed audio scenes from text descriptions.

2506.11350 , archivePrefix =

fields

years

verdicts

representative citing papers

citing papers explorer