FSD50K: An open dataset of human-labeled sound events. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 30: 829–852

Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Xavier Serra · 2021

1 Pith paper cites this work. Polarity classification is still indexing.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Omni-DeepSearch: A Benchmark for Audio-Driven Omni-Modal Deep Search

cs.SD · 2026-05-09 · unverdicted · novelty 8.0

Omni-DeepSearch is a 640-sample benchmark for audio-driven omni-modal search where the best model reaches only 43.44% accuracy, exposing bottlenecks in audio inference, tool use, and cross-modal reasoning.

citing papers explorer

Omni-DeepSearch: A Benchmark for Audio-Driven Omni-Modal Deep Search

cs.SD · 2026-05-09 · unverdicted · none · ref 29
