We also assess the ability of our models to recall their pretraining dataset with a membership inference attack.

Evaluation on downstream tasks. In this section we benchmark our audio encoders with multiple downstream tasks: automatic speech recognition, voice activity detection, music detection, speaker recognition · 2021

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Data Selection Effects on Self-Supervised Learning of Audio Representations for French Audiovisual Broadcasts

eess.AS · 2026-04-10 · unverdicted · novelty 5.0

Pretraining audio SSL encoders on diverse French broadcast content rather than clean speech yields better downstream performance on ASR, music detection, and speaker recognition, with deduplication mitigating memorization.
