ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection

Andreas Nautsch; Hector Delgado; Junichi Yamagishi; Kong Aik Lee; Massimiliano Todisco; Md Sahidullah; Nicholas Evans; Tomi Kinnunen; Ville Vestman; Xin Wang

arxiv: 1904.05441 · v2 · pith:EW3UZQQUnew · submitted 2019-04-09 · eess.AS · cs.CR · cs.SD

classification eess.AS cs.CR cs.SD

keywords spoofing asvspoof audio countermeasures detection fake access attacks

ASVspoof, now in its third edition, is a series of community-led challenges which promote the development of countermeasures to protect automatic speaker verification (ASV) from the threat of spoofing. Advances in the 2019 edition include: (i) a consideration of both logical access (LA) and physical access (PA) scenarios and the three major forms of spoofing attack, namely synthetic, converted and replayed speech; (ii) spoofing attacks generated with state-of-the-art neural acoustic and waveform models; (iii) an improved, controlled simulation of replay attacks; (iv) use of the tandem detection cost function (t-DCF) that reflects the impact of both spoofing and countermeasures upon ASV reliability. Even if ASV remains the core focus, in retaining the equal error rate (EER) as a secondary metric, ASVspoof also embraces the growing importance of fake audio detection. ASVspoof 2019 attracted the participation of 63 research teams, with more than half of these reporting systems that improve upon the performance of two baseline spoofing countermeasures. This paper describes the 2019 database, protocols and challenge results. It also outlines major findings which demonstrate the real progress made in protecting against the threat of spoofing and fake audio.

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MixFake: Benchmarking and Enhancing Audio Deepfake Detection in Diverse Real-world Mixed Audio
cs.SD 2026-05 unverdicted novelty 7.0

MixFake is a new benchmark for mixed-authenticity audio; the accompanying multi-stream prompt tuning method achieves a 0.95% foreground EER and a 7.72% absolute gain in complex-background deepfake detection.

Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors
eess.AS 2019-07 unverdicted novelty 6.0

Digit-specific HMM i-vectors with uncertainty normalization reach 1.52% male and 1.77% female EER on RSR2015 part III using only that corpus and simple cosine scoring.

Gender Fairness in Audio Deepfake Detection: Performance and Disparity Analysis
cs.SD 2026-03 unverdicted novelty 5.0

Fairness metrics uncover gender disparities in audio deepfake detection error distributions that standard Equal Error Rate metrics obscure.

Audio Deepfake Detection at the First Greeting: "Hi!"
eess.AS 2026-01 unverdicted novelty 5.0

S-MGAA adds pixel-channel enhancement and frequency compensation modules to improve audio deepfake detection on very short, degraded speech inputs.

EnvTriCascade: An Environment-Aware Tri-Stage Cascaded Framework for ESDD2 2026 Challenge
cs.SD 2026-05 unverdicted novelty 4.0

EnvTriCascade is a tri-stage cascaded framework using mix-consistency detection followed by dual SSL-based five-class classifiers with cross-branch attention and RawBoost augmentation, achieving 0.8266 Macro-F1 on the...