pith. sign in

arxiv: 1805.09164 · v1 · pith:WSCZMWGCnew · submitted 2018-05-22 · 📡 eess.AS · cs.SD

A Study On Convolutional Neural Network Based End-To-End Replay Anti-Spoofing

classification 📡 eess.AS cs.SD
keywords dataend-to-endgeneralizationasvspoofconvolutionaldatasetdevelopmentevaluation
0
0 comments X
read the original abstract

The second Automatic Speaker Verification Spoofing and Countermeasures challenge (ASVspoof 2017) focused on "replay attack" detection. The best deep-learning systems to compete in ASVspoof 2017 used Convolutional Neural Networks (CNNs) as a feature extractor. In this paper, we study their performance in an end-to-end setting. We find that these architectures show poor generalization in the evaluation dataset, but find a compact architecture that shows good generalization on the development data. We demonstrate that for this dataset it is not easy to obtain a similar level of generalization on both the development and evaluation data. This leads to a variety of open questions about what the differences are in the data; why these are more evident in an end-to-end setting; and how these issues can be overcome by increasing the training data.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.