Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition

Jugal Kalita; Krishan Rajaratnam; Kunal Shah

arxiv: 1809.04397 · v1 · pith:FKMGNZMOnew · submitted 2018-09-11 · 💻 cs.SD · cs.CL · cs.CR · cs.LG · cs.NE · eess.AS


Krishan Rajaratnam, Kunal Shah, Jugal Kalita


keywords: speech, adversarial, attack, audio, examples, commands, detecting, model


An adversarial attack is an exploitative process in which minute alterations are made to natural inputs, causing the inputs to be misclassified by neural models. In the field of speech recognition, this has become an issue of increasing significance. Although adversarial attacks were originally introduced in computer vision, they have since infiltrated the realm of speech recognition. In 2017, a genetic attack was shown to be quite potent against the Speech Commands Model. Limited-vocabulary speech classifiers, such as the Speech Commands Model, are used in a variety of applications, particularly in telephony; as such, adversarial examples produced by this attack pose a major security threat. This paper explores various methods of detecting these adversarial examples with combinations of audio preprocessing. One particular combined defense incorporating compressions, speech coding, filtering, and audio panning was shown to be quite effective against the attack on the Speech Commands Model, detecting audio adversarial examples with 93.5% precision and 91.2% recall.
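
The abstract does not spell out the decision rule, so the following is only a minimal sketch of one common way preprocessing-based detection can work: flag an input as adversarial when the classifier's label changes after the audio is preprocessed, and combine several preprocessing methods by counting votes. Everything here is an illustrative assumption rather than the authors' implementation: `toy_classify`, `moving_average`, and `quantize` are hypothetical stand-ins for the Speech Commands Model, a filter, and a lossy compression step, and the vote threshold `min_votes` is an arbitrary parameter. The last function shows how precision and recall figures like those quoted above are computed from detector output.

```python
from typing import Callable, List, Sequence, Tuple

Audio = Sequence[float]
Classifier = Callable[[Audio], str]
Transform = Callable[[Audio], List[float]]


def toy_classify(audio: Audio) -> str:
    """Hypothetical stand-in for a speech classifier: label by the sign of the mean."""
    return "up" if sum(audio) / len(audio) > 0 else "down"


def moving_average(audio: Audio, width: int = 3) -> List[float]:
    """Crude low-pass filter (assumption): average each sample with its neighbours."""
    half = width // 2
    out = []
    for i in range(len(audio)):
        window = audio[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def quantize(audio: Audio, levels: int = 8) -> List[float]:
    """Crude lossy-compression stand-in (assumption): snap samples in [-1, 1] to a coarse grid."""
    step = 2.0 / levels
    return [round(x / step) * step for x in audio]


def flags_adversarial(audio: Audio, classify: Classifier, transform: Transform) -> bool:
    """Isolated detector: flag the input if preprocessing changes the predicted label."""
    return classify(audio) != classify(transform(audio))


def ensemble_flags_adversarial(
    audio: Audio,
    classify: Classifier,
    transforms: Sequence[Transform],
    min_votes: int,
) -> bool:
    """Ensemble detector: flag the input if at least min_votes isolated detectors flag it."""
    votes = sum(flags_adversarial(audio, classify, t) for t in transforms)
    return votes >= min_votes


def precision_recall(predicted: Sequence[bool], actual: Sequence[bool]) -> Tuple[float, float]:
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    tp = sum(1 for p, a in zip(predicted, actual) if p and a)
    fp = sum(1 for p, a in zip(predicted, actual) if p and not a)
    fn = sum(1 for p, a in zip(predicted, actual) if not p and a)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


if __name__ == "__main__":
    fragile = [0.05] * 8   # label depends on a tiny offset that quantization removes
    robust = [0.5] * 8     # label survives both preprocessing steps
    transforms = [moving_average, quantize]
    print(ensemble_flags_adversarial(fragile, toy_classify, transforms, min_votes=1))
    print(ensemble_flags_adversarial(robust, toy_classify, transforms, min_votes=1))
```

The intuition behind this kind of detector is that a small adversarial perturbation tends to be less stable under preprocessing than natural speech, so a label that flips after preprocessing is treated as a warning sign. Raising `min_votes` trades recall for precision, since more detectors must agree before an input is flagged.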
