Identifying Classes Susceptible to Adversarial Attacks

Hridesh Rajan; Md Johirul Islam; Rangeet Pan; Shibbir Ahmed

arxiv: 1905.13284 · v1 · pith:24U6PITTnew · submitted 2019-05-30 · 💻 cs.LG · cs.CR· stat.ML

Identifying Classes Susceptible to Adversarial Attacks

Rangeet Pan , Md Johirul Islam , Shibbir Ahmed , Hridesh Rajan This is my paper

classification 💻 cs.LG cs.CRstat.ML

keywords classesadversarialsusceptibleidentifymodelattacksdistance-basedmnist

0 comments

read the original abstract

Despite numerous attempts to defend deep learning based image classifiers, they remain susceptible to the adversarial attacks. This paper proposes a technique to identify susceptible classes, those classes that are more easily subverted. To identify the susceptible classes we use distance-based measures and apply them on a trained model. Based on the distance among original classes, we create mapping among original classes and adversarial classes that helps to reduce the randomness of a model to a significant amount in an adversarial setting. We analyze the high dimensional geometry among the feature classes and identify the k most susceptible target classes in an adversarial attack. We conduct experiments using MNIST, Fashion MNIST, CIFAR-10 (ImageNet and ResNet-32) datasets. Finally, we evaluate our techniques in order to determine which distance-based measure works best and how the randomness of a model changes with perturbation.

This paper has not been read by Pith yet.

Identifying Classes Susceptible to Adversarial Attacks

discussion (0)