pith. sign in

arxiv: 1905.13284 · v1 · pith:24U6PITTnew · submitted 2019-05-30 · 💻 cs.LG · cs.CR· stat.ML

Identifying Classes Susceptible to Adversarial Attacks

classification 💻 cs.LG cs.CRstat.ML
keywords classesadversarialsusceptibleidentifymodelattacksdistance-basedmnist
0
0 comments X
read the original abstract

Despite numerous attempts to defend deep learning based image classifiers, they remain susceptible to the adversarial attacks. This paper proposes a technique to identify susceptible classes, those classes that are more easily subverted. To identify the susceptible classes we use distance-based measures and apply them on a trained model. Based on the distance among original classes, we create mapping among original classes and adversarial classes that helps to reduce the randomness of a model to a significant amount in an adversarial setting. We analyze the high dimensional geometry among the feature classes and identify the k most susceptible target classes in an adversarial attack. We conduct experiments using MNIST, Fashion MNIST, CIFAR-10 (ImageNet and ResNet-32) datasets. Finally, we evaluate our techniques in order to determine which distance-based measure works best and how the randomness of a model changes with perturbation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.