pith. sign in

arxiv: 1708.05207 · v3 · pith:5LWXJPTHnew · submitted 2017-08-17 · 💻 cs.CR · cs.LG· stat.ML

Learning Universal Adversarial Perturbations with Generative Models

classification 💻 cs.CR cs.LGstat.ML
keywords adversarialuniversalclassifierdatasetgenerativeinputknownmisclassification
0
0 comments X
read the original abstract

Neural networks are known to be vulnerable to adversarial examples, inputs that have been intentionally perturbed to remain visually similar to the source input, but cause a misclassification. It was recently shown that given a dataset and classifier, there exists so called universal adversarial perturbations, a single perturbation that causes a misclassification when applied to any input. In this work, we introduce universal adversarial networks, a generative network that is capable of fooling a target classifier when it's generated output is added to a clean sample from a dataset. We show that this technique improves on known universal adversarial attacks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.