Query-Efficient Black-box Adversarial Examples (superceded)

Andrew Ilyas; Anish Athalye; Jessy Lin; Logan Engstrom

arxiv: 1712.07113 · v2 · pith:DNB3MLAFnew · submitted 2017-12-19 · 💻 cs.CV · cs.LG· stat.ML

Query-Efficient Black-box Adversarial Examples (superceded)

Andrew Ilyas , Logan Engstrom , Anish Athalye , Jessy Lin This is my paper

classification 💻 cs.CV cs.LGstat.ML

keywords adversarialblack-boxmethodsaccessattacksexampleslimitedperform

0 comments

read the original abstract

Note that this paper is superceded by "Black-Box Adversarial Attacks with Limited Queries and Information." Current neural network-based image classifiers are susceptible to adversarial examples, even in the black-box setting, where the attacker is limited to query access without access to gradients. Previous methods --- substitute networks and coordinate-based finite-difference methods --- are either unreliable or query-inefficient, making these methods impractical for certain problems. We introduce a new method for reliably generating adversarial examples under more restricted, practical black-box threat models. First, we apply natural evolution strategies to perform black-box attacks using two to three orders of magnitude fewer queries than previous methods. Second, we introduce a new algorithm to perform targeted adversarial attacks in the partial-information setting, where the attacker only has access to a limited number of target classes. Using these techniques, we successfully perform the first targeted adversarial attack against a commercially deployed machine learning system, the Google Cloud Vision API, in the partial information setting.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Hiding Faces in Plain Sight: Disrupting AI Face Synthesis with Adversarial Perturbations
cs.CV 2019-06 unverdicted novelty 6.0

Adversarial perturbations disrupt DNN-based face detectors under white-box, gray-box, and black-box settings to sabotage training data for AI face synthesis.