Intermediate Level Adversarial Attack for Enhanced Transferability

Horace He; Isay Katsman; Pian Pawakapan; Qian Huang; Serge Belongie; Ser-Nam Lim; Zeqi Gu; Zhiqiu Lin

arxiv: 1811.08458 · v1 · pith:ZDSID4QGnew · submitted 2018-11-20 · cs.LG · cs.CV · stat.ML

Qian Huang, Zeqi Gu, Isay Katsman, Horace He, Pian Pawakapan, Zhiqiu Lin, Serge Belongie, Ser-Nam Lim

keywords: adversarial, model, examples, black-box, models, source, attack, fool

Neural networks are vulnerable to adversarial examples, malicious inputs crafted to fool trained models. Adversarial examples often exhibit black-box transfer, meaning that adversarial examples for one model can fool another model. However, adversarial examples may be overfit to exploit the particular architecture and feature representation of a source model, resulting in sub-optimal black-box transfer attacks to other target models. This leads us to introduce the Intermediate Level Attack (ILA), which attempts to fine-tune an existing adversarial example for greater black-box transferability by increasing its perturbation on a pre-specified layer of the source model. We show that our method can effectively achieve this goal and that we can decide a nearly-optimal layer of the source model to perturb without any knowledge of the target models.
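
The fine-tuning idea in the abstract can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the toy one-layer tanh "source model", the projection-style objective (pushing the new feature shift further along the direction of the existing example's feature shift), the signed-gradient update, the L-infinity budget `eps`, and the names `layer_features`, `projection_objective`, and `fine_tune_adversarial`.

```python
import numpy as np

def layer_features(W, x):
    """Toy stand-in for the source model's output at the chosen layer."""
    return np.tanh(W @ x)

def projection_objective(W, x, x_new, delta_ref):
    """Projection of the new feature shift onto the reference feature shift."""
    return float((layer_features(W, x_new) - layer_features(W, x)) @ delta_ref)

def fine_tune_adversarial(W, x, x_adv, eps=0.1, step=0.01, n_iter=50):
    """Fine-tune x_adv to enlarge its feature-space perturbation at one layer.

    Hypothetical helper: signed-gradient ascent on projection_objective,
    keeping the result inside the L-infinity ball of radius eps around x.
    """
    # Feature shift caused by the existing adversarial example (the reference).
    delta_ref = layer_features(W, x_adv) - layer_features(W, x)
    best = x_adv.copy()
    best_val = projection_objective(W, x, best, delta_ref)
    x_new = x_adv.copy()
    for _ in range(n_iter):
        h = np.tanh(W @ x_new)
        # Analytic gradient of the objective w.r.t. x_new for the tanh layer.
        grad = W.T @ ((1.0 - h ** 2) * delta_ref)
        x_new = x_new + step * np.sign(grad)
        # Project back into the allowed perturbation budget around x.
        x_new = x + np.clip(x_new - x, -eps, eps)
        val = projection_objective(W, x, x_new, delta_ref)
        if val > best_val:
            best, best_val = x_new.copy(), val
    return best

rng = np.random.default_rng(0)
W = rng.normal(size=(16, 8))                    # toy layer weights
x = rng.normal(size=8)                          # clean input
x_adv = x + 0.1 * np.sign(rng.normal(size=8))   # stand-in for an existing attack
x_tuned = fine_tune_adversarial(W, x, x_adv)
```

In a real setting the layer output would come from a trained network and the gradient from automatic differentiation; the sketch only shows the shape of the loop: start from an existing adversarial example, push its intermediate-layer perturbation further, and stay within the perturbation budget.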
