Fixing Data Augmentation to Improve Adversarial Robustness

Dan A. Calian; Florian Stimberg; Olivia Wiles; Sven Gowal; Sylvestre-Alvise Rebuffi; Timothy Mann

arxiv: 2103.01946 · v2 · pith:J6T5W2CHnew · submitted 2021-03-02 · 💻 cs.CV · cs.LG

Fixing Data Augmentation to Improve Adversarial Robustness

Sylvestre-Alvise Rebuffi , Sven Gowal , Dan A. Calian , Florian Stimberg , Olivia Wiles , Timothy Mann This is my paper

classification 💻 cs.CV cs.LG

keywords robustaccuracydataadversarialepsilonsizetrainingaugmentation

0 comments

read the original abstract

Adversarial training suffers from robust overfitting, a phenomenon where the robust test accuracy starts to decrease during training. In this paper, we focus on both heuristics-driven and data-driven augmentations as a means to reduce robust overfitting. First, we demonstrate that, contrary to previous findings, when combined with model weight averaging, data augmentation can significantly boost robust accuracy. Second, we explore how state-of-the-art generative models can be leveraged to artificially increase the size of the training set and further improve adversarial robustness. Finally, we evaluate our approach on CIFAR-10 against $\ell_\infty$ and $\ell_2$ norm-bounded perturbations of size $\epsilon = 8/255$ and $\epsilon = 128/255$, respectively. We show large absolute improvements of +7.06% and +5.88% in robust accuracy compared to previous state-of-the-art methods. In particular, against $\ell_\infty$ norm-bounded perturbations of size $\epsilon = 8/255$, our model reaches 64.20% robust accuracy without using any external data, beating most prior works that use external data.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

TARO: Temporal Adversarial Rectification Optimization Using Diffusion Models as Purifiers
cs.LG 2026-05 unverdicted novelty 7.0

TARO builds a temporally guided score prior from high-noise and low-noise diffusion views to purify adversarial examples more robustly than uniform timestep methods.
Compression as an Adversarial Amplifier Through Decision Space Reduction
cs.CV 2026-04 unverdicted novelty 6.0

Compression acts as an adversarial amplifier by reducing the decision space of image classifiers, making attacks in compressed representations substantially more effective than pixel-space attacks under the same pertu...
Sample-wise Adaptive Weighting for Transfer Consistency in Adversarial Distillation
cs.CV 2025-12 conditional novelty 6.0

SAAD adaptively weights adversarial training samples by their transferability to the teacher, yielding higher AutoAttack robustness than prior distillation methods on CIFAR and Tiny-ImageNet without extra compute.
Unsolved Problems in ML Safety
cs.LG 2021-09 accept novelty 6.0

The paper presents a roadmap that identifies four unsolved problems in ML safety: robustness against hazards, monitoring for hazards, alignment of model goals with human intent, and systemic safety.