Uncovering the limits of adversarial training against norm-bounded adversarial examples

· 2010 · arXiv 2010.03593

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

representative citing papers

Adversarial Robustness in One-Stage Learning-to-Defer

stat.ML · 2025-10-13 · unverdicted · novelty 7.0

Develops the first adversarial robustness framework for one-stage learning-to-defer, including cost-sensitive surrogate losses and theoretical consistency guarantees for classification and regression.

Towards Generalized Certified Robustness with Multi-Norm Training

cs.LG · 2024-10-03 · unverdicted · novelty 7.0

CURE is the first multi-norm certified training method that improves union robustness across l_p norms and unseen perturbations on MNIST, CIFAR-10 and TinyImagenet.

Detecting Adversarial Data via Provable Adversarial Noise Amplification

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

A provable adversarial noise amplification theorem under sufficient conditions enables a custom-trained detector that identifies adversarial examples at inference time using enhanced layer-wise noise signals.

Sample-wise Adaptive Weighting for Transfer Consistency in Adversarial Distillation

cs.CV · 2025-12-11 · conditional · novelty 6.0

SAAD adaptively weights adversarial training samples by their transferability to the teacher, yielding higher AutoAttack robustness than prior distillation methods on CIFAR and Tiny-ImageNet without extra compute.

Nearest Neighbor Projection Removal Adversarial Training

cs.CV · 2025-09-09 · unverdicted · novelty 6.0

Nearest Neighbor Projection Removal Adversarial Training projects out inter-class dependencies in feature space during training, claims to reduce the Lipschitz constant and Rademacher complexity, and reports competitive robust accuracy on CIFAR-10, CIFAR-100, SVHN, and TinyImagenet.

Improving Clean Accuracy via a Tangent-Space Perspective on Adversarial Training

cs.LG · 2024-08-27 · unverdicted · novelty 6.0

TART improves clean accuracy in adversarial training by modulating perturbation bounds according to the tangential component of adversarial examples.

citing papers explorer

Showing 6 of 6 citing papers.

Adversarial Robustness in One-Stage Learning-to-Defer stat.ML · 2025-10-13 · unverdicted · none · ref 5
Develops the first adversarial robustness framework for one-stage learning-to-defer, including cost-sensitive surrogate losses and theoretical consistency guarantees for classification and regression.
Towards Generalized Certified Robustness with Multi-Norm Training cs.LG · 2024-10-03 · unverdicted · none · ref 11
CURE is the first multi-norm certified training method that improves union robustness across l_p norms and unseen perturbations on MNIST, CIFAR-10 and TinyImagenet.
Detecting Adversarial Data via Provable Adversarial Noise Amplification cs.LG · 2026-05-04 · unverdicted · none · ref 12
A provable adversarial noise amplification theorem under sufficient conditions enables a custom-trained detector that identifies adversarial examples at inference time using enhanced layer-wise noise signals.
Sample-wise Adaptive Weighting for Transfer Consistency in Adversarial Distillation cs.CV · 2025-12-11 · conditional · none · ref 27
SAAD adaptively weights adversarial training samples by their transferability to the teacher, yielding higher AutoAttack robustness than prior distillation methods on CIFAR and Tiny-ImageNet without extra compute.
Nearest Neighbor Projection Removal Adversarial Training cs.CV · 2025-09-09 · unverdicted · none · ref 15
Nearest Neighbor Projection Removal Adversarial Training projects out inter-class dependencies in feature space during training, claims to reduce the Lipschitz constant and Rademacher complexity, and reports competitive robust accuracy on CIFAR-10, CIFAR-100, SVHN, and TinyImagenet.
Improving Clean Accuracy via a Tangent-Space Perspective on Adversarial Training cs.LG · 2024-08-27 · unverdicted · none · ref 19
TART improves clean accuracy in adversarial training by modulating perturbation bounds according to the tangential component of adversarial examples.

Uncovering the limits of adversarial training against norm-bounded adversarial examples

fields

years

verdicts

representative citing papers

citing papers explorer