SOAR: Second-Order Adversarial Regularization

Amir-massoud Farahmand; Avery Ma; Fartash Faghri; Nicolas Papernot

arxiv: 2004.01832 · v2 · pith:J46NQOK5new · submitted 2020-04-04 · 💻 cs.LG · stat.ML

Avery Ma, Fartash Faghri, Nicolas Papernot, Amir-massoud Farahmand

classification 💻 cs.LG stat.ML

keywords adversarial, robustness, second-order, approach, networks, optimization, proposed, regularization

Adversarial training is a common approach to improving the robustness of deep neural networks against adversarial examples. In this work, we propose a novel regularization approach as an alternative. To derive the regularizer, we formulate the adversarial robustness problem under the robust optimization framework and approximate the loss function using a second-order Taylor series expansion. Our proposed second-order adversarial regularizer (SOAR) is an upper bound based on the Taylor approximation of the inner-max in the robust optimization objective. We empirically show that the proposed method significantly improves the robustness of networks against the $\ell_\infty$ and $\ell_2$ bounded perturbations generated using cross-entropy-based PGD on CIFAR-10 and SVHN.
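The two ingredients named in the abstract can be written out in standard notation. This is only a sketch of the general setup, not the paper's exact regularizer or bound: the symbols $\theta$ (network parameters), $\ell$ (loss), $\delta$ (perturbation), $\epsilon$ (perturbation budget), $p$ (norm, $p \in \{2, \infty\}$ here), and $\mathcal{D}$ (data distribution) are notation assumed for illustration.

```latex
% Robust optimization objective: minimize the expected worst-case loss
% over norm-bounded perturbations (the "inner-max").
\min_{\theta} \; \mathbb{E}_{(x,y)\sim\mathcal{D}}
  \Big[ \max_{\|\delta\|_p \le \epsilon} \ell(x+\delta,\, y;\, \theta) \Big]

% Second-order Taylor expansion of the loss around the clean input x,
% which replaces the loss inside the inner-max:
\ell(x+\delta,\, y;\, \theta) \;\approx\;
  \ell(x, y; \theta)
  + \nabla_x \ell(x, y; \theta)^{\top} \delta
  + \tfrac{1}{2}\, \delta^{\top} \nabla_x^{2} \ell(x, y; \theta)\, \delta
```

Per the abstract, SOAR is an upper bound on the maximum of this quadratic approximation over the perturbation set; the specific form of that bound is given in the paper and is not reproduced here.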

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

SORA: Free Second-Order Attacks in Fast Adversarial Training
cs.LG 2026-05 unverdicted novelty 5.0

SORA is an adaptive step-size adversarial training algorithm that formalizes epsilon overfitting, introduces the PertAlign metric to predict catastrophic overfitting, and dynamically adjusts perturbations to achieve s...

Margin-Adaptive Confidence Ranking for Reliable LLM Judgement
cs.LG 2026-05 unverdicted novelty 5.0

Introduces a margin-adaptive confidence ranking method that learns an estimator from simulated diversity and derives margin-dependent generalization bounds for use in fixed-sequence testing of LLM-human agreement.

Margin-Adaptive Confidence Ranking for Reliable LLM Judgement
cs.LG 2026-05 unverdicted novelty 4.0

Develops a margin-adaptive learned confidence estimator for LLMs with generalization guarantees to improve agreement rates with human judgments over heuristic baselines.