pith. sign in

Adv-BNN: Improved Adversarial Defense through Robust Bayesian Neural Network

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

We present a new algorithm to train a robust neural network against adversarial attacks. Our algorithm is motivated by the following two ideas. First, although recent work has demonstrated that fusing randomness can improve the robustness of neural networks (Liu 2017), we noticed that adding noise blindly to all the layers is not the optimal way to incorporate randomness. Instead, we model randomness under the framework of Bayesian Neural Network (BNN) to formally learn the posterior distribution of models in a scalable way. Second, we formulate the mini-max problem in BNN to learn the best model distribution under adversarial attacks, leading to an adversarial-trained Bayesian neural net. Experiment results demonstrate that the proposed algorithm achieves state-of-the-art performance under strong attacks. On CIFAR-10 with VGG network, our model leads to 14\% accuracy improvement compared with adversarial training (Madry 2017) and random self-ensemble (Liu 2017) under PGD attack with $0.035$ distortion, and the gap becomes even larger on a subset of ImageNet.

fields

cs.LG 1

years

2024 1

verdicts

UNVERDICTED 1

representative citing papers

Spectrally unstable nodes drive reliability failures in graph learning

cs.LG · 2024-12-19 · unverdicted · novelty 5.0

Spectrally unstable nodes are identified via graph-spectral distortion analysis as primary drivers of reliability failures; isolating them yields a stable subgraph for learning with propagation-based recovery for the isolated nodes, improving performance across GNNs and spectral clustering under攻击s.

citing papers explorer

Showing 1 of 1 citing paper.

  • Spectrally unstable nodes drive reliability failures in graph learning cs.LG · 2024-12-19 · unverdicted · none · ref 18 · internal anchor

    Spectrally unstable nodes are identified via graph-spectral distortion analysis as primary drivers of reliability failures; isolating them yields a stable subgraph for learning with propagation-based recovery for the isolated nodes, improving performance across GNNs and spectral clustering under攻击s.