Biologically inspired protection of deep networks from adversarial attacks

Aran Nayebi; Surya Ganguli

arxiv: 1703.09202 · v1 · pith:HH46T4HMnew · submitted 2017-03-27 · 📊 stat.ML · cs.LG· q-bio.NC

Biologically inspired protection of deep networks from adversarial attacks

Aran Nayebi , Surya Ganguli This is my paper

classification 📊 stat.ML cs.LGq-bio.NC

keywords networksadversarialexampleshighlyneuralachieveattacksdeep

0 comments

read the original abstract

Inspired by biophysical principles underlying nonlinear dendritic computation in neural circuits, we develop a scheme to train deep neural networks to make them robust to adversarial attacks. Our scheme generates highly nonlinear, saturated neural networks that achieve state of the art performance on gradient based adversarial examples on MNIST, despite never being exposed to adversarially chosen examples during training. Moreover, these networks exhibit unprecedented robustness to targeted, iterative schemes for generating adversarial examples, including second-order methods. We further identify principles governing how these networks achieve their robustness, drawing on methods from information geometry. We find these networks progressively create highly flat and compressed internal representations that are sensitive to very few input dimensions, while still solving the task. Moreover, they employ highly kurtotic weight distributions, also found in the brain, and we demonstrate how such kurtosis can protect even linear classifiers from adversarial attack.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Defending Adversarial Attacks by Correcting logits
cs.LG 2019-06 unverdicted novelty 5.0

A two-layer network trained on mixed clean and perturbed logits recovers original predictions for a range of adversarial attacks without needing image data.