Bayesian Adversarial Spheres: Bayesian Inference and Adversarial Examples in a Noiseless Setting

Artur Bekasov; Iain Murray

arxiv: 1811.12335 · v1 · pith:QXZOJYNGnew · submitted 2018-11-29 · 📊 stat.ML · cs.LG

Bayesian Adversarial Spheres: Bayesian Inference and Adversarial Examples in a Noiseless Setting

Artur Bekasov , Iain Murray This is my paper

classification 📊 stat.ML cs.LG

keywords adversarialbayesiananalysisdeepexamplesinferencemethodsmodels

0 comments

read the original abstract

Modern deep neural network models suffer from adversarial examples, i.e. confidently misclassified points in the input space. It has been shown that Bayesian neural networks are a promising approach for detecting adversarial points, but careful analysis is problematic due to the complexity of these models. Recently Gilmer et al. (2018) introduced adversarial spheres, a toy set-up that simplifies both practical and theoretical analysis of the problem. In this work, we use the adversarial sphere set-up to understand the properties of approximate Bayesian inference methods for a linear model in a noiseless setting. We compare predictions of Bayesian and non-Bayesian methods, showcasing the advantages of the former, although revealing open challenges for deep learning applications.

This paper has not been read by Pith yet.

Bayesian Adversarial Spheres: Bayesian Inference and Adversarial Examples in a Noiseless Setting

discussion (0)