pith. sign in

arxiv: 1704.00103 · v2 · pith:IXT76OJCnew · submitted 2017-04-01 · 💻 cs.CV · cs.LG

SafetyNet: Detecting and Rejecting Adversarial Examples Robustly

classification 💻 cs.CV cs.LG
keywords adversarialdepthimagessafetynetconstructiondefeatdifficultyexamples
0
0 comments X
read the original abstract

We describe a method to produce a network where current methods such as DeepFool have great difficulty producing adversarial samples. Our construction suggests some insights into how deep networks work. We provide a reasonable analyses that our construction is difficult to defeat, and show experimentally that our method is hard to defeat with both Type I and Type II attacks using several standard networks and datasets. This SafetyNet architecture is used to an important and novel application SceneProof, which can reliably detect whether an image is a picture of a real scene or not. SceneProof applies to images captured with depth maps (RGBD images) and checks if a pair of image and depth map is consistent. It relies on the relative difficulty of producing naturalistic depth maps for images in post processing. We demonstrate that our SafetyNet is robust to adversarial examples built from currently known attacking approaches.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.