Strike (with) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects

Anh Nguyen; Chengfei Wang; Long Mai; Michael A. Alcorn; Qi Li; Wei-shinn Ku; Zhitao Gong

arxiv: 1811.11553 · v3 · pith:UZUW2GZZnew · submitted 2018-11-28 · 💻 cs.CV · cs.LG

Strike (with) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects

Michael A. Alcorn , Qi Li , Zhitao Gong , Chengfei Wang , Long Mai , Wei-shinn Ku , Anh Nguyen This is my paper

classification 💻 cs.CV cs.LG

keywords dnnsposesobjectsposetransferdatasetframeworkimage

0 comments

read the original abstract

Despite excellent performance on stationary test sets, deep neural networks (DNNs) can fail to generalize to out-of-distribution (OoD) inputs, including natural, non-adversarial ones, which are common in real-world settings. In this paper, we present a framework for discovering DNN failures that harnesses 3D renderers and 3D models. That is, we estimate the parameters of a 3D renderer that cause a target DNN to misbehave in response to the rendered image. Using our framework and a self-assembled dataset of 3D objects, we investigate the vulnerability of DNNs to OoD poses of well-known objects in ImageNet. For objects that are readily recognized by DNNs in their canonical poses, DNNs incorrectly classify 97% of their pose space. In addition, DNNs are highly sensitive to slight pose perturbations. Importantly, adversarial poses transfer across models and datasets. We find that 99.9% and 99.4% of the poses misclassified by Inception-v3 also transfer to the AlexNet and ResNet-50 image classifiers trained on the same ImageNet dataset, respectively, and 75.5% transfer to the YOLOv3 object detector trained on MS COCO.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Invariance-inducing regularization using worst-case transformations suffices to boost accuracy and spatial robustness
cs.LG 2019-06 unverdicted novelty 5.0

Invariance-inducing regularization using worst-case transformations reduces relative error by 20% on CIFAR10 transformed examples, improves standard accuracy on SVHN, outperforms equivariant networks, and proves no ac...