Robust Physical-World Attacks on Deep Learning Models

Amir Rahmati; Atul Prakash; Bo Li; Chaowei Xiao; Dawn Song; Earlence Fernandes; Ivan Evtimov; Kevin Eykholt; Tadayoshi Kohno

arxiv: 1707.08945 · v5 · pith:IHHOWAJFnew · submitted 2017-07-27 · 💻 cs.CR · cs.LG

Kevin Eykholt, Ivan Evtimov, Earlence Fernandes, Bo Li, Amir Rahmati, Chaowei Xiao, Atul Prakash, Tadayoshi Kohno, Dawn Song

keywords: adversarial, physical, examples, robust, perturbations, sign, attack, conditions

Recent studies show that state-of-the-art deep neural networks (DNNs) are vulnerable to adversarial examples, resulting from small-magnitude perturbations added to the input. Given that emerging physical systems are using DNNs in safety-critical situations, adversarial examples could mislead these systems and cause dangerous situations. Therefore, understanding adversarial examples in the physical world is an important step towards developing resilient learning algorithms. We propose a general attack algorithm, Robust Physical Perturbations (RP2), to generate robust visual adversarial perturbations under different physical conditions. Using the real-world case of road sign classification, we show that adversarial examples generated using RP2 achieve high targeted misclassification rates against standard-architecture road sign classifiers in the physical world under various environmental conditions, including viewpoints. Due to the current lack of a standardized testing method, we propose a two-stage evaluation methodology for robust physical adversarial examples consisting of lab and field tests. Using this methodology, we evaluate the efficacy of physical adversarial manipulations on real objects. With a perturbation in the form of only black and white stickers, we attack a real stop sign, causing targeted misclassification in 100% of the images obtained in lab settings, and in 84.8% of the captured video frames obtained on a moving vehicle (field test) for the target classifier.
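The abstract does not spell out the RP2 objective, so the following is only a toy sketch of the general idea it describes: a perturbation confined to a sticker-like region, optimized so that a target label is predicted across several sampled conditions, and then scored by its targeted misclassification rate. Everything here is an assumption made for illustration: the linear softmax classifier stands in for a road sign network, the "conditions" are just differently scaled copies of one input, and the names `masked_targeted_perturbation`, `targeted_rate`, `mask`, and `lam` are hypothetical rather than taken from the paper.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over a 1-D logit vector."""
    e = np.exp(z - np.max(z))
    return e / e.sum()


def targeted_rate(W, b, samples, perturbation, target):
    """Fraction of samples classified as `target` once the perturbation is added."""
    hits = [int(np.argmax(W @ (x + perturbation) + b)) == target for x in samples]
    return float(np.mean(hits))


def masked_targeted_perturbation(W, b, samples, mask, target,
                                 steps=300, lr=0.5, lam=0.01):
    """Gradient descent on a masked perturbation for a toy linear classifier.

    Minimizes the mean cross-entropy toward `target` over all samples plus
    lam * ||mask * delta||^2. Only coordinates where mask == 1 can change,
    which mimics restricting the attack to a sticker region (an assumption).
    """
    delta = np.zeros(W.shape[1])
    onehot = np.zeros(W.shape[0])
    onehot[target] = 1.0
    for _ in range(steps):
        # Gradient of the size penalty (mask is binary, so mask**2 == mask).
        grad = 2.0 * lam * mask * delta
        for x in samples:
            p = softmax(W @ (x + mask * delta) + b)
            # Cross-entropy gradient w.r.t. the input, restricted to the mask.
            grad += mask * (W.T @ (p - onehot)) / len(samples)
        delta -= lr * grad
    return mask * delta
```

Averaging the loss over several samples, rather than optimizing against a single image, is what this sketch uses to approximate robustness to changing conditions. The same `targeted_rate` helper mirrors how the reported 100% and 84.8% figures are framed: the share of captured images or frames assigned the attacker's chosen label.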

Forward citations

Cited by 10 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Uncovering and Understanding FPR Manipulation Attack in Industrial IoT Networks
cs.CR 2026-01 unverdicted novelty 8.0

FPR manipulation attack perturbs benign MQTT packets to flip labels to attacks in NIDS with 80-100% success, increasing SOC delays without gradient-based methods.

Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
cs.CR 2017-12 unverdicted novelty 7.0

Injecting around 50 poisoned samples with a stealthy trigger creates backdoors in deep learning models achieving over 90% attack success under a weak threat model with no model or data knowledge required.

AVISE: Framework for Evaluating the Security of AI Systems
cs.CR 2026-04 unverdicted novelty 6.0

AVISE provides a new framework and automated SET that identifies jailbreak vulnerabilities in language models with 92% accuracy, finding all nine tested models vulnerable to an augmented Red Queen attack.

Street-Legal Physical-World Adversarial Rim for License Plates
cs.CV 2026-04 conditional novelty 6.0

SPAR is a street-legal physical rim that cuts modern ALPR accuracy by 60% and reaches 18% targeted impersonation while costing under $100 and requiring no plate modification.

Remote Rowhammer Attack using Adversarial Observations on Federated Learning Clients
cs.LG 2025-05 unverdicted novelty 6.0

A reinforcement learning attacker manipulates client sensor observations in federated learning to induce repetitive server memory updates, achieving around 70% repeated update rate and enabling remote Rowhammer bit fl...

Open DNN Box by Power Side-Channel Attack
cs.CR 2019-07 unverdicted novelty 6.0

Power side-channel analysis recovers DNN architecture and parameters at 96.5% average accuracy on real embedded devices.

Adversarial Objects Against LiDAR-Based Autonomous Driving Systems
cs.CR 2019-07 unverdicted novelty 6.0

LiDAR-Adv generates adversarial objects to fool LiDAR-based autonomous driving detection systems, tested on Baidu Apollo and with physical 3D prints.

Memory Efficient Full-gradient Attacks (MEFA) Framework for Adversarial Defense Evaluations
cs.LG 2026-05 unverdicted novelty 5.0

MEFA enables exact full-gradient white-box attacks on iterative stochastic purification defenses like diffusion and Langevin EBMs by trading recomputation for lower memory, revealing vulnerabilities missed by approxim...

Consumer Law for AI Agents
cs.CY 2025-07 unverdicted novelty 5.0

EU consumer law needs adaptation to accommodate AI agents acting as autonomous purchasing decision-makers.

Connecting Lyapunov Control Theory to Adversarial Attacks
cs.CR 2019-07 unverdicted novelty 5.0

Connects Lyapunov control theory to a provable defense against weaker adversarial attacks on neural networks.