Robust Deep Reinforcement Learning with Adversarial Attacks

Anay Pattanaik; Gautham Bommannan; Girish Chowdhary; Shuijing Liu; Zhenyi Tang

arxiv: 1712.03632 · v1 · pith:ENXSUBFMnew · submitted 2017-12-11 · 💻 cs.LG · cs.AI· cs.RO

Robust Deep Reinforcement Learning with Adversarial Attacks

Anay Pattanaik , Zhenyi Tang , Shuijing Liu , Gautham Bommannan , Girish Chowdhary This is my paper

classification 💻 cs.LG cs.AIcs.RO

keywords attacksdeeplearningadversarialreinforcementrobustnessalgorithmsattack

0 comments

read the original abstract

This paper proposes adversarial attacks for Reinforcement Learning (RL) and then improves the robustness of Deep Reinforcement Learning algorithms (DRL) to parameter uncertainties with the help of these attacks. We show that even a naively engineered attack successfully degrades the performance of DRL algorithm. We further improve the attack using gradient information of an engineered loss function which leads to further degradation in performance. These attacks are then leveraged during training to improve the robustness of RL within robust control framework. We show that this adversarial training of DRL algorithms like Deep Double Q learning and Deep Deterministic Policy Gradients leads to significant increase in robustness to parameter variations for RL benchmarks such as Cart-pole, Mountain Car, Hopper and Half Cheetah environment.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Efficient Preference Poisoning Attack on Offline RLHF
cs.LG 2026-05 unverdicted novelty 8.0

Label-flip attacks on log-linear DPO reduce to binary sparse approximation problems that can be solved efficiently by lattice-based and binary matching pursuit methods with recovery guarantees.
Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning
cs.LG 2026-05 unverdicted novelty 6.0

The IBAL framework builds information-theoretic attacks that break agent interactions in MARL and trains policies to stay robust under observation and action perturbations.