Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear

Abhishek Kumar; Jianfeng Gao; Kamyar Azizzadenesheli; Li Deng; Lihong Li; Zachary C. Lipton

arxiv: 1611.01211 · v8 · pith:R2UBDYQVnew · submitted 2016-11-03 · 💻 cs.LG · cs.NE · stat.ML

Zachary C. Lipton, Kamyar Azizzadenesheli, Abhishek Kumar, Lihong Li, Jianfeng Gao, Li Deng

classification 💻 cs.LG · cs.NE · stat.ML

keywords agents · fear · learning · environments · intrinsic · objective · reinforcement · reward

Many practical environments contain catastrophic states that an optimal agent would visit infrequently or never. Even on toy problems, Deep Reinforcement Learning (DRL) agents tend to periodically revisit these states upon forgetting their existence under a new policy. We introduce intrinsic fear (IF), a learned reward shaping that guards DRL agents against periodic catastrophes. IF agents possess a fear model trained to predict the probability of imminent catastrophe. This score is then used to penalize the Q-learning objective. Our theoretical analysis bounds the reduction in average return due to learning on the perturbed objective. We also prove robustness to classification errors. As a bonus, IF models tend to learn faster, owing to reward shaping. Experiments demonstrate that intrinsic-fear DQNs solve otherwise pathological environments and improve on several Atari games.
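
The abstract says the fear model's score is "used to penalize the Q-learning objective" but does not give the formula here. One plausible reading, sketched below, subtracts a weighted fear probability from the usual one-step Q-learning target. The function name, the `fear_weight` and `gamma` parameters, and the exact form of the penalty are assumptions made for illustration, not details taken from this page.

```python
def intrinsic_fear_target(reward, next_q_values, fear_prob,
                          gamma=0.99, fear_weight=1.0, terminal=False):
    """One-step Q-learning target with a fear penalty (illustrative sketch).

    reward        -- environment reward for the transition
    next_q_values -- Q-value estimates for each action in the next state
    fear_prob     -- fear model's predicted probability of imminent
                     catastrophe for the next state, in [0, 1]
    gamma         -- discount factor (assumed value)
    fear_weight   -- hypothetical coefficient scaling the penalty
    terminal      -- if True, no bootstrapped value is added
    """
    if not 0.0 <= fear_prob <= 1.0:
        raise ValueError("fear_prob must be a probability in [0, 1]")
    # Standard bootstrapped value of the next state (zero at episode end).
    bootstrap = 0.0 if terminal else gamma * max(next_q_values)
    # Assumed penalty form: subtract the weighted fear score from the target.
    return reward + bootstrap - fear_weight * fear_prob
```

With `fear_prob` at zero the target reduces to the ordinary Q-learning target, so under this reading the penalty only changes learning near states the fear model flags as dangerous.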

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Generalizing from a few environments in safety-critical reinforcement learning
cs.LG · 2019-07 · unverdicted · novelty 6.0

RL agents fail dangerously on unseen environments; ensembles reduce catastrophes in gridworld but not CoinRun, with uncertainty enabling intervention prediction.