Deep Exploration via Bootstrapped DQN

Alexander Pritzel; Benjamin Van Roy; Charles Blundell; Ian Osband

arxiv: 1602.04621 · v3 · pith:VESOUDSKnew · submitted 2016-02-15 · 💻 cs.LG · cs.AI · cs.SY · eess.SY · stat.ML

Ian Osband, Charles Blundell, Alexander Pritzel, Benjamin Van Roy

classification 💻 cs.LG · cs.AI · cs.SY · eess.SY · stat.ML

keywords bootstrapped · exploration · learning · complex · deep · efficient · across · algorithm

Efficient exploration in complex environments remains a major challenge for reinforcement learning. We propose bootstrapped DQN, a simple algorithm that explores in a computationally and statistically efficient manner through use of randomized value functions. Unlike dithering strategies such as epsilon-greedy exploration, bootstrapped DQN carries out temporally-extended (or deep) exploration; this can lead to exponentially faster learning. We demonstrate these benefits in complex stochastic MDPs and in the large-scale Arcade Learning Environment. Bootstrapped DQN substantially improves learning times and performance across most Atari games.
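
The contrast the abstract draws between dithering and temporally-extended exploration can be illustrated with a small sketch. This is not the paper's implementation: it assumes a tabular ensemble of K value estimates standing in for the heads of a deep network, a Bernoulli mask deciding which heads learn from each transition, and a toy deterministic chain environment. The names `BootstrappedQ`, `mask_prob`, `epsilon_greedy_action`, and `run_chain` are hypothetical and chosen only for illustration.

```python
import random


def epsilon_greedy_action(q_row, epsilon, rng):
    """Dithering baseline: the random choice is redrawn independently at every step."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    return max(range(len(q_row)), key=q_row.__getitem__)


class BootstrappedQ:
    """Ensemble of K tabular Q estimates (an assumed stand-in for K network heads)."""

    def __init__(self, n_states, n_actions, n_heads, rng,
                 mask_prob=0.5, lr=0.1, gamma=0.99):
        self.rng = rng
        self.n_actions = n_actions
        self.mask_prob = mask_prob  # chance that a head trains on a given transition
        self.lr = lr
        self.gamma = gamma
        # Random initial values so the heads disagree before any data arrives.
        self.q = [[[rng.gauss(0.0, 1.0) for _ in range(n_actions)]
                   for _ in range(n_states)]
                  for _ in range(n_heads)]
        self.active = 0

    def start_episode(self):
        # Sample one value function and commit to it for the whole episode.
        self.active = self.rng.randrange(len(self.q))

    def act(self, state):
        # Greedy with respect to the sampled head; no per-step noise.
        row = self.q[self.active][state]
        return max(range(self.n_actions), key=row.__getitem__)

    def update(self, s, a, r, s_next, done):
        # Each head sees a random subset of transitions (bootstrap-style mask).
        for head in self.q:
            if self.rng.random() >= self.mask_prob:
                continue
            target = r if done else r + self.gamma * max(head[s_next])
            head[s][a] += self.lr * (target - head[s][a])


def run_chain(agent, n_states, episodes):
    """Toy chain: action 1 moves right, action 0 moves left; reward only at the far end."""
    successes = 0
    for _ in range(episodes):
        agent.start_episode()
        s = 0
        for _ in range(n_states + 5):
            a = agent.act(s)
            s_next = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
            done = s_next == n_states - 1
            agent.update(s, a, 1.0 if done else 0.0, s_next, done)
            s = s_next
            if done:
                successes += 1
                break
    return successes
```

Under these assumptions, a call such as `run_chain(BootstrappedQ(10, 2, 8, random.Random(0)), 10, 200)` returns the number of episodes that reached the rewarding end of the chain. The design point is that randomness enters once per episode through `start_episode`, so the agent follows one consistent policy for many steps, whereas `epsilon_greedy_action` perturbs individual steps in isolation.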

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Concrete Problems in AI Safety
cs.AI · 2016-06 · accept · novelty 7.0

The paper categorizes five concrete AI safety problems arising from flawed objectives, costly evaluation, and learning dynamics.

Bayesian Neural Networks: An Introduction and Survey
stat.ML · 2020-06 · unverdicted · novelty 1.0

A survey introducing Bayesian Neural Networks and comparing approximate inference methods to enable uncertainty quantification in neural network predictions.