pith. sign in

arxiv: 1602.04621 · v3 · pith:VESOUDSKnew · submitted 2016-02-15 · 💻 cs.LG · cs.AI· cs.SY· eess.SY· stat.ML

Deep Exploration via Bootstrapped DQN

classification 💻 cs.LG cs.AIcs.SYeess.SYstat.ML
keywords bootstrappedexplorationlearningcomplexdeepefficientacrossalgorithm
0
0 comments X
read the original abstract

Efficient exploration in complex environments remains a major challenge for reinforcement learning. We propose bootstrapped DQN, a simple algorithm that explores in a computationally and statistically efficient manner through use of randomized value functions. Unlike dithering strategies such as epsilon-greedy exploration, bootstrapped DQN carries out temporally-extended (or deep) exploration; this can lead to exponentially faster learning. We demonstrate these benefits in complex stochastic MDPs and in the large-scale Arcade Learning Environment. Bootstrapped DQN substantially improves learning times and performance across most Atari games.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Concrete Problems in AI Safety

    cs.AI 2016-06 accept novelty 7.0

    The paper categorizes five concrete AI safety problems arising from flawed objectives, costly evaluation, and learning dynamics.

  2. Bayesian Neural Networks: An Introduction and Survey

    stat.ML 2020-06 unverdicted novelty 1.0

    A survey introducing Bayesian Neural Networks and comparing approximate inference methods to enable uncertainty quantification in neural network predictions.