pith. sign in

arxiv: 1806.04552 · v1 · pith:BZ2CVEW7new · submitted 2018-06-12 · 💻 cs.LG · cs.AI· cs.CV· cs.NE· stat.ML

Combining Model-Free Q-Ensembles and Model-Based Approaches for Informed Exploration

classification 💻 cs.LG cs.AIcs.CVcs.NEstat.ML
keywords q-ensemblesexplorationmodel-basedapproachesmodel-freeapproachbeencompared
0
0 comments X
read the original abstract

Q-Ensembles are a model-free approach where input images are fed into different Q-networks and exploration is driven by the assumption that uncertainty is proportional to the variance of the output Q-values obtained. They have been shown to perform relatively well compared to other exploration strategies. Further, model-based approaches, such as encoder-decoder models have been used successfully for next frame prediction given previous frames. This paper proposes to integrate the model-free Q-ensembles and model-based approaches with the hope of compounding the benefits of both and achieving superior exploration as a result. Results show that a model-based trajectory memory approach when combined with Q-ensembles produces superior performance when compared to only using Q-ensembles.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.