pith. the verified trust layer for science. sign in

arxiv: 1611.09894 · v1 · pith:KULZUIPBnew · submitted 2016-11-29 · 💻 cs.AI · cs.LG· stat.ML

Exploration for Multi-task Reinforcement Learning with Deep Generative Models

classification 💻 cs.AI cs.LGstat.ML
keywords explorationmulti-tasklearningmethodreinforcementdeepgenerativemodels
0
0 comments X p. Extension
Add this Pith Number to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{KULZUIPB}

Prints a linked pith:KULZUIPB badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Exploration in multi-task reinforcement learning is critical in training agents to deduce the underlying MDP. Many of the existing exploration frameworks such as $E^3$, $R_{max}$, Thompson sampling assume a single stationary MDP and are not suitable for system identification in the multi-task setting. We present a novel method to facilitate exploration in multi-task reinforcement learning using deep generative models. We supplement our method with a low dimensional energy model to learn the underlying MDP distribution and provide a resilient and adaptive exploration signal to the agent. We evaluate our method on a new set of environments and provide intuitive interpretation of our results.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.