Adaptive Variance for Changing Sparse-Reward Environments

Carlos Florensa; David Held; Pengsheng Guo; Xingyu Lin

arxiv: 1903.06309 · v2 · pith:AZKTROVPnew · submitted 2019-03-15 · 💻 cs.RO · cs.AI

Adaptive Variance for Changing Sparse-Reward Environments

Xingyu Lin , Pengsheng Guo , Carlos Florensa , David Held This is my paper

classification 💻 cs.RO cs.AI

keywords changesenvironmentsexplorationpolicysparse-rewardadaptchangingenvironment

0 comments

read the original abstract

Robots that are trained to perform a task in a fixed environment often fail when facing unexpected changes to the environment due to a lack of exploration. We propose a principled way to adapt the policy for better exploration in changing sparse-reward environments. Unlike previous works which explicitly model environmental changes, we analyze the relationship between the value function and the optimal exploration for a Gaussian-parameterized policy and show that our theory leads to an effective strategy for adjusting the variance of the policy, enabling fast adapt to changes in a variety of sparse-reward environments.

This paper has not been read by Pith yet.

Adaptive Variance for Changing Sparse-Reward Environments

discussion (0)