pith. machine review for the scientific record. sign in

arxiv: 1906.03710 · v1 · pith:BI7TUMVXnew · submitted 2019-06-09 · 💻 cs.LG · cs.AI· cs.RO· stat.ML

Curiosity-Driven Multi-Criteria Hindsight Experience Replay

classification 💻 cs.LG cs.AIcs.ROstat.ML
keywords curiosity-drivenhindsightsparse-rewardblocksexplorationlearningrewardsparse
0
0 comments X
read the original abstract

Dealing with sparse rewards is a longstanding challenge in reinforcement learning. The recent use of hindsight methods have achieved success on a variety of sparse-reward tasks, but they fail on complex tasks such as stacking multiple blocks with a robot arm in simulation. Curiosity-driven exploration using the prediction error of a learned dynamics model as an intrinsic reward has been shown to be effective for exploring a number of sparse-reward environments. We present a method that combines hindsight with curiosity-driven exploration and curriculum learning in order to solve the challenging sparse-reward block stacking task. We are the first to stack more than two blocks using only sparse reward without human demonstrations.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.