pith. sign in

arxiv: 1707.08616 · v2 · pith:M5YZAVN2new · submitted 2017-07-26 · 💻 cs.AI · cs.CL· cs.LG· stat.ML

Guiding Reinforcement Learning Exploration Using Natural Language

classification 💻 cs.AI cs.CLcs.LGstat.ML
keywords languagelearningnaturalpolicyshapingtechniqueagentenvironments
0
0 comments X
read the original abstract

In this work we present a technique to use natural language to help reinforcement learning generalize to unseen environments. This technique uses neural machine translation, specifically the use of encoder-decoder networks, to learn associations between natural language behavior descriptions and state-action information. We then use this learned model to guide agent exploration using a modified version of policy shaping to make it more effective at learning in unseen environments. We evaluate this technique using the popular arcade game, Frogger, under ideal and non-ideal conditions. This evaluation shows that our modified policy shaping algorithm improves over a Q-learning agent as well as a baseline version of policy shaping.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.