pith. sign in

arxiv: 1603.03267 · v1 · pith:3MAQIEWVnew · submitted 2016-03-10 · 💻 cs.AI

Hierarchical Linearly-Solvable Markov Decision Problems

classification 💻 cs.AI
keywords hierarchicallearningtaskdecisionlinearly-solvablelmdpsmarkovproblems
0
0 comments X
read the original abstract

We present a hierarchical reinforcement learning framework that formulates each task in the hierarchy as a special type of Markov decision process for which the Bellman equation is linear and has analytical solution. Problems of this type, called linearly-solvable MDPs (LMDPs) have interesting properties that can be exploited in a hierarchical setting, such as efficient learning of the optimal value function or task compositionality. The proposed hierarchical approach can also be seen as a novel alternative to solving LMDPs with large state spaces. We derive a hierarchical version of the so-called Z-learning algorithm that learns different tasks simultaneously and show empirically that it significantly outperforms the state-of-the-art learning methods in two classical hierarchical reinforcement learning domains: the taxi domain and an autonomous guided vehicle task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.