pith. sign in

arxiv: 1609.05524 · v3 · pith:SYKLH5SMnew · submitted 2016-09-18 · 💻 cs.LG · stat.ML

Principled Option Learning in Markov Decision Processes

classification 💻 cs.LG stat.ML
keywords optionsusefulalgorithmcharacterizationprincipledalgorithmsautonomouslybenefits
0
0 comments X
read the original abstract

It is well known that options can make planning more efficient, among their many benefits. Thus far, algorithms for autonomously discovering a set of useful options were heuristic. Naturally, a principled way of finding a set of useful options may be more promising and insightful. In this paper we suggest a mathematical characterization of good sets of options using tools from information theory. This characterization enables us to find conditions for a set of options to be optimal and an algorithm that outputs a useful set of options and illustrate the proposed algorithm in simulation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.