Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes

Daniel A. Braun; Felix Leibfried; Jordi Grau-Moya; Tim Genewein

arxiv: 1604.02080 · v1 · pith:OX54Z4Y5new · submitted 2016-04-07 · 💻 cs.AI · cs.SY· eess.SY

Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes

Jordi Grau-Moya , Felix Leibfried , Tim Genewein , Daniel A. Braun This is my paper

classification 💻 cs.AI cs.SYeess.SY

keywords modelplanninggeneralizedinformation-processinguncertaintyconstraintsdecisioniteration

0 comments

read the original abstract

Information-theoretic principles for learning and acting have been proposed to solve particular classes of Markov Decision Problems. Mathematically, such approaches are governed by a variational free energy principle and allow solving MDP planning problems with information-processing constraints expressed in terms of a Kullback-Leibler divergence with respect to a reference distribution. Here we consider a generalization of such MDP planners by taking model uncertainty into account. As model uncertainty can also be formalized as an information-processing constraint, we can derive a unified solution from a single generalized variational principle. We provide a generalized value iteration scheme together with a convergence proof. As limit cases, this generalized scheme includes standard value iteration with a known model, Bayesian MDP planning, and robust planning. We demonstrate the benefits of this approach in a grid world simulation.

This paper has not been read by Pith yet.

Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes

discussion (0)