pith. sign in

arxiv: 1308.3513 · v1 · pith:LXQPIF43new · submitted 2013-08-15 · 💻 cs.LG · cs.AI

Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations

classification 💻 cs.LG cs.AI
keywords taskapproachcontroldecisiondynamicshiddenhip-mdpintroduce
0
0 comments X
read the original abstract

Control applications often feature tasks with similar, but not identical, dynamics. We introduce the Hidden Parameter Markov Decision Process (HiP-MDP), a framework that parametrizes a family of related dynamical systems with a low-dimensional set of latent factors, and introduce a semiparametric regression approach for learning its structure from data. In the control setting, we show that a learned HiP-MDP rapidly identifies the dynamics of a new task instance, allowing an agent to flexibly adapt to task variations.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.