pith. sign in

arxiv: 1706.06544 · v3 · pith:FC6D4TUWnew · submitted 2017-06-20 · 📊 stat.ML · cs.AI· cs.LG

Robust and Efficient Transfer Learning with Hidden-Parameter Markov Decision Processes

classification 📊 stat.ML cs.AIcs.LG
keywords decisionframeworkhip-mdplatentmarkovapplicationsbayesiancomplex
0
0 comments X
read the original abstract

We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.