Approximate Kalman Filter Q-Learning for Continuous State-Space MDPs

Charles Tripp; Ross D. Shachter

arxiv: 1309.6868 · v1 · pith:R2OB7L42new · submitted 2013-09-26 · 💻 cs.LG · stat.ML




keywords: filter, kalman, approximate, continuous, model, q-learning, state, weights



We seek to learn an effective policy for a Markov Decision Process (MDP) with continuous states via Q-Learning. Given a set of basis functions over state-action pairs, we search for a corresponding set of linear weights that minimizes the mean Bellman residual. Our algorithm uses a Kalman filter model to estimate those weights, and we have developed a simpler approximate Kalman filter model that outperforms the current state-of-the-art projected TD-Learning methods on several standard benchmark problems.
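The idea of estimating linear Q-function weights with a Kalman filter can be illustrated with a minimal sketch. It is a generic textbook Kalman update for a scalar observation, not the authors' exact model, and it does not reproduce their simpler approximate variant, which the abstract does not specify. It assumes Q(s, a) is the dot product of a weight vector `w` with a feature vector `phi`, and that the Bellman target is treated as a noisy observation of that dot product. The function names, the observation variance `obs_var`, and the discount `gamma` are all hypothetical choices made for illustration.

```python
def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def bellman_target(w, reward, next_features, gamma=0.9):
    """r + gamma * max over a' of Q(s', a').

    next_features holds one feature vector phi(s', a') per available action;
    an empty list is treated as a terminal state (assumption).
    """
    if not next_features:
        return reward
    return reward + gamma * max(dot(w, f) for f in next_features)


def kalman_q_update(w, P, phi, target, obs_var=1.0):
    """One Kalman update of the weight mean w and covariance P.

    Observation model (assumed): target = phi . w + noise, noise variance obs_var.
    P is assumed symmetric, as a covariance matrix should be.
    """
    n = len(w)
    P_phi = [dot(row, phi) for row in P]       # P @ phi
    innov_var = dot(phi, P_phi) + obs_var      # phi^T P phi + R
    gain = [x / innov_var for x in P_phi]      # Kalman gain K
    err = target - dot(w, phi)                 # innovation (Bellman residual)
    w_new = [wi + g * err for wi, g in zip(w, gain)]
    # P - K (phi^T P); with symmetric P, phi^T P equals P_phi transposed.
    P_new = [[P[i][j] - gain[i] * P_phi[j] for j in range(n)] for i in range(n)]
    return w_new, P_new


# Usage: two features, identity prior covariance, one observed target of 1.0.
w0 = [0.0, 0.0]
P0 = [[1.0, 0.0], [0.0, 1.0]]
w1, P1 = kalman_q_update(w0, P0, phi=[1.0, 0.0], target=1.0, obs_var=1.0)
```

In this sketch the weight on the observed feature moves halfway toward the target and its variance halves, while the unobserved feature keeps its prior. The covariance therefore acts as a per-weight adaptive step size, which is the intuition behind using a filter rather than a fixed learning rate.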


