Manifold Regularization for Kernelized LSTD

Byron Boots; Krzysztof Choromanski; Vikas Sindhwani; Xinyan Yan

arxiv: 1710.05387 · v1 · pith:5HXMLC5Inew · submitted 2017-10-15 · 💻 cs.LG · cs.AI· stat.ML

Manifold Regularization for Kernelized LSTD

Xinyan Yan , Krzysztof Choromanski , Byron Boots , Vikas Sindhwani This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords policyapproximationevaluationiterationkernelizedlearningmanifoldmethod

0 comments

read the original abstract

Policy evaluation or value function or Q-function approximation is a key procedure in reinforcement learning (RL). It is a necessary component of policy iteration and can be used for variance reduction in policy gradient methods. Therefore its quality has a significant impact on most RL algorithms. Motivated by manifold regularized learning, we propose a novel kernelized policy evaluation method that takes advantage of the intrinsic geometry of the state space learned from data, in order to achieve better sample efficiency and higher accuracy in Q-function approximation. Applying the proposed method in the Least-Squares Policy Iteration (LSPI) framework, we observe superior performance compared to widely used parametric basis functions on two standard benchmarks in terms of policy quality.

This paper has not been read by Pith yet.

Manifold Regularization for Kernelized LSTD

discussion (0)