pith. sign in

arxiv: 1005.0125 · v1 · submitted 2010-05-02 · 💻 cs.LG · cs.AI

Adaptive Bases for Reinforcement Learning

classification 💻 cs.LG cs.AI
keywords adaptiveapproximationbasisbellmanfunctionlearningproblemreinforcement
0
0 comments X
read the original abstract

We consider the problem of reinforcement learning using function approximation, where the approximating basis can change dynamically while interacting with the environment. A motivation for such an approach is maximizing the value function fitness to the problem faced. Three errors are considered: approximation square error, Bellman residual, and projected Bellman residual. Algorithms under the actor-critic framework are presented, and shown to converge. The advantage of such an adaptive basis is demonstrated in simulations.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.