pith. sign in

arxiv: cs/0301007 · v1 · submitted 2003-01-09 · 💻 cs.LG · cs.AI

Kalman filter control in the reinforcement learning framework

classification 💻 cs.LG cs.AI
keywords learningcontrolestimationreinforcementkalman-filteroptimalvalueallows
0
0 comments X
read the original abstract

There is a growing interest in using Kalman-filter models in brain modelling. In turn, it is of considerable importance to make Kalman-filters amenable for reinforcement learning. In the usual formulation of optimal control it is computed off-line by solving a backward recursion. In this technical note we show that slight modification of the linear-quadratic-Gaussian Kalman-filter model allows the on-line estimation of optimal control and makes the bridge to reinforcement learning. Moreover, the learning rule for value estimation assumes a Hebbian form weighted by the error of the value estimation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.