Model-based reinforcement learning for infinite-horizon approximate optimal tracking

Lindsey Andrews; Patrick Walters; Rushikesh Kamalapurkar; Warren E. Dixon

arxiv: 1506.00685 · v1 · pith:CBGFVY2Unew · submitted 2015-06-01 · 💻 cs.SY · cs.SY· math.OC

Model-based reinforcement learning for infinite-horizon approximate optimal tracking

Rushikesh Kamalapurkar , Lindsey Andrews , Patrick Walters , Warren E. Dixon This is my paper

classification 💻 cs.SY cs.SYmath.OC

keywords learningmodel-basedoptimalreinforcementtrackingapproximatedevelopedinfinite-horizon

0 comments

read the original abstract

This paper provides an approximate online adaptive solution to the infinite-horizon optimal tracking problem for control-affine continuous-time nonlinear systems with unknown drift dynamics. Model-based reinforcement learning is used to relax the persistence of excitation condition. Model-based reinforcement learning is implemented using a concurrent learning-based system identifier to simulate experience by evaluating the Bellman error over unexplored areas of the state space. Tracking of the desired trajectory and convergence of the developed policy to a neighborhood of the optimal policy are established via Lyapunov-based stability analysis. Simulation results demonstrate the effectiveness of the developed technique.

This paper has not been read by Pith yet.

Model-based reinforcement learning for infinite-horizon approximate optimal tracking

discussion (0)