Model Based Reinforcement Learning with Final Time Horizon Optimization

Evangelos Theodorou; Panagiotis Tsiotras; Wei Sun

arxiv: 1509.01186 · v1 · pith:W5RUKOC7new · submitted 2015-09-03 · 💻 cs.SY

Model Based Reinforcement Learning with Final Time Horizon Optimization

Wei Sun , Evangelos Theodorou , Panagiotis Tsiotras This is my paper

classification 💻 cs.SY

keywords optimalhorizonmodeloptimizationtimecontrolfinallearning

0 comments

read the original abstract

We present one of the first algorithms on model based reinforcement learning and trajectory optimization with free final time horizon. Grounded on the optimal control theory and Dynamic Programming, we derive a set of backward differential equations that propagate the value function and provide the optimal control policy and the optimal time horizon. The resulting policy generalizes previous results in model based trajectory optimization. Our analysis shows that the proposed algorithm recovers the theoretical optimal solution on linear low dimensional problem. Finally we provide application results on nonlinear systems.

This paper has not been read by Pith yet.

Model Based Reinforcement Learning with Final Time Horizon Optimization

discussion (0)