Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret

Alon Cohen; Tomer Koren; Yishay Mansour

arxiv: 1902.06223 · v2 · pith:EIEBSFFOnew · submitted 2019-02-17 · 💻 cs.LG · stat.ML

keywords: learning, regret, sqrt, abbasi-yadkori, algorithm, computationally-efficient, control, dean

We present the first computationally efficient algorithm with $\widetilde{O}(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics. This resolves an open question of Abbasi-Yadkori and Szepesvári (2011) and Dean, Mania, Matni, Recht, and Tu (2018).
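
For orientation, the regret claim can be read against the standard linear-quadratic control setup sketched below. The abstract does not spell out notation, so the symbols used here ($A_\star$, $B_\star$, $Q$, $R$, $w_t$, $J^\star$, $R_T$) are assumed conventions, not quotations from the text. The learner observes the state $x_t$, chooses a control $u_t$, and the system evolves under dynamics $(A_\star, B_\star)$ that are unknown to the learner:

$$
x_{t+1} = A_\star x_t + B_\star u_t + w_t, \qquad c_t = x_t^\top Q x_t + u_t^\top R u_t,
$$

where $w_t$ is zero-mean noise and $Q$, $R$ are positive definite cost matrices. Writing $J^\star$ for the optimal steady-state average cost attainable when $(A_\star, B_\star)$ are known, the regret after $T$ rounds is

$$
R_T = \sum_{t=1}^{T} \left( c_t - J^\star \right),
$$

and the claim is that $R_T = \widetilde{O}(\sqrt{T})$, with $\widetilde{O}$ hiding factors polylogarithmic in $T$.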
