Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret
classification
cs.LG
stat.ML
keywords
learning, regret, sqrt, abbasi-yadkori, algorithm, computationally-efficient, control, dean
We present the first computationally efficient algorithm with $\widetilde O(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics. In doing so, we resolve an open question of Abbasi-Yadkori and Szepesvári (2011) and Dean, Mania, Matni, Recht, and Tu (2018).
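For orientation, the regret claim can be read against the standard linear-quadratic setup sketched below. The notation ($A_\star$, $B_\star$, $Q$, $R$, $w_t$, $J_\star$) is assumed here for illustration and is not taken from the abstract; the paper's own definitions may differ in detail.

$$
x_{t+1} = A_\star x_t + B_\star u_t + w_t, \qquad
c_t = x_t^\top Q x_t + u_t^\top R u_t, \qquad
R_T = \sum_{t=1}^{T} \left( c_t - J_\star \right)
$$

Here $x_t$ is the state, $u_t$ the control, $w_t$ the noise, $(A_\star, B_\star)$ the unknown dynamics, and $J_\star$ the optimal steady-state average cost attainable when the dynamics are known. Under this reading, a bound of $\widetilde O(\sqrt{T})$ says the regret grows no faster than $\sqrt{T}$ up to polylogarithmic factors, so the average excess cost per step $R_T / T$ shrinks roughly like $1/\sqrt{T}$.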