New epoch-based direct MRAC algorithm for adaptive discrete-time LQR achieves high-probability regret bounds without requiring an initial stabilizing controller or exploration.
Regret bounds for the adaptive control of linear quadratic systems,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
eess.SY 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Adapt and Stabilize, Then Learn and Optimize: A New Approach to Adaptive LQR
New epoch-based direct MRAC algorithm for adaptive discrete-time LQR achieves high-probability regret bounds without requiring an initial stabilizing controller or exploration.