Bayesian ddLQR adds posterior uncertainty to the design, decomposing expected cost into certainty-equivalence plus variance terms, proving indirect-direct equivalence, and producing a data-length-independent SDP.
Certainty equivalence is efficient for linear quadratic control
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
An online algorithm for zero-sum LQ games with unknown dynamics combines model estimation and surrogate selection to achieve regret bounds on policy convergence.
citing papers explorer
-
A Bayesian Perspective on the Data-Driven LQR
Bayesian ddLQR adds posterior uncertainty to the design, decomposing expected cost into certainty-equivalence plus variance terms, proving indirect-direct equivalence, and producing a data-length-independent SDP.
-
An Online Learning Approach for Two-Player Zero-Sum Linear Quadratic Games
An online algorithm for zero-sum LQ games with unknown dynamics combines model estimation and surrogate selection to achieve regret bounds on policy convergence.