Policy gradient adaptive con- trol for the LQR: Indirect and direct approaches

Zhao, Feiran, Chiuso, Alessandro · 2023 · arXiv 2505.03706

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Direct Data-Driven Linear Quadratic Tracking via Policy Optimization

eess.SY · 2026-05-15 · unverdicted · novelty 7.0

A reference-decoupled reformulation makes direct data-driven LQT equivalent to certainty-equivalence solutions and supports convergent offline and online DeePO algorithms.

Global Convergence of Policy Gradient Methods for ReLU Controllers in Linear Quadratic Regulation

math.OC · 2026-04-24 · unverdicted · novelty 6.0

Model-based policy gradient converges globally to the optimal scalar LQR gain for discounted LQR using overparameterized ReLU networks by reducing the controller to two effective gains on positive and negative half-lines.

Sample-Efficient Model-Free Policy Gradient Methods for Stochastic LQR via Robust Linear Regression

eess.SY · 2025-12-03 · unverdicted · novelty 6.0

Primal-dual robust linear regression enables O(1/epsilon) sample complexity for model-free policy gradient methods on stochastic LQR.

Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters

eess.SY · 2025-11-11 · unverdicted · novelty 5.0

LMS estimation paired with certainty-equivalent LQR delivers finite-gain ℓ²-stability for linear systems with unknown time-varying parameters and disturbances.

citing papers explorer

Showing 4 of 4 citing papers.

Direct Data-Driven Linear Quadratic Tracking via Policy Optimization eess.SY · 2026-05-15 · unverdicted · none · ref 78
A reference-decoupled reformulation makes direct data-driven LQT equivalent to certainty-equivalence solutions and supports convergent offline and online DeePO algorithms.
Global Convergence of Policy Gradient Methods for ReLU Controllers in Linear Quadratic Regulation math.OC · 2026-04-24 · unverdicted · none · ref 7
Model-based policy gradient converges globally to the optimal scalar LQR gain for discounted LQR using overparameterized ReLU networks by reducing the controller to two effective gains on positive and negative half-lines.
Sample-Efficient Model-Free Policy Gradient Methods for Stochastic LQR via Robust Linear Regression eess.SY · 2025-12-03 · unverdicted · none · ref 2
Primal-dual robust linear regression enables O(1/epsilon) sample complexity for model-free policy gradient methods on stochastic LQR.
Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters eess.SY · 2025-11-11 · unverdicted · none · ref 5
LMS estimation paired with certainty-equivalent LQR delivers finite-gain ℓ²-stability for linear systems with unknown time-varying parameters and disturbances.

Policy gradient adaptive con- trol for the LQR: Indirect and direct approaches

fields

years

verdicts

representative citing papers

citing papers explorer