Global convergence of policy gradient methods for the linear quadratic regulator

· 2018

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

On the Optimization Landscape of Observer-based Dynamic Linear Quadratic Control

eess.SY · 2026-04-12 · unverdicted · novelty 6.0

The stationary point of observer-based dynamic LQR is characterized by a pair of symmetric discrete-time Sylvester equations, and the usual separated LQR-plus-minimum-trace-observer design is not optimal.

Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient

eess.SY · 2024-03-08 · unverdicted · novelty 6.0

Relearn LQR combines recursive least squares with policy gradient for on-policy data-driven LQR and proves stability of the full scheme via Lyapunov analysis with averaging and timescale separation.

citing papers explorer

Showing 2 of 2 citing papers.

On the Optimization Landscape of Observer-based Dynamic Linear Quadratic Control eess.SY · 2026-04-12 · unverdicted · none · ref 3
The stationary point of observer-based dynamic LQR is characterized by a pair of symmetric discrete-time Sylvester equations, and the usual separated LQR-plus-minimum-trace-observer design is not optimal.
Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient eess.SY · 2024-03-08 · unverdicted · none · ref 31
Relearn LQR combines recursive least squares with policy gradient for on-policy data-driven LQR and proves stability of the full scheme via Lyapunov analysis with averaging and timescale separation.

Global convergence of policy gradient methods for the linear quadratic regulator

fields

years

verdicts

representative citing papers

citing papers explorer