On the Effect of Quadratic Regularization in Direct Data-Driven LQR
Pith reviewed 2026-05-10 03:42 UTC · model grok-4.3
The pith
Quadratic regularization in direct data-driven LQR translates costs from auxiliary variables to system quantities.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper establishes that the parametric effect of regularization in direct data-driven LQR maps the quadratic regularization terms applied to auxiliary variables into equivalent regularization terms on the system matrices and vectors. This mapping keeps the optimization outcome unchanged, offers intuitive explanations in system terms, and allows the auxiliary variables to be eliminated, reducing the dimensionality and complexity of the resulting optimization problem.
What carries the argument
the parametric effect of regularization, which re-expresses regularization penalties defined on auxiliary data-driven variables as penalties on the estimated system parameters
Load-bearing premise
That the costs of quadratic regularization can be mapped parametrically from auxiliary variables to system quantities while exactly preserving the original optimization result in the data-driven LQR setting.
What would settle it
A concrete data set and LQR problem where the controller obtained after mapping and eliminating auxiliaries differs from the one solved with the original auxiliary-variable formulation.
Figures
read the original abstract
This paper proposes an explainability concept for direct data-driven linear quadratic regulation (LQR) with quadratic regularization. Our perspective follows the parametric effect of regularization, an analysis approach that translates regularization costs from auxiliary variables to system quantities, enabling intuitive interpretations. The framework further enables the elimination of auxiliary variables, thereby reducing computational complexity. We demonstrate the effectiveness of our approach and the identified effect of regularization via simulations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes an explainability framework for direct data-driven LQR under quadratic regularization. It analyzes the parametric effect of regularization to translate penalties from auxiliary variables (e.g., trajectory or slack variables) onto effective system quantities, yielding intuitive interpretations and permitting elimination of auxiliaries to reduce computational complexity. The approach is validated through simulations.
Significance. If the proposed translation is shown to be exactly equivalent to the original regularized problem (i.e., the reparameterized optimizer coincides for all data sets satisfying the implicit Hankel-matrix constraints), the work would offer both interpretability gains and practical complexity reduction in data-driven control. The simulation results supply initial empirical support, but the overall significance hinges on whether the mapping holds without additional unstated restrictions on data rank or persistency of excitation.
major comments (1)
- [Derivation of the parametric effect (likely §3–4)] The central claim that regularization costs can be parametrically mapped from auxiliary variables to system quantities while preserving the exact optimizer requires an explicit proof of equivalence. The reparameterized problem must remain mathematically identical to the original for every feasible data set; any interaction between the translated cost and the linear constraints imposed by the data Hankel matrices could shift the solution even if the algebraic mapping appears clean.
minor comments (2)
- [Abstract] The abstract summarizes the contribution but contains no equations, explicit mapping, or quantitative results, which hinders immediate technical assessment.
- [Numerical examples] Simulation section should report concrete details: system order, data length, persistency-of-excitation verification, and quantitative metrics (e.g., closed-loop cost or regret) comparing the original and reduced formulations.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback and for highlighting the need to strengthen the equivalence claim in our parametric analysis of quadratic regularization for data-driven LQR. We address the major comment below and will revise the manuscript accordingly to improve rigor and clarity.
read point-by-point responses
-
Referee: The central claim that regularization costs can be parametrically mapped from auxiliary variables to system quantities while preserving the exact optimizer requires an explicit proof of equivalence. The reparameterized problem must remain mathematically identical to the original for every feasible data set; any interaction between the translated cost and the linear constraints imposed by the data Hankel matrices could shift the solution even if the algebraic mapping appears clean.
Authors: We agree that an explicit proof of equivalence is required to fully substantiate the central claim. The manuscript derives the parametric mapping via algebraic completion of squares on the quadratic regularization terms, translating penalties on auxiliary trajectory and slack variables into effective costs on the system matrices and initial state. However, we acknowledge that the current presentation does not include a standalone theorem verifying that the reparameterized optimizer coincides exactly with the original under the Hankel-matrix constraints for arbitrary feasible data sets. In the revised version we will add a formal proof (new Theorem in Section 3) showing that the first-order optimality conditions remain identical: the gradient of the translated cost, when augmented by the same Lagrange multipliers associated with the linear data constraints, recovers the original KKT system. This accounts for potential interactions with the Hankel constraints by construction, without introducing additional restrictions beyond the standard persistency-of-excitation and rank conditions already stated in the paper. The simulation results are consistent with this equivalence, but the proof will be provided explicitly. revision: yes
Circularity Check
No significant circularity in the parametric translation framework for quadratic regularization.
full rationale
The paper derives an explainability concept by following the parametric effect of regularization to translate costs from auxiliary variables to system quantities in the direct data-driven LQR setting. This enables interpretations and elimination of auxiliaries while preserving the optimization outcome. No load-bearing step reduces by construction to a self-definition, a fitted input renamed as prediction, or a self-citation chain whose validity depends on the present result. The translation is presented as an algebraic reparameterization of the regularized problem that remains equivalent under the data Hankel constraints, constituting an independent derivation rather than a tautology. The framework is self-contained against the standard data-driven LQR formulation without requiring external unverified premises for its core claims.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Breschi, V ., Chiuso, A., and Formentin, S. Data-driven predictive control in a stochastic setting: a unified framework.Automatica, 152, 110961, 2023
work page 2023
-
[2]
and Francis, B.Optimal Sampled-Data Control Systems
Chen, T. and Francis, B.Optimal Sampled-Data Control Systems. Springer London, 1995
work page 1995
-
[3]
Coulson, J., Lygeros, J., and D ¨orfler, F. Data-enabled predictive control: In the shallows of the DeePC.18th European Control Conference, 307– 312, 2019
work page 2019
-
[4]
CVX: Matlab software for disciplined convex program- ming, version 2.0, 2012.https://cvxr.com/cvx
CVX Research, I. CVX: Matlab software for disciplined convex program- ming, version 2.0, 2012.https://cvxr.com/cvx
work page 2012
-
[5]
De Persis, C. and Tesi, P. Formulas for data-driven control: Stabilization, optimality, and robustness.IEEE Transactions on Automatic Control, 65, 909–924, 2020
work page 2020
-
[6]
De Persis, C. and Tesi, P. Low-complexity learning of linear quadratic regulators from noisy data.Automatica, 128, 109548, 2021
work page 2021
-
[7]
On the role of regularization in direct data-driven LQR control
D ¨orfler, F., Tesi, P., and De Persis, C. On the role of regularization in direct data-driven LQR control. InIEEE 61st Conference on Decision and Control, 1091–1098,
-
[8]
D ¨orfler, F., Tesi, P., and De Persis, C. On the certainty-equivalence ap- proach to direct data-driven LQR design.IEEE Transactions on Automatic Control, 68(12), 7989–7996, 2023
work page 2023
-
[9]
Kl ¨adtke, M. and Schulze Darup, M. Towards explainable data-driven pre- dictive control with regularizations.at - Automatisierungstechnik, 73(6), 365–382, 2025
work page 2025
-
[10]
Liu, W., Wang, G., Sun, J., Bullo, F., and Chen, J. Learning robust data- based lqg controllers from noisy data.IEEE Transactions on Automatic Control, 69(12), 8526–8538, 2024
work page 2024
-
[11]
Mattsson, P., Bonassi, F., Breschi, V ., and Sch ¨on, T.B. (2024). On the equivalence of direct and indirect data-driven predictive control ap- proaches.IEEE Control Systems Letters, 8, 796–801, 2024
work page 2024
-
[12]
Mejari, M., Breschi, V ., Dehkordi, M.B., Formentin, S., and Piga, D. Bias correction and instrumental variables for direct data-driven model- reference control.European Journal of Control, 101327, 2025
work page 2025
-
[13]
T ¨ut¨unc¨u, R.H., Toh, K.C., and Todd, M. Solving semidefinite-quadratic- linear programs using SDPT3.Mathematical Programming, 95, 189-217, 2003
work page 2003
-
[14]
Noise sensitivity of the semidefinite programs for direct data-driven LQR
Zeng, X., Bako, L., and Ozay, N. Noise sensitivity of the semidefinite programs for direct data-driven LQR. Digital preprint, arXiv:2412.19705, 2024
-
[15]
Zhao, F., Chiuso, A., and D ¨orfler, F. Regularization for covariance pa- rameterization of direct data-driven LQR control.IEEE Control Systems Letters, 9, 961–966, 2025
work page 2025
-
[16]
Zhao, F., D ¨orfler, F., Chiuso, A., and You, K. Data-enabled policy opti- mization for direct adaptive learning of the LQR.IEEE Transactions on Automatic Control, 70(11), 7217–7232, 2025
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.