Model-Free Dynamic Mode Adaptive Control for Data-Driven Control Synthesis
Pith reviewed 2026-05-07 11:46 UTC · model grok-4.3
The pith
A model-free controller called DMAC approximates local linear dynamics from data and synthesizes a stabilizing feedback law online.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DMAC consists of a dynamics-approximation module that estimates local linear representations using matrix recursive least-squares with a forgetting factor, and a controller-synthesis module that computes an online stabilizing controller. The approach establishes convergence of the recursive approximation and boundedness of the closed-loop system under the DMAC controller.
What carries the argument
The matrix recursive least-squares algorithm with forgetting factor that estimates the local linear dynamics from measurements, which then feeds into the stabilizing controller design.
If this is right
- The method works for representative systems including unstable linear dynamics, nonlinear oscillators, and fluid models like Burgers' equation.
- Robustness holds with respect to algorithm hyperparameters and variations in system parameters.
- The controller provides stabilization using full state feedback and integral action based on the data-driven estimate.
Where Pith is reading between the lines
- DMAC may enable real-time adaptation in environments where system parameters drift over time due to the forgetting factor in the estimator.
- Similar data-driven approaches could be extended to output-feedback cases when full states are not measurable.
- Applications in robotics or chemical process control could benefit from reduced reliance on first-principles models.
Load-bearing premise
The system dynamics can be adequately represented by a local linear model estimated from data, with full state measurements available for feedback.
What would settle it
Observing unbounded growth in the system states under the DMAC controller in a scenario where the data is sufficient to estimate the dynamics would falsify the boundedness guarantee.
read the original abstract
This paper presents a model-free, data-driven control synthesis method called dynamic mode adaptive control (DMAC) for systems whose mathematical models are unavailable or unsuitable for classical control design. The proposed approach combines data-driven dynamics approximation with adaptive control synthesis to enable online controller design using measured system data. DMAC comprises two main components: a dynamics-approximation module and a controller-synthesis module. The dynamics approximation module estimates a local linear representation of the system dynamics directly from measurements using a matrix recursive least-squares algorithm with a forgetting factor. The estimated dynamics are then used to compute an online stabilizing controller with full-state feedback and integral action. Theoretical analysis establishes convergence properties of the recursive dynamics approximation and boundedness of the closed-loop system under the DMAC controller. The performance of the proposed method is demonstrated through numerical examples involving representative dynamical systems, including an unstable linear system, the Van der Pol oscillator, and the Burgers' equation. Sensitivity studies further demonstrate the robustness of DMAC with respect to both algorithm hyperparameters and variations in system parameters.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces Dynamic Mode Adaptive Control (DMAC), a model-free data-driven method that employs a matrix recursive least-squares estimator with forgetting factor to obtain local linear approximations of system dynamics directly from measurements. These estimates are used to synthesize an online stabilizing controller via full-state feedback augmented with integral action. Theoretical analysis is provided for convergence of the estimator and boundedness of the closed-loop trajectories. The approach is illustrated on an unstable linear system, the Van der Pol oscillator, and a discretized Burgers' equation, accompanied by sensitivity studies on hyperparameters and system parameters.
Significance. If the convergence and boundedness results hold, the work provides a useful addition to data-driven control by delivering an adaptive, online synthesis procedure with explicit stability guarantees that does not require an a priori model. The combination of RLS-with-forgetting and integral-action feedback is practically relevant for systems where dynamics are only partially known or slowly time-varying, and the numerical examples across linear and nonlinear cases support broader applicability.
major comments (2)
- [§4] §4, Theorem 1: The convergence proof for the matrix RLS estimator invokes a persistent-excitation condition on the regressor; however, the subsequent boundedness argument for the closed-loop system (Theorem 2) does not explicitly verify that the adaptive controller maintains this excitation once the loop is closed, which is load-bearing for the global stability claim.
- [§5.3] §5.3 (Burgers' equation example): The local-linear approximation is applied to a spatially discretized nonlinear PDE, yet the manuscript provides no quantitative bound on the approximation error induced by the spatial discretization or by the forgetting factor; this gap weakens the extension of the boundedness result from finite-dimensional ODEs to the reported PDE case.
minor comments (3)
- [§3.1] The notation for the estimated state matrix Â(k) and input matrix B̂(k) is introduced without an explicit statement of their dimensions relative to the original system order; adding this clarification in §3.1 would improve readability.
- [Figure 4] Figure 4 (sensitivity plots) uses different vertical scales across subplots without comment; uniform scaling or explicit mention of the plotted quantities would aid interpretation.
- [§3.2] The abstract states that 'full state measurements are available,' but the controller diagram in §3.2 does not label the measurement noise or sensor dynamics; a brief remark on this modeling choice would be helpful.
Simulated Author's Rebuttal
We thank the referee for the positive evaluation and the detailed comments, which help clarify the scope and limitations of the stability results. We address each major comment below and will revise the manuscript to strengthen the presentation of the theoretical claims.
read point-by-point responses
-
Referee: [§4] §4, Theorem 1: The convergence proof for the matrix RLS estimator invokes a persistent-excitation condition on the regressor; however, the subsequent boundedness argument for the closed-loop system (Theorem 2) does not explicitly verify that the adaptive controller maintains this excitation once the loop is closed, which is load-bearing for the global stability claim.
Authors: We agree that the link between the PE condition in Theorem 1 and the closed-loop boundedness in Theorem 2 requires explicit attention. The current proof of Theorem 2 assumes that the regressor remains persistently exciting under the adaptive feedback law. In the revision we will insert a remark immediately after Theorem 2 that (i) recalls the role of the integral action in generating a non-zero reference-tracking error that sustains excitation and (ii) notes that the forgetting factor prevents the covariance matrix from collapsing, thereby preserving the PE property once the estimator has converged. We will also add a brief sentence in the statement of Theorem 2 making this standing assumption explicit. revision: yes
-
Referee: [§5.3] §5.3 (Burgers' equation example): The local-linear approximation is applied to a spatially discretized nonlinear PDE, yet the manuscript provides no quantitative bound on the approximation error induced by the spatial discretization or by the forgetting factor; this gap weakens the extension of the boundedness result from finite-dimensional ODEs to the reported PDE case.
Authors: We acknowledge that the Burgers' example is presented as a finite-dimensional discretization to which the ODE theory directly applies, without quantitative bounds on the discretization or forgetting-factor errors. In the revised manuscript we will add a short paragraph at the end of §5.3 that (i) states the spatial discretization error is controlled by the mesh size (with a reference to standard finite-difference convergence results) and (ii) reports additional numerical sensitivity plots showing that the closed-loop trajectories remain bounded for a range of forgetting factors. We will also insert a sentence clarifying that the boundedness guarantee is for the discretized system and that extension to the infinite-dimensional PDE remains an open question. revision: partial
Circularity Check
No significant circularity in derivation chain
full rationale
The paper derives its DMAC method from standard matrix RLS-with-forgetting-factor estimation of local linear dynamics directly from measurements, followed by separate online controller synthesis using full-state feedback plus integral action. Convergence of the estimator and closed-loop boundedness are established via independent theoretical analysis (not by re-using the estimator output as its own proof) and validated on external benchmarks including an unstable linear system, Van der Pol oscillator, and discretized Burgers' equation. No self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations appear; the central claims remain self-contained against the stated assumptions of local linearity and full-state availability.
Axiom & Free-Parameter Ledger
free parameters (1)
- forgetting factor
axioms (2)
- domain assumption The system admits a local linear approximation from input-output data
- domain assumption Full state feedback is available
Reference graph
Works this paper leans on
-
[1]
Åström, K. J. and Wittenmark, B., 1995, Adaptive Control, Addison-Wesley
work page 1995
-
[2]
Annaswamy, A. M. and Narendra, K. S., 1989, Stable adaptive systems, Prentice Hall Hoboken, NJ, USA
work page 1989
-
[3]
Robust and adaptive mot ion control,
Ioannou, P . A. and Sun, J., 2012, “Robust and adaptive mot ion control,”
work page 2012
-
[4]
Theory and applications of self-tuning regulators,
Åström, K. J., Borisson, U., Ljung, L., and Wittenmark, B ., 1977, “Theory and applications of self-tuning regulators,” Automatica, 13(5), pp. 457–476
work page 1977
-
[5]
Åström, K. J. and Wittenmark, B., 1973, “On self tuning re gulators,” Automat- ica, 9(2), pp. 185–199
work page 1973
-
[6]
Adaptive linear quadra tic gaussian control: the cost-biased approach revisited,
Campi, M. C. and Kumar, P ., 1998, “Adaptive linear quadra tic gaussian control: the cost-biased approach revisited,” SIAM Journal on Contr ol and Optimization, 36(6), pp. 1890–1907
work page 1998
-
[7]
Regret bo unds for the adaptive control of linear quadratic systems,
Abbasi- Y adkori, Y . and Szepesvári, C., 2011, “Regret bo unds for the adaptive control of linear quadratic systems,” Proceedings of the 24th annual conference on learning theory , JMLR Workshop and Conference Proceedings, pp. 1–26
work page 2011
-
[8]
Regret bounds for robust adaptive control of the linear quadratic regulator,
Dean, S., Mania, H., Matni, N., Recht, B., and Tu, S., 2018 , “Regret bounds for robust adaptive control of the linear quadratic regulator, ” Advances in Neural Information Processing Systems, 31, pp. 4188–4197
work page 2018
-
[9]
Certainty equiva lence is efficient for linear quadratic control,
Mania, H., Tu, S., and Recht, B., 2019, “Certainty equiva lence is efficient for linear quadratic control,” Advances in neural information processing systems, 32, pp. 10154–10164
work page 2019
-
[10]
Learning L inear-Quadratic Reg- ulators Efficiently with only √ /u1D447 Regret,
Cohen, A., Koren, T., and Mansour, Y ., 2019, “Learning L inear-Quadratic Reg- ulators Efficiently with only √ /u1D447 Regret,” International Conference on Machine Learning, PMLR, pp. 1300–1309
work page 2019
-
[11]
A tutorial and overview of retrospective cost adaptive control,
Rahman, Y ., Xie, A., Hoagg, J. B., and Bernstein, D. S., 2 016, “A tutorial and overview of retrospective cost adaptive control,” 2016 American Control Conference (ACC), IEEE, pp. 3386–3409
work page 2016
-
[12]
Rahman, Y ., Xie, A., and Bernstein, D. S., 2017, “Retros pective cost adap- tive control: Pole placement, frequency response, and conn ections with LQG control,” IEEE Control Systems Magazine, 37(5), pp. 28–69
work page 2017
-
[13]
Predictive Cost Adaptive Control of Shock-Boundary-Layer Interaction in a Scramjet Isolator,
Vander Schaaf, J. C., Acharya, R., and Bernstein, D., 20 25, “Predictive Cost Adaptive Control of Shock-Boundary-Layer Interaction in a Scramjet Isolator,” AIAA SCITECH 2025 Forum , p. 0673
work page 2025
-
[14]
Re inforcement learning with guarantees: a review,
Osinenko, P ., Dobriborsci, D., and Aumer, W ., 2022, “Re inforcement learning with guarantees: a review,” IFAC-PapersOnLine, 55(15), pp. 123–128
work page 2022
-
[15]
Wallace, B. A. and Si, J., 2024, “Continuous-time reinf orcement learning: New design algorithms with theoretical insights and performan ce guarantees,” IEEE Transactions on Neural Networks and Learning Systems, 36(4), pp. 6940–6954
work page 2024
-
[16]
Chen, C., Xie, L., Jiang, Y ., Xie, K., and Xie, S., 2022, “ Robust output regulation and reinforcement learning-based output tracking design f or unknown linear discrete-time systems,” IEEE Transactions on Automatic Co ntrol, 68(4), pp. 2391–2398
work page 2022
-
[17]
Dynamic mode decomposition of num erical and experi- mental data,
Schmid, P . J., 2010, “Dynamic mode decomposition of num erical and experi- mental data,” Journal of fluid mechanics, 656, pp. 5–28
work page 2010
-
[18]
Dynamic mode decomposition: Theory an d applications,
Tu, J. H., 2013, “Dynamic mode decomposition: Theory an d applications,” Ph.D. thesis, Princeton University
work page 2013
-
[19]
Dy namic mode decompo- sition with control,
Proctor, J. L., Brunton, S. L., and Kutz, J. N., 2016, “Dy namic mode decompo- sition with control,” SIAM Journal on Applied Dynamical Sys tems, 15(1), pp. 142–161
work page 2016
-
[20]
Linear predictors for no nlinear dynamical systems: Koopman operator meets model predictive control,
Korda, M. and Mezić, I., 2018, “Linear predictors for no nlinear dynamical systems: Koopman operator meets model predictive control, ” Automatica, 93, pp. 149–160
work page 2018
-
[21]
On- line dynamic mode decomposition for time-varying systems,
Zhang, H., Rowley, C. W ., Deem, E. A., and Cattafesta, L. N., 2019, “On- line dynamic mode decomposition for time-varying systems, ” SIAM Journal on Applied Dynamical Systems, 18(3), pp. 1586–1609
work page 2019
-
[22]
Goel, A., Bruce, A. L., and Bernstein, D. S., 2020, “Recu rsive least squares with variable-direction forgetting: Compensating for the loss of persistency [lecture notes],” IEEE Control Systems Magazine, 40(4), pp. 80–102
work page 2020
-
[23]
Persistence of exc itation criteria,
Y . Mareels, I. and Gevers, M., 1986, “Persistence of exc itation criteria,” Proc. Conf. Dec. Contr , pp. 1933–1935, doi: 10.1109/CDC.1986.267349
-
[24]
Persistency of exc i- tation criteria for linear, multivariable, time-varying s ystems,
Mareels, I. M. and Gevers, M., 1988, “Persistency of exc i- tation criteria for linear, multivariable, time-varying s ystems,” Mathematics of Control, Signals, and Systems , 1(3), pp. 203–226
work page 1988
-
[25]
Strang, G., 2022, Introduction to linear algebra , SIAM
work page 2022
-
[26]
C., and Glover, K., 1996, Robust and Optimal Control , Prentice Hall
Zhou, K., Doyle, J. C., and Glover, K., 1996, Robust and Optimal Control , Prentice Hall
work page 1996
-
[27]
Khalil, H. K. and Grizzle, J. W ., 2002, Nonlinear systems, Vol. 3, Prentice hall Upper Saddle River, NJ
work page 2002
-
[28]
Petersen, K. B., Pedersen, M. S., et al., 2008, “The matr ix cookbook,” Technical University of Denmark, 7(15), p. 510
work page 2008
-
[29]
J., 1996, Linear System Theory , Prentice Hall
Rugh, W . J., 1996, Linear System Theory , Prentice Hall
work page 1996
-
[30]
A generalization of the Sherman–Mor rison–Woodbury for- mula,
Deng, C. Y ., 2011, “A generalization of the Sherman–Mor rison–Woodbury for- mula,” Applied Mathematics Letters, 24(9), pp. 1561–1564
work page 2011
-
[31]
and Rodman, L., 1995, Algebraic riccati equations , Clarendon press
Lancaster, P . and Rodman, L., 1995, Algebraic riccati equations , Clarendon press
work page 1995
-
[32]
Matrix Ric- cati equations in control and systems theory,
Aboukandil, H., Freiling, G., Ionescu, V ., Jank, G., et al., 2003, “Matrix Ric- cati equations in control and systems theory,” IEEE Transac tions in Automatic Control, 49(10), pp. 2094–2095. Journal of Dynamic Systems, Measurement, and Control PREPRINT FOR REVIEW / 15 List of Figures 1 Dynamic Mode Adaptive Control (DMAC) architecture for mod el-free, da...
work page 2003
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.