pith. sign in

arxiv: 2604.26682 · v1 · submitted 2026-04-29 · 📡 eess.SY · cs.SY· math.OC

Model-Free Dynamic Mode Adaptive Control for Data-Driven Control Synthesis

Pith reviewed 2026-05-07 11:46 UTC · model grok-4.3

classification 📡 eess.SY cs.SYmath.OC
keywords model-free controldata-driven controladaptive controlrecursive least squaresdynamic mode approximationstabilizing controlonline identification
0
0 comments X

The pith

A model-free controller called DMAC approximates local linear dynamics from data and synthesizes a stabilizing feedback law online.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper proposes dynamic mode adaptive control (DMAC) as a way to design controllers for systems where no mathematical model is available or reliable. The method uses measured data to recursively estimate a local linear approximation of the dynamics and then computes a full-state feedback controller with integral action. Theoretical results establish that the dynamics estimate converges and the closed-loop system remains bounded. Demonstrations on an unstable linear system, the Van der Pol oscillator, and Burgers' equation illustrate its use, with sensitivity analysis showing robustness to parameter variations. If successful, this would allow control design directly from experiments without prior modeling effort.

Core claim

DMAC consists of a dynamics-approximation module that estimates local linear representations using matrix recursive least-squares with a forgetting factor, and a controller-synthesis module that computes an online stabilizing controller. The approach establishes convergence of the recursive approximation and boundedness of the closed-loop system under the DMAC controller.

What carries the argument

The matrix recursive least-squares algorithm with forgetting factor that estimates the local linear dynamics from measurements, which then feeds into the stabilizing controller design.

If this is right

  • The method works for representative systems including unstable linear dynamics, nonlinear oscillators, and fluid models like Burgers' equation.
  • Robustness holds with respect to algorithm hyperparameters and variations in system parameters.
  • The controller provides stabilization using full state feedback and integral action based on the data-driven estimate.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • DMAC may enable real-time adaptation in environments where system parameters drift over time due to the forgetting factor in the estimator.
  • Similar data-driven approaches could be extended to output-feedback cases when full states are not measurable.
  • Applications in robotics or chemical process control could benefit from reduced reliance on first-principles models.

Load-bearing premise

The system dynamics can be adequately represented by a local linear model estimated from data, with full state measurements available for feedback.

What would settle it

Observing unbounded growth in the system states under the DMAC controller in a scenario where the data is sufficient to estimate the dynamics would falsify the boundedness guarantee.

read the original abstract

This paper presents a model-free, data-driven control synthesis method called dynamic mode adaptive control (DMAC) for systems whose mathematical models are unavailable or unsuitable for classical control design. The proposed approach combines data-driven dynamics approximation with adaptive control synthesis to enable online controller design using measured system data. DMAC comprises two main components: a dynamics-approximation module and a controller-synthesis module. The dynamics approximation module estimates a local linear representation of the system dynamics directly from measurements using a matrix recursive least-squares algorithm with a forgetting factor. The estimated dynamics are then used to compute an online stabilizing controller with full-state feedback and integral action. Theoretical analysis establishes convergence properties of the recursive dynamics approximation and boundedness of the closed-loop system under the DMAC controller. The performance of the proposed method is demonstrated through numerical examples involving representative dynamical systems, including an unstable linear system, the Van der Pol oscillator, and the Burgers' equation. Sensitivity studies further demonstrate the robustness of DMAC with respect to both algorithm hyperparameters and variations in system parameters.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript introduces Dynamic Mode Adaptive Control (DMAC), a model-free data-driven method that employs a matrix recursive least-squares estimator with forgetting factor to obtain local linear approximations of system dynamics directly from measurements. These estimates are used to synthesize an online stabilizing controller via full-state feedback augmented with integral action. Theoretical analysis is provided for convergence of the estimator and boundedness of the closed-loop trajectories. The approach is illustrated on an unstable linear system, the Van der Pol oscillator, and a discretized Burgers' equation, accompanied by sensitivity studies on hyperparameters and system parameters.

Significance. If the convergence and boundedness results hold, the work provides a useful addition to data-driven control by delivering an adaptive, online synthesis procedure with explicit stability guarantees that does not require an a priori model. The combination of RLS-with-forgetting and integral-action feedback is practically relevant for systems where dynamics are only partially known or slowly time-varying, and the numerical examples across linear and nonlinear cases support broader applicability.

major comments (2)
  1. [§4] §4, Theorem 1: The convergence proof for the matrix RLS estimator invokes a persistent-excitation condition on the regressor; however, the subsequent boundedness argument for the closed-loop system (Theorem 2) does not explicitly verify that the adaptive controller maintains this excitation once the loop is closed, which is load-bearing for the global stability claim.
  2. [§5.3] §5.3 (Burgers' equation example): The local-linear approximation is applied to a spatially discretized nonlinear PDE, yet the manuscript provides no quantitative bound on the approximation error induced by the spatial discretization or by the forgetting factor; this gap weakens the extension of the boundedness result from finite-dimensional ODEs to the reported PDE case.
minor comments (3)
  1. [§3.1] The notation for the estimated state matrix Â(k) and input matrix B̂(k) is introduced without an explicit statement of their dimensions relative to the original system order; adding this clarification in §3.1 would improve readability.
  2. [Figure 4] Figure 4 (sensitivity plots) uses different vertical scales across subplots without comment; uniform scaling or explicit mention of the plotted quantities would aid interpretation.
  3. [§3.2] The abstract states that 'full state measurements are available,' but the controller diagram in §3.2 does not label the measurement noise or sensor dynamics; a brief remark on this modeling choice would be helpful.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive evaluation and the detailed comments, which help clarify the scope and limitations of the stability results. We address each major comment below and will revise the manuscript to strengthen the presentation of the theoretical claims.

read point-by-point responses
  1. Referee: [§4] §4, Theorem 1: The convergence proof for the matrix RLS estimator invokes a persistent-excitation condition on the regressor; however, the subsequent boundedness argument for the closed-loop system (Theorem 2) does not explicitly verify that the adaptive controller maintains this excitation once the loop is closed, which is load-bearing for the global stability claim.

    Authors: We agree that the link between the PE condition in Theorem 1 and the closed-loop boundedness in Theorem 2 requires explicit attention. The current proof of Theorem 2 assumes that the regressor remains persistently exciting under the adaptive feedback law. In the revision we will insert a remark immediately after Theorem 2 that (i) recalls the role of the integral action in generating a non-zero reference-tracking error that sustains excitation and (ii) notes that the forgetting factor prevents the covariance matrix from collapsing, thereby preserving the PE property once the estimator has converged. We will also add a brief sentence in the statement of Theorem 2 making this standing assumption explicit. revision: yes

  2. Referee: [§5.3] §5.3 (Burgers' equation example): The local-linear approximation is applied to a spatially discretized nonlinear PDE, yet the manuscript provides no quantitative bound on the approximation error induced by the spatial discretization or by the forgetting factor; this gap weakens the extension of the boundedness result from finite-dimensional ODEs to the reported PDE case.

    Authors: We acknowledge that the Burgers' example is presented as a finite-dimensional discretization to which the ODE theory directly applies, without quantitative bounds on the discretization or forgetting-factor errors. In the revised manuscript we will add a short paragraph at the end of §5.3 that (i) states the spatial discretization error is controlled by the mesh size (with a reference to standard finite-difference convergence results) and (ii) reports additional numerical sensitivity plots showing that the closed-loop trajectories remain bounded for a range of forgetting factors. We will also insert a sentence clarifying that the boundedness guarantee is for the discretized system and that extension to the infinite-dimensional PDE remains an open question. revision: partial

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper derives its DMAC method from standard matrix RLS-with-forgetting-factor estimation of local linear dynamics directly from measurements, followed by separate online controller synthesis using full-state feedback plus integral action. Convergence of the estimator and closed-loop boundedness are established via independent theoretical analysis (not by re-using the estimator output as its own proof) and validated on external benchmarks including an unstable linear system, Van der Pol oscillator, and discretized Burgers' equation. No self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations appear; the central claims remain self-contained against the stated assumptions of local linearity and full-state availability.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The central claim depends on the convergence of the matrix RLS algorithm and the stability properties of the synthesized controller. Since only the abstract is available, additional free parameters or assumptions in the full paper may exist.

free parameters (1)
  • forgetting factor
    A parameter in the recursive least-squares algorithm that controls the weight on past data; its specific value or selection method is not detailed in the abstract.
axioms (2)
  • domain assumption The system admits a local linear approximation from input-output data
    The dynamics-approximation module relies on estimating a local linear representation directly from measurements.
  • domain assumption Full state feedback is available
    The controller uses full-state feedback and integral action.

pith-pipeline@v0.9.0 · 5480 in / 1311 out tokens · 67822 ms · 2026-05-07T11:46:19.940771+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages

  1. [1]

    Åström, K. J. and Wittenmark, B., 1995, Adaptive Control, Addison-Wesley

  2. [2]

    Annaswamy, A. M. and Narendra, K. S., 1989, Stable adaptive systems, Prentice Hall Hoboken, NJ, USA

  3. [3]

    Robust and adaptive mot ion control,

    Ioannou, P . A. and Sun, J., 2012, “Robust and adaptive mot ion control,”

  4. [4]

    Theory and applications of self-tuning regulators,

    Åström, K. J., Borisson, U., Ljung, L., and Wittenmark, B ., 1977, “Theory and applications of self-tuning regulators,” Automatica, 13(5), pp. 457–476

  5. [5]

    On self tuning re gulators,

    Åström, K. J. and Wittenmark, B., 1973, “On self tuning re gulators,” Automat- ica, 9(2), pp. 185–199

  6. [6]

    Adaptive linear quadra tic gaussian control: the cost-biased approach revisited,

    Campi, M. C. and Kumar, P ., 1998, “Adaptive linear quadra tic gaussian control: the cost-biased approach revisited,” SIAM Journal on Contr ol and Optimization, 36(6), pp. 1890–1907

  7. [7]

    Regret bo unds for the adaptive control of linear quadratic systems,

    Abbasi- Y adkori, Y . and Szepesvári, C., 2011, “Regret bo unds for the adaptive control of linear quadratic systems,” Proceedings of the 24th annual conference on learning theory , JMLR Workshop and Conference Proceedings, pp. 1–26

  8. [8]

    Regret bounds for robust adaptive control of the linear quadratic regulator,

    Dean, S., Mania, H., Matni, N., Recht, B., and Tu, S., 2018 , “Regret bounds for robust adaptive control of the linear quadratic regulator, ” Advances in Neural Information Processing Systems, 31, pp. 4188–4197

  9. [9]

    Certainty equiva lence is efficient for linear quadratic control,

    Mania, H., Tu, S., and Recht, B., 2019, “Certainty equiva lence is efficient for linear quadratic control,” Advances in neural information processing systems, 32, pp. 10154–10164

  10. [10]

    Learning L inear-Quadratic Reg- ulators Efficiently with only √ /u1D447 Regret,

    Cohen, A., Koren, T., and Mansour, Y ., 2019, “Learning L inear-Quadratic Reg- ulators Efficiently with only √ /u1D447 Regret,” International Conference on Machine Learning, PMLR, pp. 1300–1309

  11. [11]

    A tutorial and overview of retrospective cost adaptive control,

    Rahman, Y ., Xie, A., Hoagg, J. B., and Bernstein, D. S., 2 016, “A tutorial and overview of retrospective cost adaptive control,” 2016 American Control Conference (ACC), IEEE, pp. 3386–3409

  12. [12]

    Retros pective cost adap- tive control: Pole placement, frequency response, and conn ections with LQG control,

    Rahman, Y ., Xie, A., and Bernstein, D. S., 2017, “Retros pective cost adap- tive control: Pole placement, frequency response, and conn ections with LQG control,” IEEE Control Systems Magazine, 37(5), pp. 28–69

  13. [13]

    Predictive Cost Adaptive Control of Shock-Boundary-Layer Interaction in a Scramjet Isolator,

    Vander Schaaf, J. C., Acharya, R., and Bernstein, D., 20 25, “Predictive Cost Adaptive Control of Shock-Boundary-Layer Interaction in a Scramjet Isolator,” AIAA SCITECH 2025 Forum , p. 0673

  14. [14]

    Re inforcement learning with guarantees: a review,

    Osinenko, P ., Dobriborsci, D., and Aumer, W ., 2022, “Re inforcement learning with guarantees: a review,” IFAC-PapersOnLine, 55(15), pp. 123–128

  15. [15]

    Continuous-time reinf orcement learning: New design algorithms with theoretical insights and performan ce guarantees,

    Wallace, B. A. and Si, J., 2024, “Continuous-time reinf orcement learning: New design algorithms with theoretical insights and performan ce guarantees,” IEEE Transactions on Neural Networks and Learning Systems, 36(4), pp. 6940–6954

  16. [16]

    Robust output regulation and reinforcement learning-based output tracking design f or unknown linear discrete-time systems,

    Chen, C., Xie, L., Jiang, Y ., Xie, K., and Xie, S., 2022, “ Robust output regulation and reinforcement learning-based output tracking design f or unknown linear discrete-time systems,” IEEE Transactions on Automatic Co ntrol, 68(4), pp. 2391–2398

  17. [17]

    Dynamic mode decomposition of num erical and experi- mental data,

    Schmid, P . J., 2010, “Dynamic mode decomposition of num erical and experi- mental data,” Journal of fluid mechanics, 656, pp. 5–28

  18. [18]

    Dynamic mode decomposition: Theory an d applications,

    Tu, J. H., 2013, “Dynamic mode decomposition: Theory an d applications,” Ph.D. thesis, Princeton University

  19. [19]

    Dy namic mode decompo- sition with control,

    Proctor, J. L., Brunton, S. L., and Kutz, J. N., 2016, “Dy namic mode decompo- sition with control,” SIAM Journal on Applied Dynamical Sys tems, 15(1), pp. 142–161

  20. [20]

    Linear predictors for no nlinear dynamical systems: Koopman operator meets model predictive control,

    Korda, M. and Mezić, I., 2018, “Linear predictors for no nlinear dynamical systems: Koopman operator meets model predictive control, ” Automatica, 93, pp. 149–160

  21. [21]

    On- line dynamic mode decomposition for time-varying systems,

    Zhang, H., Rowley, C. W ., Deem, E. A., and Cattafesta, L. N., 2019, “On- line dynamic mode decomposition for time-varying systems, ” SIAM Journal on Applied Dynamical Systems, 18(3), pp. 1586–1609

  22. [22]

    Recu rsive least squares with variable-direction forgetting: Compensating for the loss of persistency [lecture notes],

    Goel, A., Bruce, A. L., and Bernstein, D. S., 2020, “Recu rsive least squares with variable-direction forgetting: Compensating for the loss of persistency [lecture notes],” IEEE Control Systems Magazine, 40(4), pp. 80–102

  23. [23]

    Persistence of exc itation criteria,

    Y . Mareels, I. and Gevers, M., 1986, “Persistence of exc itation criteria,” Proc. Conf. Dec. Contr , pp. 1933–1935, doi: 10.1109/CDC.1986.267349

  24. [24]

    Persistency of exc i- tation criteria for linear, multivariable, time-varying s ystems,

    Mareels, I. M. and Gevers, M., 1988, “Persistency of exc i- tation criteria for linear, multivariable, time-varying s ystems,” Mathematics of Control, Signals, and Systems , 1(3), pp. 203–226

  25. [25]

    Strang, G., 2022, Introduction to linear algebra , SIAM

  26. [26]

    C., and Glover, K., 1996, Robust and Optimal Control , Prentice Hall

    Zhou, K., Doyle, J. C., and Glover, K., 1996, Robust and Optimal Control , Prentice Hall

  27. [27]

    Khalil, H. K. and Grizzle, J. W ., 2002, Nonlinear systems, Vol. 3, Prentice hall Upper Saddle River, NJ

  28. [28]

    The matr ix cookbook,

    Petersen, K. B., Pedersen, M. S., et al., 2008, “The matr ix cookbook,” Technical University of Denmark, 7(15), p. 510

  29. [29]

    J., 1996, Linear System Theory , Prentice Hall

    Rugh, W . J., 1996, Linear System Theory , Prentice Hall

  30. [30]

    A generalization of the Sherman–Mor rison–Woodbury for- mula,

    Deng, C. Y ., 2011, “A generalization of the Sherman–Mor rison–Woodbury for- mula,” Applied Mathematics Letters, 24(9), pp. 1561–1564

  31. [31]

    and Rodman, L., 1995, Algebraic riccati equations , Clarendon press

    Lancaster, P . and Rodman, L., 1995, Algebraic riccati equations , Clarendon press

  32. [32]

    Matrix Ric- cati equations in control and systems theory,

    Aboukandil, H., Freiling, G., Ionescu, V ., Jank, G., et al., 2003, “Matrix Ric- cati equations in control and systems theory,” IEEE Transac tions in Automatic Control, 49(10), pp. 2094–2095. Journal of Dynamic Systems, Measurement, and Control PREPRINT FOR REVIEW / 15 List of Figures 1 Dynamic Mode Adaptive Control (DMAC) architecture for mod el-free, da...