pith. sign in

arxiv: 2604.08328 · v1 · submitted 2026-04-09 · 📡 eess.SY · cs.SY

Data-Driven Moving Horizon Estimators for Linear Systems with Sample Complexity Analysis

Pith reviewed 2026-05-10 17:16 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords data-driven estimationmoving horizon estimationlinear systemssample complexitystate estimationGaussian noisepersistent excitation
0
0 comments X

The pith

A data-driven moving horizon estimator keeps the expected state estimation error ultimately bounded for unknown linear systems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper designs a data-driven moving horizon estimator that solves an optimization problem built from both offline and online measurements to recover the state of a linear system whose matrices are unknown. It proves that the expected 2-norm of the resulting estimation error remains ultimately bounded and supplies an explicit upper bound in terms of the Gaussian noise covariances. Sample complexity analysis then shows that longer offline data records tighten this bound, while also quantifying how close the data-driven estimator comes to the performance of a conventional moving horizon estimator that knows the exact system matrices.

Core claim

By formulating and solving an optimization problem that incorporates both offline and online system data, a novel data-driven moving horizon estimator (DDMHE) is designed. The expected 2-norm of the estimation error is ultimately bounded, with an explicit relationship to the system noise covariances. Sample complexity analysis establishes how the length of the offline data affects the estimation error, and the performance gap to the traditional moving horizon estimator with known matrices is quantified.

What carries the argument

The data-driven moving horizon estimator (DDMHE), an optimization problem that replaces explicit system matrices with data matrices assembled from offline and online measurements to produce the state estimate.

If this is right

  • The expected 2-norm of the estimation error decreases as the length of the offline data increases, following the derived sample complexity bounds.
  • The performance difference between the data-driven estimator and the model-based moving horizon estimator is explicitly bounded by terms involving noise covariances and data quality.
  • The ultimate boundedness result continues to hold for any fixed noise level once the offline data length exceeds a threshold determined by the sample complexity analysis.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same data-driven construction could be tested on systems whose parameters drift slowly, provided fresh offline batches are collected periodically.
  • In closed-loop control settings the bounded-error guarantee may allow the estimator to be placed directly inside a feedback loop without a separate identification step.

Load-bearing premise

The offline data must be rich enough to satisfy persistent excitation so that the data-driven formulation can stand in for the unknown system dynamics.

What would settle it

Numerical or experimental trials in which the expected estimation error grows unbounded as the offline data record is lengthened under persistent excitation would contradict the claimed ultimate boundedness.

Figures

Figures reproduced from arXiv: 2604.08328 by Guanghui Wen, Jiabao He, Peihu Duan, Yuezu Lv.

Figure 1
Figure 1. Figure 1: The pre-collected state-input-output system trajectory, where yellow, [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: The true state trajectories and the corresponding estimates of the [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗
Figure 4
Figure 4. Figure 4: The values of MSE of 200 Monte Carlo trials using different estimation [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗
read the original abstract

This paper investigates the state estimation problem for linear systems subject to Gaussian noise, where the model parameters are unknown. By formulating and solving an optimization problem that incorporates both offline and online system data, a novel data-driven moving horizon estimator (DDMHE) is designed. We prove that the expected 2-norm of the estimation error of the proposed DDMHE is ultimately bounded. Further, we establish an explicit relationship between the system noise covariances and the estimation error of the proposed DDMHE. Moreover, through a sample complexity analysis, we show how the length of the offline data affects the estimation error of the proposed DDMHE. We also quantify the performance gap between the proposed DDMHE using noisy data and the traditional moving horizon estimator with known system matrices. Finally, the theoretical results are validated through numerical simulations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The paper proposes a data-driven moving horizon estimator (DDMHE) for linear systems with unknown parameters subject to Gaussian noise. It replaces unknown system matrices in the MHE optimization with Hankel matrices built from offline input-output trajectories, proves that the expected 2-norm of the estimation error is ultimately bounded with an explicit dependence on noise covariances, derives a sample-complexity result linking offline data length to the error bound, quantifies the performance gap to the model-based MHE, and validates the claims via numerical simulations.

Significance. If the central bounds hold, the work provides a useful advance in data-driven state estimation by supplying rigorous error guarantees and concrete data-length requirements. The sample-complexity analysis and explicit gap quantification are particular strengths, offering practical guidance for applications where system matrices are unavailable. These elements could influence adaptive control and identification tasks.

major comments (2)
  1. [Proof of the main bounded-error theorem] The proof of the ultimate boundedness result combines the standard MHE error recursion with a data-driven perturbation term controlled by offline data length. The step establishing that the expected error remains ultimately bounded requires verifying that the perturbation bound is uniform in time and that the expectation can be interchanged with the limit; this justification is not fully detailed and is load-bearing for the central claim.
  2. [Sample complexity section] The sample-complexity analysis supplies the minimal offline trajectory length to make the representation error smaller than a tolerance. However, the explicit dependence of the high-probability statement on both the data length and the noise variance is not stated in closed form, which affects how the ultimate error bound can be computed a priori.
minor comments (3)
  1. [DDMHE formulation] The definition and dimensions of the Hankel matrices should be recalled explicitly at the start of the DDMHE formulation section for readers unfamiliar with data-driven methods.
  2. [Numerical examples] The numerical simulations would be strengthened by a direct comparison of observed error decay rates against the predicted sample-complexity bound.
  3. [Introduction and references] A few recent references on noisy extensions of Willems' fundamental lemma would help contextualize the data-driven representation step.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive assessment and constructive comments on our manuscript. We address each major comment point by point below, providing clarifications where needed and indicating the revisions we will make to strengthen the presentation of the proofs and results.

read point-by-point responses
  1. Referee: [Proof of the main bounded-error theorem] The proof of the ultimate boundedness result combines the standard MHE error recursion with a data-driven perturbation term controlled by offline data length. The step establishing that the expected error remains ultimately bounded requires verifying that the perturbation bound is uniform in time and that the expectation can be interchanged with the limit; this justification is not fully detailed and is load-bearing for the central claim.

    Authors: We appreciate the referee pointing out the need for additional detail on this step. The perturbation term is generated by the fixed offline data matrices and is therefore independent of the online time index, yielding a uniform bound over all t. For the interchange, the error recursion satisfies a uniform contraction property with an integrable dominating function (arising from the Gaussian noise assumption), so the limsup of the expectation equals the expectation of the limsup by the dominated convergence theorem. In the revised manuscript we will insert a short lemma immediately after the main theorem that explicitly states these two facts and verifies the conditions. revision: yes

  2. Referee: [Sample complexity section] The sample-complexity analysis supplies the minimal offline trajectory length to make the representation error smaller than a tolerance. However, the explicit dependence of the high-probability statement on both the data length and the noise variance is not stated in closed form, which affects how the ultimate error bound can be computed a priori.

    Authors: We agree that an explicit closed-form expression would improve usability. The high-probability bound on the Hankel-matrix representation error is obtained via a matrix concentration inequality and takes the form P(‖E_N‖ > ε) ≤ 2 exp(−c N ε² / σ²), where N is the offline length, σ² is the noise variance, and c is a universal constant depending only on the system dimension. We will add this explicit expression to the sample-complexity theorem and substitute it directly into the ultimate error bound, thereby making the a-priori computation transparent. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected in the derivation chain.

full rationale

The central derivation replaces unknown system matrices with Hankel matrices constructed from offline input-output trajectories under an explicit persistent-excitation hypothesis, then obtains the ultimate bound on expected 2-norm error by adding a perturbation term to the standard MHE recursion and controlling the perturbation size via sample-complexity bounds on the data length. The performance gap to the known-matrix MHE is quantified by direct comparison of the two optimization problems and their respective error recursions. No equation reduces the claimed bound to a fitted parameter renamed as a prediction, no load-bearing premise rests solely on a self-citation, and the persistent-excitation and Gaussian-noise assumptions are stated as external hypotheses rather than derived from the result itself. The analysis is therefore self-contained against the stated assumptions and external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on standard assumptions for linear systems and Gaussian noise plus the existence of sufficiently informative offline data; no new entities are invented and no free parameters are explicitly fitted beyond typical horizon length choices.

axioms (2)
  • domain assumption The system is linear time-invariant with additive Gaussian noise.
    Stated in the abstract as the setting for the state estimation problem.
  • domain assumption Offline data is available and can be used to replace unknown system matrices in the estimator.
    Core to the data-driven formulation.

pith-pipeline@v0.9.0 · 5444 in / 1368 out tokens · 40327 ms · 2026-05-10T17:16:59.043962+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages

  1. [1]

    Nonlinear predictive control and moving horizon estimation—an intro- ductory overview,

    F. Allg ¨ower, T. A. Badgwell, J. S. Qin, J. B. Rawlings, and S. J. Wright, “Nonlinear predictive control and moving horizon estimation—an intro- ductory overview,”Advances in control: Highlights of ECC’99, pp. 391– 449, 1999

  2. [2]

    Remote estimation for energy harvesting systems under multiplicative noises: A binary encoding scheme with probabilistic bit flips,

    Q. Liu, Z. Wang, H. Dong, and C. Jiang, “Remote estimation for energy harvesting systems under multiplicative noises: A binary encoding scheme with probabilistic bit flips,” IEEE Transactions on Automatic Control, vol. 68, no. 1, pp. 343–354, 2022

  3. [3]

    Moving-horizon estimation for linear dynamic networks with binary encoding schemes,

    Q. Liu and Z. Wang, “Moving-horizon estimation for linear dynamic networks with binary encoding schemes,” IEEE Transactions on Auto- matic Control, vol. 66, no. 4, pp. 1763–1770, 2020

  4. [4]

    Constrained linear state estimation—a moving horizon approach,

    C. V . Rao, J. B. Rawlings, and J. H. Lee, “Constrained linear state estimation—a moving horizon approach,” Automatica, vol. 37, no. 10, pp. 1619–1628, 2001

  5. [5]

    Robust stability of moving horizon estimation under bounded disturbances,

    L. Ji, J. B. Rawlings, W. Hu, A. Wynn, and M. Diehl, “Robust stability of moving horizon estimation under bounded disturbances,” IEEE Transactions on Automatic Control , vol. 61, no. 11, pp. 3509– 3514, 2015

  6. [6]

    Nonlinear moving horizon estimation in the presence of bounded disturbances,

    M. A. M ¨uller, “Nonlinear moving horizon estimation in the presence of bounded disturbances,” Automatica, vol. 79, pp. 306–314, 2017

  7. [7]

    Linear moving horizon estimation with pre-estimating observer,

    D. Sui, T. A. Johansen, and L. Feng, “Linear moving horizon estimation with pre-estimating observer,” IEEE Transactions on Automatic Control, vol. 55, no. 10, pp. 2363–2368, 2010. 11

  8. [8]

    Moving horizon estimation with unknown inputs under dynamic quantization effects,

    L. Zou, Z. Wang, J. Hu, and D. Zhou, “Moving horizon estimation with unknown inputs under dynamic quantization effects,” IEEE Transactions on Automatic Control , vol. 65, no. 12, pp. 5368–5375, 2020

  9. [9]

    Finite-horizon variance- constrained estimation for complex networks subject to dynamical bias using binary encoding schemes,

    W. Li, N. Hou, F. Yang, X. Bu, and L. Sun, “Finite-horizon variance- constrained estimation for complex networks subject to dynamical bias using binary encoding schemes,” IEEE Access , vol. 11, pp. 142589– 142600, 2023

  10. [10]

    Sample complexity of Kalman filtering for unknown systems,

    A. Tsiamis, N. Matni, and G. Pappas, “Sample complexity of Kalman filtering for unknown systems,” in Learning for Dynamics and Control , pp. 435–444, 2020

  11. [11]

    Ljung, System Identification: Theory for the User

    L. Ljung, System Identification: Theory for the User . Upper Saddle River, USA: PTR Prentice Hall, 1999

  12. [12]

    Kalmannet: Neural network aided Kalman filtering for partially known dynamics,

    G. Revach, N. Shlezinger, X. Ni, A. L. Escoriza, R. J. G. van Sloun, and Y . C. Eldar, “Kalmannet: Neural network aided Kalman filtering for partially known dynamics,” IEEE Transactions on Signal Processing , vol. 70, pp. 1532–1547, 2022

  13. [13]

    Learning robust data-based LQG controllers from noisy data,

    W. Liu, J. Sun, G. Wang, F. Bullo, and J. Chen, “Learning robust data-based LQG controllers from noisy data,” IEEE Transactions on Automatic Control, 2024. DOI: 10.1109/TAC.2024.3409749

  14. [14]

    Data- driven set-based estimation using matrix zonotopes with set containment guarantees,

    A. Alanwar, A. Berndt, K. H. Johansson, and H. Sandberg, “Data- driven set-based estimation using matrix zonotopes with set containment guarantees,” in European Control Conference, pp. 875–881, 2022

  15. [15]

    Data-driven state estimation for linear systems,

    V . K. Mishra, S. A. Hiremath, and N. Bajcinca, “Data-driven state estimation for linear systems,” in 2024 European Control Conference , pp. 906–913, IEEE, 2024

  16. [16]

    Wasserstein distributionally robust Kalman filtering,

    S. Shafieezadeh Abadeh, V . A. Nguyen, D. Kuhn, and P. M. Moha- jerin Esfahani, “Wasserstein distributionally robust Kalman filtering,” Advances in Neural Information Processing Systems , vol. 31, 2018

  17. [17]

    On the relationship between data-enabled predictive control and subspace predictive control,

    F. Fiedler and S. Lucia, “On the relationship between data-enabled predictive control and subspace predictive control,” in 2021 European Control Conference, pp. 222–229, IEEE, 2021

  18. [18]

    A note on persistency of excitation,

    J. C. Willems, P. Rapisarda, I. Markovsky, and B. L. De Moor, “A note on persistency of excitation,” Systems & Control Letters , vol. 54, no. 4, pp. 325–329, 2005

  19. [19]

    Moving- horizon state estimation for nonlinear systems using neural networks,

    A. Alessandri, M. Baglietto, G. Battistelli, and M. Gaggero, “Moving- horizon state estimation for nonlinear systems using neural networks,” IEEE Transactions on Neural Networks , vol. 22, no. 5, pp. 768–780, 2011

  20. [20]

    Physics-informed online learning of gray-box models by moving hori- zon estimation,

    K. F. Løwenstein, D. Bernardini, L. Fagiano, and A. Bemporad, “Physics-informed online learning of gray-box models by moving hori- zon estimation,” European Journal of Control, vol. 74, p. 100861, 2023

  21. [21]

    Neural moving horizon estimation for robust flight control,

    B. Wang, Z. Ma, S. Lai, and L. Zhao, “Neural moving horizon estimation for robust flight control,” IEEE Transactions on Robotics , 2023

  22. [22]

    Robust data-driven moving horizon estimation for linear discrete-time systems,

    T. M. Wolff, V . G. Lopez, and M. A. M¨uller, “Robust data-driven moving horizon estimation for linear discrete-time systems,” IEEE Transactions on Automatic Control , vol. 69, no. 8, pp. 5598–5604, 2024

  23. [23]

    Data-driven moving hori- zon estimation for angular velocity of space noncooperative tar- get in eddy current de-tumbling mission,

    X. Liu, H. Chang, Z. Lu, and P. Huang, “Data-driven moving hori- zon estimation for angular velocity of space noncooperative tar- get in eddy current de-tumbling mission,” 2023. [Online]. Available: https://arxiv.org/abs/2301.05351

  24. [24]

    A real- time algorithm for moving horizon state and parameter estimation,

    P. K ¨uhl, M. Diehl, T. Kraus, J. P. Schl ¨oder, and H. G. Bock, “A real- time algorithm for moving horizon state and parameter estimation,” Computers & chemical engineering , vol. 35, no. 1, pp. 71–83, 2011

  25. [25]

    Min-max moving- horizon estimation for uncertain discrete-time linear systems,

    A. Alessandri, M. Baglietto, and G. Battistelli, “Min-max moving- horizon estimation for uncertain discrete-time linear systems,” SIAM Journal on Control and Optimization , vol. 50, no. 3, pp. 1439–1465, 2012

  26. [26]

    Learning-based moving horizon estimation through differentiable convex optimization layers,

    S. Muntwiler, K. P. Wabersich, and M. N. Zeilinger, “Learning-based moving horizon estimation through differentiable convex optimization layers,” in Learning for Dynamics and Control Conference , pp. 153– 165, PMLR, 2022

  27. [27]

    Bequette, Process Control: Modeling, Design, and Simulation

    B. Bequette, Process Control: Modeling, Design, and Simulation . New Jersey: Prentice Hall Professional, 2003

  28. [28]

    Moving horizon estimation for networked systems with quantized measurements and packet dropouts,

    A. Liu, L. Yu, W.-A. Zhang, and M. Z. Chen, “Moving horizon estimation for networked systems with quantized measurements and packet dropouts,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 60, no. 7, pp. 1823–1834, 2013

  29. [29]

    Maximum likelihood estimation in data-driven modeling and control,

    M. Yin, A. Iannelli, and R. S. Smith, “Maximum likelihood estimation in data-driven modeling and control,” IEEE Transactions on Automatic Control, vol. 68, no. 1, pp. 317–328, 2021

  30. [30]

    A stabilizing model-based predictive control algorithm for nonlinear systems,

    L. Magni, G. De Nicolao, L. Magnani, and R. Scattolini, “A stabilizing model-based predictive control algorithm for nonlinear systems,” Auto- matica, vol. 37, no. 9, pp. 1351–1362, 2001

  31. [31]

    Robust Kalman filtering for uncertain discrete-time systems,

    L. Xie, Y . C. Soh, and C. E. De Souza, “Robust Kalman filtering for uncertain discrete-time systems,” IEEE Transactions on Automatic Control, vol. 39, no. 6, pp. 1310–1314, 1994

  32. [32]

    Robust minimum variance filtering,

    U. Shaked and C. E. de Souza, “Robust minimum variance filtering,” IEEE Transactions on Signal Processing, vol. 43, no. 11, pp. 2474–2483, 2002

  33. [33]

    Robust estimation of uncertain linear systems using a moving horizon approach,

    J. F. Garc ´ıa Tirado, “Robust estimation of uncertain linear systems using a moving horizon approach,” Dissertation, 2012

  34. [34]

    Receding-horizon estimation for discrete-time linear systems,

    A. Alessandri, M. Baglietto, and G. Battistelli, “Receding-horizon estimation for discrete-time linear systems,” IEEE Transactions on Automatic Control, vol. 48, no. 3, pp. 473–478, 2003

  35. [35]

    Moving horizon estimation meets multi-sensor information fusion: Development, opportunities and challenges,

    L. Zou, Z. Wang, J. Hu, and Q.-L. Han, “Moving horizon estimation meets multi-sensor information fusion: Development, opportunities and challenges,” Information Fusion, vol. 60, pp. 1–10, 2020

  36. [36]

    Distributed optimization and statistical learning via the alternating direction method of multipliers,

    P. Neal, C. Eric, P. Borja, and E. Jonathan, “Distributed optimization and statistical learning via the alternating direction method of multipliers,” Foundations and Trends® in Machine learning, vol. 3, no. 1, pp. 1–122, 2011

  37. [37]

    Vershynin, High-Dimensional Probability: An Introduction with Ap- plications in Data Science

    R. Vershynin, High-Dimensional Probability: An Introduction with Ap- plications in Data Science. Cambridge UK: Cambridge University Press, 2018

  38. [38]

    Adaptive human–robot interaction control for robots driven by series elastic actuators,

    X. Li, Y . Pan, G. Chen, and H. Yu, “Adaptive human–robot interaction control for robots driven by series elastic actuators,” IEEE Transactions on Robotics, vol. 33, no. 1, pp. 169–182, 2016

  39. [39]

    Robust data-driven Kalman filtering for unknown linear systems using maximum likelihood optimization,

    P. Duan, T. Liu, Y . Xing, and K. H. Johansson, “Robust data-driven Kalman filtering for unknown linear systems using maximum likelihood optimization,” Automatica, vol. 180, p. 112474, 2025

  40. [40]

    On the sample com- plexity of the linear quadratic regulator,

    S. Dean, H. Mania, N. Matni, B. Recht, and S. Tu, “On the sample com- plexity of the linear quadratic regulator,” Foundations of Computational Mathematics, vol. 20, no. 4, pp. 633–679, 2020