Data-Driven Moving Horizon Estimators for Linear Systems with Sample Complexity Analysis
Pith reviewed 2026-05-10 17:16 UTC · model grok-4.3
The pith
A data-driven moving horizon estimator keeps the expected state estimation error ultimately bounded for unknown linear systems.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By formulating and solving an optimization problem that incorporates both offline and online system data, a novel data-driven moving horizon estimator (DDMHE) is designed. The expected 2-norm of the estimation error is ultimately bounded, with an explicit relationship to the system noise covariances. Sample complexity analysis establishes how the length of the offline data affects the estimation error, and the performance gap to the traditional moving horizon estimator with known matrices is quantified.
What carries the argument
The data-driven moving horizon estimator (DDMHE), an optimization problem that replaces explicit system matrices with data matrices assembled from offline and online measurements to produce the state estimate.
If this is right
- The expected 2-norm of the estimation error decreases as the length of the offline data increases, following the derived sample complexity bounds.
- The performance difference between the data-driven estimator and the model-based moving horizon estimator is explicitly bounded by terms involving noise covariances and data quality.
- The ultimate boundedness result continues to hold for any fixed noise level once the offline data length exceeds a threshold determined by the sample complexity analysis.
Where Pith is reading between the lines
- The same data-driven construction could be tested on systems whose parameters drift slowly, provided fresh offline batches are collected periodically.
- In closed-loop control settings the bounded-error guarantee may allow the estimator to be placed directly inside a feedback loop without a separate identification step.
Load-bearing premise
The offline data must be rich enough to satisfy persistent excitation so that the data-driven formulation can stand in for the unknown system dynamics.
What would settle it
Numerical or experimental trials in which the expected estimation error grows unbounded as the offline data record is lengthened under persistent excitation would contradict the claimed ultimate boundedness.
Figures
read the original abstract
This paper investigates the state estimation problem for linear systems subject to Gaussian noise, where the model parameters are unknown. By formulating and solving an optimization problem that incorporates both offline and online system data, a novel data-driven moving horizon estimator (DDMHE) is designed. We prove that the expected 2-norm of the estimation error of the proposed DDMHE is ultimately bounded. Further, we establish an explicit relationship between the system noise covariances and the estimation error of the proposed DDMHE. Moreover, through a sample complexity analysis, we show how the length of the offline data affects the estimation error of the proposed DDMHE. We also quantify the performance gap between the proposed DDMHE using noisy data and the traditional moving horizon estimator with known system matrices. Finally, the theoretical results are validated through numerical simulations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a data-driven moving horizon estimator (DDMHE) for linear systems with unknown parameters subject to Gaussian noise. It replaces unknown system matrices in the MHE optimization with Hankel matrices built from offline input-output trajectories, proves that the expected 2-norm of the estimation error is ultimately bounded with an explicit dependence on noise covariances, derives a sample-complexity result linking offline data length to the error bound, quantifies the performance gap to the model-based MHE, and validates the claims via numerical simulations.
Significance. If the central bounds hold, the work provides a useful advance in data-driven state estimation by supplying rigorous error guarantees and concrete data-length requirements. The sample-complexity analysis and explicit gap quantification are particular strengths, offering practical guidance for applications where system matrices are unavailable. These elements could influence adaptive control and identification tasks.
major comments (2)
- [Proof of the main bounded-error theorem] The proof of the ultimate boundedness result combines the standard MHE error recursion with a data-driven perturbation term controlled by offline data length. The step establishing that the expected error remains ultimately bounded requires verifying that the perturbation bound is uniform in time and that the expectation can be interchanged with the limit; this justification is not fully detailed and is load-bearing for the central claim.
- [Sample complexity section] The sample-complexity analysis supplies the minimal offline trajectory length to make the representation error smaller than a tolerance. However, the explicit dependence of the high-probability statement on both the data length and the noise variance is not stated in closed form, which affects how the ultimate error bound can be computed a priori.
minor comments (3)
- [DDMHE formulation] The definition and dimensions of the Hankel matrices should be recalled explicitly at the start of the DDMHE formulation section for readers unfamiliar with data-driven methods.
- [Numerical examples] The numerical simulations would be strengthened by a direct comparison of observed error decay rates against the predicted sample-complexity bound.
- [Introduction and references] A few recent references on noisy extensions of Willems' fundamental lemma would help contextualize the data-driven representation step.
Simulated Author's Rebuttal
We thank the referee for the positive assessment and constructive comments on our manuscript. We address each major comment point by point below, providing clarifications where needed and indicating the revisions we will make to strengthen the presentation of the proofs and results.
read point-by-point responses
-
Referee: [Proof of the main bounded-error theorem] The proof of the ultimate boundedness result combines the standard MHE error recursion with a data-driven perturbation term controlled by offline data length. The step establishing that the expected error remains ultimately bounded requires verifying that the perturbation bound is uniform in time and that the expectation can be interchanged with the limit; this justification is not fully detailed and is load-bearing for the central claim.
Authors: We appreciate the referee pointing out the need for additional detail on this step. The perturbation term is generated by the fixed offline data matrices and is therefore independent of the online time index, yielding a uniform bound over all t. For the interchange, the error recursion satisfies a uniform contraction property with an integrable dominating function (arising from the Gaussian noise assumption), so the limsup of the expectation equals the expectation of the limsup by the dominated convergence theorem. In the revised manuscript we will insert a short lemma immediately after the main theorem that explicitly states these two facts and verifies the conditions. revision: yes
-
Referee: [Sample complexity section] The sample-complexity analysis supplies the minimal offline trajectory length to make the representation error smaller than a tolerance. However, the explicit dependence of the high-probability statement on both the data length and the noise variance is not stated in closed form, which affects how the ultimate error bound can be computed a priori.
Authors: We agree that an explicit closed-form expression would improve usability. The high-probability bound on the Hankel-matrix representation error is obtained via a matrix concentration inequality and takes the form P(‖E_N‖ > ε) ≤ 2 exp(−c N ε² / σ²), where N is the offline length, σ² is the noise variance, and c is a universal constant depending only on the system dimension. We will add this explicit expression to the sample-complexity theorem and substitute it directly into the ultimate error bound, thereby making the a-priori computation transparent. revision: yes
Circularity Check
No significant circularity detected in the derivation chain.
full rationale
The central derivation replaces unknown system matrices with Hankel matrices constructed from offline input-output trajectories under an explicit persistent-excitation hypothesis, then obtains the ultimate bound on expected 2-norm error by adding a perturbation term to the standard MHE recursion and controlling the perturbation size via sample-complexity bounds on the data length. The performance gap to the known-matrix MHE is quantified by direct comparison of the two optimization problems and their respective error recursions. No equation reduces the claimed bound to a fitted parameter renamed as a prediction, no load-bearing premise rests solely on a self-citation, and the persistent-excitation and Gaussian-noise assumptions are stated as external hypotheses rather than derived from the result itself. The analysis is therefore self-contained against the stated assumptions and external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The system is linear time-invariant with additive Gaussian noise.
- domain assumption Offline data is available and can be used to replace unknown system matrices in the estimator.
Reference graph
Works this paper leans on
-
[1]
Nonlinear predictive control and moving horizon estimation—an intro- ductory overview,
F. Allg ¨ower, T. A. Badgwell, J. S. Qin, J. B. Rawlings, and S. J. Wright, “Nonlinear predictive control and moving horizon estimation—an intro- ductory overview,”Advances in control: Highlights of ECC’99, pp. 391– 449, 1999
work page 1999
-
[2]
Q. Liu, Z. Wang, H. Dong, and C. Jiang, “Remote estimation for energy harvesting systems under multiplicative noises: A binary encoding scheme with probabilistic bit flips,” IEEE Transactions on Automatic Control, vol. 68, no. 1, pp. 343–354, 2022
work page 2022
-
[3]
Moving-horizon estimation for linear dynamic networks with binary encoding schemes,
Q. Liu and Z. Wang, “Moving-horizon estimation for linear dynamic networks with binary encoding schemes,” IEEE Transactions on Auto- matic Control, vol. 66, no. 4, pp. 1763–1770, 2020
work page 2020
-
[4]
Constrained linear state estimation—a moving horizon approach,
C. V . Rao, J. B. Rawlings, and J. H. Lee, “Constrained linear state estimation—a moving horizon approach,” Automatica, vol. 37, no. 10, pp. 1619–1628, 2001
work page 2001
-
[5]
Robust stability of moving horizon estimation under bounded disturbances,
L. Ji, J. B. Rawlings, W. Hu, A. Wynn, and M. Diehl, “Robust stability of moving horizon estimation under bounded disturbances,” IEEE Transactions on Automatic Control , vol. 61, no. 11, pp. 3509– 3514, 2015
work page 2015
-
[6]
Nonlinear moving horizon estimation in the presence of bounded disturbances,
M. A. M ¨uller, “Nonlinear moving horizon estimation in the presence of bounded disturbances,” Automatica, vol. 79, pp. 306–314, 2017
work page 2017
-
[7]
Linear moving horizon estimation with pre-estimating observer,
D. Sui, T. A. Johansen, and L. Feng, “Linear moving horizon estimation with pre-estimating observer,” IEEE Transactions on Automatic Control, vol. 55, no. 10, pp. 2363–2368, 2010. 11
work page 2010
-
[8]
Moving horizon estimation with unknown inputs under dynamic quantization effects,
L. Zou, Z. Wang, J. Hu, and D. Zhou, “Moving horizon estimation with unknown inputs under dynamic quantization effects,” IEEE Transactions on Automatic Control , vol. 65, no. 12, pp. 5368–5375, 2020
work page 2020
-
[9]
W. Li, N. Hou, F. Yang, X. Bu, and L. Sun, “Finite-horizon variance- constrained estimation for complex networks subject to dynamical bias using binary encoding schemes,” IEEE Access , vol. 11, pp. 142589– 142600, 2023
work page 2023
-
[10]
Sample complexity of Kalman filtering for unknown systems,
A. Tsiamis, N. Matni, and G. Pappas, “Sample complexity of Kalman filtering for unknown systems,” in Learning for Dynamics and Control , pp. 435–444, 2020
work page 2020
-
[11]
Ljung, System Identification: Theory for the User
L. Ljung, System Identification: Theory for the User . Upper Saddle River, USA: PTR Prentice Hall, 1999
work page 1999
-
[12]
Kalmannet: Neural network aided Kalman filtering for partially known dynamics,
G. Revach, N. Shlezinger, X. Ni, A. L. Escoriza, R. J. G. van Sloun, and Y . C. Eldar, “Kalmannet: Neural network aided Kalman filtering for partially known dynamics,” IEEE Transactions on Signal Processing , vol. 70, pp. 1532–1547, 2022
work page 2022
-
[13]
Learning robust data-based LQG controllers from noisy data,
W. Liu, J. Sun, G. Wang, F. Bullo, and J. Chen, “Learning robust data-based LQG controllers from noisy data,” IEEE Transactions on Automatic Control, 2024. DOI: 10.1109/TAC.2024.3409749
-
[14]
Data- driven set-based estimation using matrix zonotopes with set containment guarantees,
A. Alanwar, A. Berndt, K. H. Johansson, and H. Sandberg, “Data- driven set-based estimation using matrix zonotopes with set containment guarantees,” in European Control Conference, pp. 875–881, 2022
work page 2022
-
[15]
Data-driven state estimation for linear systems,
V . K. Mishra, S. A. Hiremath, and N. Bajcinca, “Data-driven state estimation for linear systems,” in 2024 European Control Conference , pp. 906–913, IEEE, 2024
work page 2024
-
[16]
Wasserstein distributionally robust Kalman filtering,
S. Shafieezadeh Abadeh, V . A. Nguyen, D. Kuhn, and P. M. Moha- jerin Esfahani, “Wasserstein distributionally robust Kalman filtering,” Advances in Neural Information Processing Systems , vol. 31, 2018
work page 2018
-
[17]
On the relationship between data-enabled predictive control and subspace predictive control,
F. Fiedler and S. Lucia, “On the relationship between data-enabled predictive control and subspace predictive control,” in 2021 European Control Conference, pp. 222–229, IEEE, 2021
work page 2021
-
[18]
A note on persistency of excitation,
J. C. Willems, P. Rapisarda, I. Markovsky, and B. L. De Moor, “A note on persistency of excitation,” Systems & Control Letters , vol. 54, no. 4, pp. 325–329, 2005
work page 2005
-
[19]
Moving- horizon state estimation for nonlinear systems using neural networks,
A. Alessandri, M. Baglietto, G. Battistelli, and M. Gaggero, “Moving- horizon state estimation for nonlinear systems using neural networks,” IEEE Transactions on Neural Networks , vol. 22, no. 5, pp. 768–780, 2011
work page 2011
-
[20]
Physics-informed online learning of gray-box models by moving hori- zon estimation,
K. F. Løwenstein, D. Bernardini, L. Fagiano, and A. Bemporad, “Physics-informed online learning of gray-box models by moving hori- zon estimation,” European Journal of Control, vol. 74, p. 100861, 2023
work page 2023
-
[21]
Neural moving horizon estimation for robust flight control,
B. Wang, Z. Ma, S. Lai, and L. Zhao, “Neural moving horizon estimation for robust flight control,” IEEE Transactions on Robotics , 2023
work page 2023
-
[22]
Robust data-driven moving horizon estimation for linear discrete-time systems,
T. M. Wolff, V . G. Lopez, and M. A. M¨uller, “Robust data-driven moving horizon estimation for linear discrete-time systems,” IEEE Transactions on Automatic Control , vol. 69, no. 8, pp. 5598–5604, 2024
work page 2024
-
[23]
X. Liu, H. Chang, Z. Lu, and P. Huang, “Data-driven moving hori- zon estimation for angular velocity of space noncooperative tar- get in eddy current de-tumbling mission,” 2023. [Online]. Available: https://arxiv.org/abs/2301.05351
-
[24]
A real- time algorithm for moving horizon state and parameter estimation,
P. K ¨uhl, M. Diehl, T. Kraus, J. P. Schl ¨oder, and H. G. Bock, “A real- time algorithm for moving horizon state and parameter estimation,” Computers & chemical engineering , vol. 35, no. 1, pp. 71–83, 2011
work page 2011
-
[25]
Min-max moving- horizon estimation for uncertain discrete-time linear systems,
A. Alessandri, M. Baglietto, and G. Battistelli, “Min-max moving- horizon estimation for uncertain discrete-time linear systems,” SIAM Journal on Control and Optimization , vol. 50, no. 3, pp. 1439–1465, 2012
work page 2012
-
[26]
Learning-based moving horizon estimation through differentiable convex optimization layers,
S. Muntwiler, K. P. Wabersich, and M. N. Zeilinger, “Learning-based moving horizon estimation through differentiable convex optimization layers,” in Learning for Dynamics and Control Conference , pp. 153– 165, PMLR, 2022
work page 2022
-
[27]
Bequette, Process Control: Modeling, Design, and Simulation
B. Bequette, Process Control: Modeling, Design, and Simulation . New Jersey: Prentice Hall Professional, 2003
work page 2003
-
[28]
Moving horizon estimation for networked systems with quantized measurements and packet dropouts,
A. Liu, L. Yu, W.-A. Zhang, and M. Z. Chen, “Moving horizon estimation for networked systems with quantized measurements and packet dropouts,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 60, no. 7, pp. 1823–1834, 2013
work page 2013
-
[29]
Maximum likelihood estimation in data-driven modeling and control,
M. Yin, A. Iannelli, and R. S. Smith, “Maximum likelihood estimation in data-driven modeling and control,” IEEE Transactions on Automatic Control, vol. 68, no. 1, pp. 317–328, 2021
work page 2021
-
[30]
A stabilizing model-based predictive control algorithm for nonlinear systems,
L. Magni, G. De Nicolao, L. Magnani, and R. Scattolini, “A stabilizing model-based predictive control algorithm for nonlinear systems,” Auto- matica, vol. 37, no. 9, pp. 1351–1362, 2001
work page 2001
-
[31]
Robust Kalman filtering for uncertain discrete-time systems,
L. Xie, Y . C. Soh, and C. E. De Souza, “Robust Kalman filtering for uncertain discrete-time systems,” IEEE Transactions on Automatic Control, vol. 39, no. 6, pp. 1310–1314, 1994
work page 1994
-
[32]
Robust minimum variance filtering,
U. Shaked and C. E. de Souza, “Robust minimum variance filtering,” IEEE Transactions on Signal Processing, vol. 43, no. 11, pp. 2474–2483, 2002
work page 2002
-
[33]
Robust estimation of uncertain linear systems using a moving horizon approach,
J. F. Garc ´ıa Tirado, “Robust estimation of uncertain linear systems using a moving horizon approach,” Dissertation, 2012
work page 2012
-
[34]
Receding-horizon estimation for discrete-time linear systems,
A. Alessandri, M. Baglietto, and G. Battistelli, “Receding-horizon estimation for discrete-time linear systems,” IEEE Transactions on Automatic Control, vol. 48, no. 3, pp. 473–478, 2003
work page 2003
-
[35]
L. Zou, Z. Wang, J. Hu, and Q.-L. Han, “Moving horizon estimation meets multi-sensor information fusion: Development, opportunities and challenges,” Information Fusion, vol. 60, pp. 1–10, 2020
work page 2020
-
[36]
P. Neal, C. Eric, P. Borja, and E. Jonathan, “Distributed optimization and statistical learning via the alternating direction method of multipliers,” Foundations and Trends® in Machine learning, vol. 3, no. 1, pp. 1–122, 2011
work page 2011
-
[37]
Vershynin, High-Dimensional Probability: An Introduction with Ap- plications in Data Science
R. Vershynin, High-Dimensional Probability: An Introduction with Ap- plications in Data Science. Cambridge UK: Cambridge University Press, 2018
work page 2018
-
[38]
Adaptive human–robot interaction control for robots driven by series elastic actuators,
X. Li, Y . Pan, G. Chen, and H. Yu, “Adaptive human–robot interaction control for robots driven by series elastic actuators,” IEEE Transactions on Robotics, vol. 33, no. 1, pp. 169–182, 2016
work page 2016
-
[39]
P. Duan, T. Liu, Y . Xing, and K. H. Johansson, “Robust data-driven Kalman filtering for unknown linear systems using maximum likelihood optimization,” Automatica, vol. 180, p. 112474, 2025
work page 2025
-
[40]
On the sample com- plexity of the linear quadratic regulator,
S. Dean, H. Mania, N. Matni, B. Recht, and S. Tu, “On the sample com- plexity of the linear quadratic regulator,” Foundations of Computational Mathematics, vol. 20, no. 4, pp. 633–679, 2020
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.