Model Predictive Control and Moving Horizon Estimation using Statistically Weighted Data-Based Ensemble Models
Pith reviewed 2026-05-17 04:52 UTC · model grok-4.3
The pith
Ensemble models for predictive control adjust their weights dynamically using Mahalanobis distance based on system input.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors propose a combination rule for ensemble models based on the statistical Mahalanobis distance that lets the ensemble weights vary across the prediction window according to the system input, together with a moving horizon estimation scheme that supplies a state observer for such ensembles, and they demonstrate the approach on a benchmark energy system operating under multiple conditions.
What carries the argument
The Mahalanobis-distance-based combination rule for ensemble models, which produces input-dependent weights that can change at each step of the prediction horizon.
If this is right
- The controller can maintain performance when the plant shifts between operating conditions without explicit mode detection or switching logic.
- Prediction quality over the horizon improves because weights reflect statistical differences in model suitability for the current input.
- State estimates are obtained consistently for the entire ensemble rather than requiring separate observers for each model.
- The method applies directly to energy networks and similar plants where historical data spans multiple regimes.
Where Pith is reading between the lines
- The same weighting idea could be tested with other statistical distances or uncertainty measures to see whether further gains appear in prediction or control.
- Online adaptation of ensemble weights may reduce the engineering effort spent on offline model selection or regime classification.
- The framework might transfer to process control or robotics domains where simulation data and real measurements need to be blended without manual tuning.
Load-bearing premise
The individual data-based models are accurate enough across the operating conditions of interest and the Mahalanobis distance produces weights that meaningfully improve prediction quality rather than simply reflecting training-data density.
What would settle it
Apply the proposed controller and the same ensemble to the benchmark energy system but replace the Mahalanobis weighting with uniform weights or a single best model and measure whether closed-loop tracking error or prediction mismatch fails to decrease.
Figures
read the original abstract
This paper presents a model predictive control (MPC) framework leveraging an ensemble of data-based models to optimally control complex systems under multiple operating conditions. A novel combination rule for ensemble models is proposed, based on the statistical Mahalanobis distance, enabling the ensemble weights to suitably vary across the prediction window based on the system input. In addition, a novel state observer for ensemble models is developed using moving horizon estimation (MHE). The effectiveness of the proposed methodology is demonstrated on a benchmark energy system operating under multiple conditions.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a model predictive control (MPC) framework that employs an ensemble of data-based models to control complex systems under multiple operating conditions. It introduces a novel ensemble combination rule based on the statistical Mahalanobis distance, which allows the weights to vary across the prediction window according to the system input. A novel state observer for ensemble models is developed using moving horizon estimation (MHE). The approach is demonstrated on a benchmark energy system operating under multiple conditions.
Significance. If the central claims hold, the work could provide a statistically motivated approach to adaptive weighting in data-driven ensemble MPC and a tailored MHE observer, which may be useful for systems with regime-dependent behavior. The benchmark demonstration on the energy system is a positive element for reproducibility, but the overall significance hinges on whether the Mahalanobis weighting and per-step reweighting deliver measurable gains beyond simpler ensemble rules.
major comments (2)
- [Benchmark demonstration / Numerical results] Benchmark demonstration section: the reported closed-loop performance on the energy system is shown for the proposed method, but no ablation or comparison is provided against uniform weights, fixed weights, or Euclidean-distance weighting. Without these controls it is impossible to determine whether the Mahalanobis-based, input-dependent reweighting across the horizon is load-bearing for the claimed improvement in prediction quality or control performance.
- [Ensemble combination rule] Ensemble weighting rule (definition of weights via Mahalanobis distance): the paper states that weights vary suitably across the prediction window based on the input, yet no quantitative evidence (e.g., plots of weight trajectories or sensitivity analysis) is given to show that the variation is both non-trivial and beneficial relative to a constant-weight baseline. If the weights remain nearly constant over typical horizons, the novelty of the combination rule is not substantiated.
minor comments (2)
- [Preliminaries / Model formulation] Notation for the combined prediction and the individual model outputs could be made more explicit to avoid ambiguity when the ensemble is used inside the MPC optimization.
- [MHE observer] The MHE observer derivation would benefit from a short remark on how the ensemble structure is incorporated into the arrival-cost or weighting matrices.
Simulated Author's Rebuttal
We thank the referee for their constructive comments on our manuscript. We address each major comment point by point below and will revise the manuscript accordingly to strengthen the evidence for our claims.
read point-by-point responses
-
Referee: [Benchmark demonstration / Numerical results] Benchmark demonstration section: the reported closed-loop performance on the energy system is shown for the proposed method, but no ablation or comparison is provided against uniform weights, fixed weights, or Euclidean-distance weighting. Without these controls it is impossible to determine whether the Mahalanobis-based, input-dependent reweighting across the horizon is load-bearing for the claimed improvement in prediction quality or control performance.
Authors: We agree that the current benchmark results would be strengthened by direct comparisons. In the revised manuscript we will add closed-loop performance comparisons on the energy system benchmark against uniform weights, fixed weights, and Euclidean-distance weighting, including quantitative metrics on prediction quality and control performance to demonstrate the contribution of the proposed Mahalanobis-based, input-dependent reweighting. revision: yes
-
Referee: [Ensemble combination rule] Ensemble weighting rule (definition of weights via Mahalanobis distance): the paper states that weights vary suitably across the prediction window based on the input, yet no quantitative evidence (e.g., plots of weight trajectories or sensitivity analysis) is given to show that the variation is both non-trivial and beneficial relative to a constant-weight baseline. If the weights remain nearly constant over typical horizons, the novelty of the combination rule is not substantiated.
Authors: We acknowledge that explicit quantitative support for the weight variation is needed. In the revision we will include plots of the ensemble weight trajectories over the prediction horizon for representative inputs, together with a sensitivity analysis that compares performance under the proposed varying weights against a constant-weight baseline. These additions will provide direct evidence that the weights vary non-trivially and that the variation improves prediction and control. revision: yes
Circularity Check
No significant circularity; derivation remains self-contained
full rationale
The paper proposes a Mahalanobis-distance-based weighting rule for ensemble models and an MHE observer for state estimation, then demonstrates performance on a benchmark energy system. No equations, combination rules, or performance claims in the abstract or described methodology reduce by construction to fitted parameters or self-citations; the weighting is defined externally via statistical distance on inputs, and the benchmark serves as an independent test rather than a tautological fit. The central claims therefore retain independent content and do not collapse to the inputs.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
A novel combination rule for ensemble models is proposed, based on the statistical Mahalanobis distance, enabling the ensemble weights to suitably vary across the prediction window based on the system input.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Model predictive control: Theory and practice—a survey,
C. E. Garcia, D. M. Prett, and M. Morari, “Model predictive control: Theory and practice—a survey,”Automatica, vol. 25, no. 3, pp. 335– 348, 1989
work page 1989
-
[2]
Modeling and predictive control of networked systems via physics-informed neural networks,
L. Boca de Giuli, A. La Bella, M. Farina, and R. Scattolini, “Modeling and predictive control of networked systems via physics-informed neural networks,” in2024 IEEE 63rd Conference on Decision and Control (CDC), 2024, pp. 3005–3010
work page 2024
-
[3]
Nonlinear MPC design for incrementally ISS systems with application to GRU networks,
F. Bonassi, A. La Bella, M. Farina, and R. Scattolini, “Nonlinear MPC design for incrementally ISS systems with application to GRU networks,”Automatica, vol. 159, p. 111381, 2024
work page 2024
-
[4]
Learning-based MPC for fuel efficient control of autonomous vehicles with discrete gear selection,
S. Mallick, G. Battocletti, Q. Dong, A. Dabiri, and B. De Schutter, “Learning-based MPC for fuel efficient control of autonomous vehicles with discrete gear selection,”IEEE Control Systems Letters, 2025
work page 2025
-
[5]
A survey of uncertainty in deep neural networks,
J. Gawlikowski, C. R. N. Tassi, M. Ali, J. Lee, M. Humt, J. Feng, A. Kruspe, R. Triebel, P. Jung, R. Roscheret al., “A survey of uncertainty in deep neural networks,”Artificial Intelligence Review, vol. 56, no. Suppl 1, pp. 1513–1589, 2023
work page 2023
-
[6]
L. Boca de Giuli, A. La Bella, G. De Nicolao, and R. Scattolini, “Lifelong learning for monitoring and adaptation of data-based dynam- ical models: a statistical process control approach,” in2024 European Control Conference (ECC), 2024, pp. 947–952
work page 2024
-
[7]
C. Zhang and Y . Ma,Ensemble Machine Learning: Methods and Applications. Springer Science & Business Media, 2012
work page 2012
-
[8]
Three types of incremental learning,
G. M. Van de Ven, T. Tuytelaars, and A. S. Tolias, “Three types of incremental learning,”Nature Machine Intelligence, vol. 4, no. 12, pp. 1185–1197, 2022
work page 2022
-
[9]
J. Leoni, V . Breschi, S. Formentin, and M. Tanelli, “Explainable data- driven modeling via mixture of experts: Towards effective blending of gray and black-box models,”Automatica, vol. 173, p. 112066, 2025
work page 2025
-
[10]
Theory on mixture- of-experts in continual learning,
H. Li, S. Lin, L. Duan, Y . Liang, and N. B. Shroff, “Theory on mixture- of-experts in continual learning,”arXiv preprint arXiv:2406.16437, 2024
-
[11]
Machine learning- based predictive control of nonlinear processes. part i: theory,
Z. Wu, A. Tran, D. Rincon, and P. D. Christofides, “Machine learning- based predictive control of nonlinear processes. part i: theory,”AIChE Journal, vol. 65, no. 11, p. e16729, 2019
work page 2019
-
[12]
Coscl: Cooperation of small continual learners is stronger than a big one,
L. Wang, X. Zhang, Q. Li, J. Zhu, and Y . Zhong, “Coscl: Cooperation of small continual learners is stronger than a big one,” inEuropean Conference on Computer Vision, 2022, pp. 254–271
work page 2022
-
[13]
Z. Wu and P. D. Christofides, “Optimizing process economics and operational safety via economic MPC using barrier functions and recurrent neural network models,”Chemical Engineering Research and Design, vol. 152, pp. 455–465, 2019
work page 2019
-
[14]
Hierarchical mixtures of experts and the EM algorithm,
M. I. Jordan and R. A. Jacobs, “Hierarchical mixtures of experts and the EM algorithm,”Neural computation, vol. 6, no. 2, pp. 181–214, 1994
work page 1994
-
[15]
A. Yeganeh, S. A. Abbasi, F. Pourpanah, A. Shadman, A. Johannssen, and N. Chukhrova, “An ensemble neural network framework for im- proving the detection ability of a base control chart in non-parametric profile monitoring,”Expert Systems with Applications, vol. 204, p. 117572, 2022
work page 2022
-
[16]
Continual learning us- ing bayesian neural networks,
H. Li, P. Barnaghi, S. Enshaeifar, and F. Ganz, “Continual learning us- ing bayesian neural networks,”IEEE Transactions on Neural Networks and Learning systems, vol. 32, no. 9, pp. 4243–4252, 2020
work page 2020
-
[17]
Neural Network-Based KKL observer for nonlinear discrete-time systems,
J. Peralez, M. Nadri, and D. Astolfi, “Neural Network-Based KKL observer for nonlinear discrete-time systems,” in2022 IEEE 61st Conference on Decision and Control (CDC), 2022, pp. 2105–2110
work page 2022
-
[18]
Hybrid multi-observer for improving estimation performance,
E. Petri, R. Postoyan, D. Astolfi, D. Ne ˇsi´c, and V . Andrieu, “Hybrid multi-observer for improving estimation performance,”IEEE Transac- tions on Automatic Control, 2024
work page 2024
-
[19]
A static reduced-order multiple-model adaptive estimator for noise identification,
J. Khalife and Z. M. Kassas, “A static reduced-order multiple-model adaptive estimator for noise identification,”IEEE Transactions on Aerospace and Electronic Systems, vol. 59, no. 3, pp. 2672–2686, 2023
work page 2023
-
[20]
F. Bonassi, “Reconciling deep learning and control theory: recurrent neural networks for model-based control design,” Ph.D. dissertation, Politecnico di Milano, 2022
work page 2022
-
[21]
D. C. Montgomery,Statistical Quality Control. Wiley New York, 2009, vol. 7
work page 2009
-
[22]
D. A. Allan and J. B. Rawlings, “Moving horizon estimation,” in Handbook of Model Predictive Control. Springer, 2018, pp. 99–124
work page 2018
-
[23]
F. Bonassi, M. Farina, J. Xie, and R. Scattolini, “On recurrent neural networks for learning-based control: recent results and ideas for future developments,”Journal of Process Control, vol. 114, pp. 92–104, 2022
work page 2022
-
[24]
Nonlinear optimization of district heating networks,
R. Krug, V . Mehrmann, and M. Schmidt, “Nonlinear optimization of district heating networks,”Optimization and Engineering, vol. 22, no. 2, pp. 783–819, 2021
work page 2021
-
[25]
On the certainty equivalence principle and the optimal control of partially observed dynamic games,
M. R. James, “On the certainty equivalence principle and the optimal control of partially observed dynamic games,”IEEE Transactions on Automatic Control, vol. 39, no. 11, pp. 2321–2324, 1994
work page 1994
-
[26]
Learning, fast and slow: a two-fold algorithm for data-based model adaptation,
L. Boca de Giuli, A. La Bella, and R. Scattolini, “Learning, fast and slow: a two-fold algorithm for data-based model adaptation,”arXiv preprint arXiv:2507.12187, 2025
-
[27]
A. La Bella and A. Del Corno, “Optimal management and data-based predictive control of district heating systems: The Novate Milanese experimental case-study,”Control Engineering Practice, vol. 132, p. 105429, 2023
work page 2023
-
[28]
W. Li, J. Cai, C. Duan, S. Chen, P. Ding, J. Lin, and D. Cui, “Learning and ensemble based MPC with differential dynamic programming for nuclear power autonomous control,”Expert Systems with Applications, vol. 215, p. 119416, 2023
work page 2023
-
[29]
Control-oriented modeling, simulation, and predictive control of district heating net- works,
L. Nigro, A. La Bella, F. Casella, and R. Scattolini, “Control-oriented modeling, simulation, and predictive control of district heating net- works,”IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 7064–7079, 2025
work page 2025
-
[30]
M. A. M. Alvarado, C. Anderis, R. Lazzari, L. Nigro, and A. La Bella, “Development and experimental validation of an open-source model library for district heating network simulation,” in2024 Open Source Modelling and Simulation of Energy Systems (OSMSES). IEEE, 2024, pp. 1–6
work page 2024
-
[31]
Physics-informed neural network modeling and predictive control of district heating systems,
L. Boca de Giuli, A. La Bella, and R. Scattolini, “Physics-informed neural network modeling and predictive control of district heating systems,”IEEE Transactions on Control Systems Technology, vol. 32, no. 4, pp. 1182–1195, 2024
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.