pith. sign in

arxiv: 2511.21343 · v2 · submitted 2025-11-26 · 📡 eess.SY · cs.SY

Model Predictive Control and Moving Horizon Estimation using Statistically Weighted Data-Based Ensemble Models

Pith reviewed 2026-05-17 04:52 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords model predictive controlensemble modelsMahalanobis distancemoving horizon estimationdata-based modelsstate observerenergy systems
0
0 comments X

The pith

Ensemble models for predictive control adjust their weights dynamically using Mahalanobis distance based on system input.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a model predictive control framework built on an ensemble of data-based models instead of one fixed model. It introduces a weighting method that draws on the statistical Mahalanobis distance so each model's contribution can change across the prediction steps in response to the current input. The work also presents a moving horizon estimation procedure that acts as a state observer for the full ensemble. These elements are shown to work on a benchmark energy system that runs under several different conditions. A sympathetic reader would see this as a way to keep control reliable when the plant moves between regimes where different models are more trustworthy.

Core claim

The authors propose a combination rule for ensemble models based on the statistical Mahalanobis distance that lets the ensemble weights vary across the prediction window according to the system input, together with a moving horizon estimation scheme that supplies a state observer for such ensembles, and they demonstrate the approach on a benchmark energy system operating under multiple conditions.

What carries the argument

The Mahalanobis-distance-based combination rule for ensemble models, which produces input-dependent weights that can change at each step of the prediction horizon.

If this is right

  • The controller can maintain performance when the plant shifts between operating conditions without explicit mode detection or switching logic.
  • Prediction quality over the horizon improves because weights reflect statistical differences in model suitability for the current input.
  • State estimates are obtained consistently for the entire ensemble rather than requiring separate observers for each model.
  • The method applies directly to energy networks and similar plants where historical data spans multiple regimes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same weighting idea could be tested with other statistical distances or uncertainty measures to see whether further gains appear in prediction or control.
  • Online adaptation of ensemble weights may reduce the engineering effort spent on offline model selection or regime classification.
  • The framework might transfer to process control or robotics domains where simulation data and real measurements need to be blended without manual tuning.

Load-bearing premise

The individual data-based models are accurate enough across the operating conditions of interest and the Mahalanobis distance produces weights that meaningfully improve prediction quality rather than simply reflecting training-data density.

What would settle it

Apply the proposed controller and the same ensemble to the benchmark energy system but replace the Mahalanobis weighting with uniform weights or a single best model and measure whether closed-loop tracking error or prediction mismatch fails to decrease.

Figures

Figures reproduced from arXiv: 2511.21343 by Alessio La Bella, Azita Dabiri, Bart De Schutter, Laura Boca de Giuli, Riccardo Scattolini, Samuel Mallick.

Figure 1
Figure 1. Figure 1: Schematic representation of the AROMA DHS and its variables. where cp is the specific water heat coefficient. Furthermore, the constraint sets are defined as X = {x ∈ R nx | − 1nx ≤ x ≤ 1nx }, U =  u˜ ∈ R nu˜ |65 ≤ T s 0 ≤ 85, −15 ≤ qTES ≤ 15 , Y(h) =  y ∈ R ny |2 ≤ q r 0 ≤ 25, Tr 0 (h) ≤ T r 0 ≤ 75, T s j (h) ≤ T s j ≤ 85, i ∈ N , (11) where T r 0 and T s j are time-varying lower temperature bounds, e.… view at source ↗
Figure 2
Figure 2. Figure 2: a) Five thermal demand profiles. b) Electricity price trend. c) Temperature lower bound profiles: dashed for T r 0 , solid line for T c j . satisfies all constraints is used, and the storage flow is chosen as qTES(h) =    −7.5 c(h) < 0.125 7.5 c(h) > 0.175 0 otherwise , (16) such that charging and discharging of thermal energy is conducted when power production is cheap and expensive, respectively. Al… view at source ↗
Figure 3
Figure 3. Figure 3: First day of simulation showing the five load supply temperatures T s j along with the corresponding constraints (black dashed lines) under the different control strategies. RB AV LS MD-1 MD-2 J [e] 5965 5709 5547 5550 5495 V 0 46.8 31.6 0 0 t [s] - 13.7±7.69 19.4±7.18 27.1±12.2 19.6±9.22 TABLE I: Performance of each control strategy: economic cost J, constraint violation V , and MPC average computation ti… view at source ↗
Figure 4
Figure 4. Figure 4: Output error e [i] for model in open-loop (solid blue) and with MHE-based state estimation (dashed red). depend on system states and outputs, which are unavailable in prediction. In this work, we propose a novel combi￾nation rule for ensemble models based on the statistical Mahalanobis distance, allowing the ensemble weights to evolve over the prediction window solely as functions of the inputs. In additio… view at source ↗
read the original abstract

This paper presents a model predictive control (MPC) framework leveraging an ensemble of data-based models to optimally control complex systems under multiple operating conditions. A novel combination rule for ensemble models is proposed, based on the statistical Mahalanobis distance, enabling the ensemble weights to suitably vary across the prediction window based on the system input. In addition, a novel state observer for ensemble models is developed using moving horizon estimation (MHE). The effectiveness of the proposed methodology is demonstrated on a benchmark energy system operating under multiple conditions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes a model predictive control (MPC) framework that employs an ensemble of data-based models to control complex systems under multiple operating conditions. It introduces a novel ensemble combination rule based on the statistical Mahalanobis distance, which allows the weights to vary across the prediction window according to the system input. A novel state observer for ensemble models is developed using moving horizon estimation (MHE). The approach is demonstrated on a benchmark energy system operating under multiple conditions.

Significance. If the central claims hold, the work could provide a statistically motivated approach to adaptive weighting in data-driven ensemble MPC and a tailored MHE observer, which may be useful for systems with regime-dependent behavior. The benchmark demonstration on the energy system is a positive element for reproducibility, but the overall significance hinges on whether the Mahalanobis weighting and per-step reweighting deliver measurable gains beyond simpler ensemble rules.

major comments (2)
  1. [Benchmark demonstration / Numerical results] Benchmark demonstration section: the reported closed-loop performance on the energy system is shown for the proposed method, but no ablation or comparison is provided against uniform weights, fixed weights, or Euclidean-distance weighting. Without these controls it is impossible to determine whether the Mahalanobis-based, input-dependent reweighting across the horizon is load-bearing for the claimed improvement in prediction quality or control performance.
  2. [Ensemble combination rule] Ensemble weighting rule (definition of weights via Mahalanobis distance): the paper states that weights vary suitably across the prediction window based on the input, yet no quantitative evidence (e.g., plots of weight trajectories or sensitivity analysis) is given to show that the variation is both non-trivial and beneficial relative to a constant-weight baseline. If the weights remain nearly constant over typical horizons, the novelty of the combination rule is not substantiated.
minor comments (2)
  1. [Preliminaries / Model formulation] Notation for the combined prediction and the individual model outputs could be made more explicit to avoid ambiguity when the ensemble is used inside the MPC optimization.
  2. [MHE observer] The MHE observer derivation would benefit from a short remark on how the ensemble structure is incorporated into the arrival-cost or weighting matrices.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments on our manuscript. We address each major comment point by point below and will revise the manuscript accordingly to strengthen the evidence for our claims.

read point-by-point responses
  1. Referee: [Benchmark demonstration / Numerical results] Benchmark demonstration section: the reported closed-loop performance on the energy system is shown for the proposed method, but no ablation or comparison is provided against uniform weights, fixed weights, or Euclidean-distance weighting. Without these controls it is impossible to determine whether the Mahalanobis-based, input-dependent reweighting across the horizon is load-bearing for the claimed improvement in prediction quality or control performance.

    Authors: We agree that the current benchmark results would be strengthened by direct comparisons. In the revised manuscript we will add closed-loop performance comparisons on the energy system benchmark against uniform weights, fixed weights, and Euclidean-distance weighting, including quantitative metrics on prediction quality and control performance to demonstrate the contribution of the proposed Mahalanobis-based, input-dependent reweighting. revision: yes

  2. Referee: [Ensemble combination rule] Ensemble weighting rule (definition of weights via Mahalanobis distance): the paper states that weights vary suitably across the prediction window based on the input, yet no quantitative evidence (e.g., plots of weight trajectories or sensitivity analysis) is given to show that the variation is both non-trivial and beneficial relative to a constant-weight baseline. If the weights remain nearly constant over typical horizons, the novelty of the combination rule is not substantiated.

    Authors: We acknowledge that explicit quantitative support for the weight variation is needed. In the revision we will include plots of the ensemble weight trajectories over the prediction horizon for representative inputs, together with a sensitivity analysis that compares performance under the proposed varying weights against a constant-weight baseline. These additions will provide direct evidence that the weights vary non-trivially and that the variation improves prediction and control. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation remains self-contained

full rationale

The paper proposes a Mahalanobis-distance-based weighting rule for ensemble models and an MHE observer for state estimation, then demonstrates performance on a benchmark energy system. No equations, combination rules, or performance claims in the abstract or described methodology reduce by construction to fitted parameters or self-citations; the weighting is defined externally via statistical distance on inputs, and the benchmark serves as an independent test rather than a tautological fit. The central claims therefore retain independent content and do not collapse to the inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only view supplies no explicit free parameters, axioms, or invented entities; all modeling assumptions remain implicit in the data-based ensemble construction.

pith-pipeline@v0.9.0 · 5397 in / 1077 out tokens · 22317 ms · 2026-05-17T04:52:48.628730+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

31 extracted references · 31 canonical work pages

  1. [1]

    Model predictive control: Theory and practice—a survey,

    C. E. Garcia, D. M. Prett, and M. Morari, “Model predictive control: Theory and practice—a survey,”Automatica, vol. 25, no. 3, pp. 335– 348, 1989

  2. [2]

    Modeling and predictive control of networked systems via physics-informed neural networks,

    L. Boca de Giuli, A. La Bella, M. Farina, and R. Scattolini, “Modeling and predictive control of networked systems via physics-informed neural networks,” in2024 IEEE 63rd Conference on Decision and Control (CDC), 2024, pp. 3005–3010

  3. [3]

    Nonlinear MPC design for incrementally ISS systems with application to GRU networks,

    F. Bonassi, A. La Bella, M. Farina, and R. Scattolini, “Nonlinear MPC design for incrementally ISS systems with application to GRU networks,”Automatica, vol. 159, p. 111381, 2024

  4. [4]

    Learning-based MPC for fuel efficient control of autonomous vehicles with discrete gear selection,

    S. Mallick, G. Battocletti, Q. Dong, A. Dabiri, and B. De Schutter, “Learning-based MPC for fuel efficient control of autonomous vehicles with discrete gear selection,”IEEE Control Systems Letters, 2025

  5. [5]

    A survey of uncertainty in deep neural networks,

    J. Gawlikowski, C. R. N. Tassi, M. Ali, J. Lee, M. Humt, J. Feng, A. Kruspe, R. Triebel, P. Jung, R. Roscheret al., “A survey of uncertainty in deep neural networks,”Artificial Intelligence Review, vol. 56, no. Suppl 1, pp. 1513–1589, 2023

  6. [6]

    Lifelong learning for monitoring and adaptation of data-based dynam- ical models: a statistical process control approach,

    L. Boca de Giuli, A. La Bella, G. De Nicolao, and R. Scattolini, “Lifelong learning for monitoring and adaptation of data-based dynam- ical models: a statistical process control approach,” in2024 European Control Conference (ECC), 2024, pp. 947–952

  7. [7]

    Zhang and Y

    C. Zhang and Y . Ma,Ensemble Machine Learning: Methods and Applications. Springer Science & Business Media, 2012

  8. [8]

    Three types of incremental learning,

    G. M. Van de Ven, T. Tuytelaars, and A. S. Tolias, “Three types of incremental learning,”Nature Machine Intelligence, vol. 4, no. 12, pp. 1185–1197, 2022

  9. [9]

    Explainable data- driven modeling via mixture of experts: Towards effective blending of gray and black-box models,

    J. Leoni, V . Breschi, S. Formentin, and M. Tanelli, “Explainable data- driven modeling via mixture of experts: Towards effective blending of gray and black-box models,”Automatica, vol. 173, p. 112066, 2025

  10. [10]

    Theory on mixture- of-experts in continual learning,

    H. Li, S. Lin, L. Duan, Y . Liang, and N. B. Shroff, “Theory on mixture- of-experts in continual learning,”arXiv preprint arXiv:2406.16437, 2024

  11. [11]

    Machine learning- based predictive control of nonlinear processes. part i: theory,

    Z. Wu, A. Tran, D. Rincon, and P. D. Christofides, “Machine learning- based predictive control of nonlinear processes. part i: theory,”AIChE Journal, vol. 65, no. 11, p. e16729, 2019

  12. [12]

    Coscl: Cooperation of small continual learners is stronger than a big one,

    L. Wang, X. Zhang, Q. Li, J. Zhu, and Y . Zhong, “Coscl: Cooperation of small continual learners is stronger than a big one,” inEuropean Conference on Computer Vision, 2022, pp. 254–271

  13. [13]

    Optimizing process economics and operational safety via economic MPC using barrier functions and recurrent neural network models,

    Z. Wu and P. D. Christofides, “Optimizing process economics and operational safety via economic MPC using barrier functions and recurrent neural network models,”Chemical Engineering Research and Design, vol. 152, pp. 455–465, 2019

  14. [14]

    Hierarchical mixtures of experts and the EM algorithm,

    M. I. Jordan and R. A. Jacobs, “Hierarchical mixtures of experts and the EM algorithm,”Neural computation, vol. 6, no. 2, pp. 181–214, 1994

  15. [15]

    An ensemble neural network framework for im- proving the detection ability of a base control chart in non-parametric profile monitoring,

    A. Yeganeh, S. A. Abbasi, F. Pourpanah, A. Shadman, A. Johannssen, and N. Chukhrova, “An ensemble neural network framework for im- proving the detection ability of a base control chart in non-parametric profile monitoring,”Expert Systems with Applications, vol. 204, p. 117572, 2022

  16. [16]

    Continual learning us- ing bayesian neural networks,

    H. Li, P. Barnaghi, S. Enshaeifar, and F. Ganz, “Continual learning us- ing bayesian neural networks,”IEEE Transactions on Neural Networks and Learning systems, vol. 32, no. 9, pp. 4243–4252, 2020

  17. [17]

    Neural Network-Based KKL observer for nonlinear discrete-time systems,

    J. Peralez, M. Nadri, and D. Astolfi, “Neural Network-Based KKL observer for nonlinear discrete-time systems,” in2022 IEEE 61st Conference on Decision and Control (CDC), 2022, pp. 2105–2110

  18. [18]

    Hybrid multi-observer for improving estimation performance,

    E. Petri, R. Postoyan, D. Astolfi, D. Ne ˇsi´c, and V . Andrieu, “Hybrid multi-observer for improving estimation performance,”IEEE Transac- tions on Automatic Control, 2024

  19. [19]

    A static reduced-order multiple-model adaptive estimator for noise identification,

    J. Khalife and Z. M. Kassas, “A static reduced-order multiple-model adaptive estimator for noise identification,”IEEE Transactions on Aerospace and Electronic Systems, vol. 59, no. 3, pp. 2672–2686, 2023

  20. [20]

    Reconciling deep learning and control theory: recurrent neural networks for model-based control design,

    F. Bonassi, “Reconciling deep learning and control theory: recurrent neural networks for model-based control design,” Ph.D. dissertation, Politecnico di Milano, 2022

  21. [21]

    D. C. Montgomery,Statistical Quality Control. Wiley New York, 2009, vol. 7

  22. [22]

    Moving horizon estimation,

    D. A. Allan and J. B. Rawlings, “Moving horizon estimation,” in Handbook of Model Predictive Control. Springer, 2018, pp. 99–124

  23. [23]

    On recurrent neural networks for learning-based control: recent results and ideas for future developments,

    F. Bonassi, M. Farina, J. Xie, and R. Scattolini, “On recurrent neural networks for learning-based control: recent results and ideas for future developments,”Journal of Process Control, vol. 114, pp. 92–104, 2022

  24. [24]

    Nonlinear optimization of district heating networks,

    R. Krug, V . Mehrmann, and M. Schmidt, “Nonlinear optimization of district heating networks,”Optimization and Engineering, vol. 22, no. 2, pp. 783–819, 2021

  25. [25]

    On the certainty equivalence principle and the optimal control of partially observed dynamic games,

    M. R. James, “On the certainty equivalence principle and the optimal control of partially observed dynamic games,”IEEE Transactions on Automatic Control, vol. 39, no. 11, pp. 2321–2324, 1994

  26. [26]

    Learning, fast and slow: a two-fold algorithm for data-based model adaptation,

    L. Boca de Giuli, A. La Bella, and R. Scattolini, “Learning, fast and slow: a two-fold algorithm for data-based model adaptation,”arXiv preprint arXiv:2507.12187, 2025

  27. [27]

    Optimal management and data-based predictive control of district heating systems: The Novate Milanese experimental case-study,

    A. La Bella and A. Del Corno, “Optimal management and data-based predictive control of district heating systems: The Novate Milanese experimental case-study,”Control Engineering Practice, vol. 132, p. 105429, 2023

  28. [28]

    Learning and ensemble based MPC with differential dynamic programming for nuclear power autonomous control,

    W. Li, J. Cai, C. Duan, S. Chen, P. Ding, J. Lin, and D. Cui, “Learning and ensemble based MPC with differential dynamic programming for nuclear power autonomous control,”Expert Systems with Applications, vol. 215, p. 119416, 2023

  29. [29]

    Control-oriented modeling, simulation, and predictive control of district heating net- works,

    L. Nigro, A. La Bella, F. Casella, and R. Scattolini, “Control-oriented modeling, simulation, and predictive control of district heating net- works,”IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 7064–7079, 2025

  30. [30]

    Development and experimental validation of an open-source model library for district heating network simulation,

    M. A. M. Alvarado, C. Anderis, R. Lazzari, L. Nigro, and A. La Bella, “Development and experimental validation of an open-source model library for district heating network simulation,” in2024 Open Source Modelling and Simulation of Energy Systems (OSMSES). IEEE, 2024, pp. 1–6

  31. [31]

    Physics-informed neural network modeling and predictive control of district heating systems,

    L. Boca de Giuli, A. La Bella, and R. Scattolini, “Physics-informed neural network modeling and predictive control of district heating systems,”IEEE Transactions on Control Systems Technology, vol. 32, no. 4, pp. 1182–1195, 2024