A Hybrid Intelligent Framework for Uncertainty-Aware Condition Monitoring of Industrial Systems

Homayoun Najjaran; Maryam Ahang; Masoud Jalayer; Todd Charter

arxiv: 2604.09932 · v1 · submitted 2026-04-10 · 💻 cs.LG · cs.AI· eess.SP

A Hybrid Intelligent Framework for Uncertainty-Aware Condition Monitoring of Industrial Systems

Maryam Ahang , Todd Charter , Masoud Jalayer , Homayoun Najjaran This is my paper

Pith reviewed 2026-05-10 16:51 UTC · model grok-4.3

classification 💻 cs.LG cs.AIeess.SP

keywords hybrid learningcondition monitoringphysics-informed residualsconformal predictionfault diagnosisuncertainty quantificationindustrial systemsensemble methods

0 comments

The pith

Integrating sensor data with temporal features and physics-informed residuals via feature fusion or model ensembles improves diagnostic accuracy and uncertainty calibration on industrial benchmarks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a hybrid condition monitoring framework that augments primary sensor measurements with lagged temporal features and residuals computed from nominal surrogate models. It compares two integration approaches: feature-level fusion that expands the input space, and model-level ensembles that combine separate classifiers at the decision stage. On a continuous stirred-tank reactor benchmark, both hybrids raise fault detection accuracy over single-source baselines, with the strongest model-level ensemble gaining 2.9 percent. When conformal prediction is added to produce uncertainty sets, the hybrid versions deliver smaller sets while maintaining the target coverage rate. The work shows that lightweight physics residuals can be combined with data-driven methods to support more reliable alerts in nonlinear industrial systems.

Core claim

The central claim is that both feature-level fusion of sensor readings, temporal lags, and physics-informed residuals and model-level ensembles of classifiers trained on different feature subsets improve diagnostic accuracy relative to single-source baselines on the CSTR benchmark, with the best ensemble reaching a 2.9% gain, and that the same hybrids yield smaller, well-calibrated prediction sets under conformal prediction at matched coverage levels.

What carries the argument

Hybrid integration strategies that combine data-driven features with physics-informed residuals either by direct input augmentation or by decision-level ensembling of separate models.

If this is right

Both feature-level and model-level hybrids raise accuracy over single-source baselines.
Model-level ensembles deliver the largest accuracy improvement of 2.9% over the best baseline ensemble.
Hybrid models produce smaller prediction sets under conformal prediction while preserving coverage.
Uncertainty quantification becomes better calibrated without added computational overhead from heavy physics simulation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same lightweight residual approach could support online monitoring in other continuous processes if surrogate models are available.
Combining the hybrids with adaptive retraining might handle gradual system drift.
The framework offers a practical middle path between pure data-driven and full physics-based monitoring when complete first-principles models are unavailable.

Load-bearing premise

Physics-informed residuals from nominal surrogate models supply information that is complementary rather than redundant with raw sensor and temporal features, and that gains seen on the single CSTR benchmark will generalize to other nonlinear industrial systems.

What would settle it

No measurable accuracy gain and no reduction in conformal prediction set size when residuals are added to a different nonlinear industrial process would falsify the central claim.

Figures

Figures reproduced from arXiv: 2604.09932 by Homayoun Najjaran, Maryam Ahang, Masoud Jalayer, Todd Charter.

**Figure 2.** Figure 2: Accuracy comparison of different ensemble models [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: Confidence and prediction set size comparison using stacking (LogR) [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

read the original abstract

Hybrid approaches that combine data-driven learning with physics-based insight have shown promise for improving the reliability of industrial condition monitoring. This work develops a hybrid condition monitoring framework that integrates primary sensor measurements, lagged temporal features, and physics-informed residuals derived from nominal surrogate models. Two hybrid integration strategies are examined. The first is a feature-level fusion approach that augments the input space with residual and temporal information. The second is a model-level ensemble approach in which machine learning classifiers trained on different feature types are combined at the decision level. Both hybrid approaches of the condition monitoring framework are evaluated on a continuous stirred-tank reactor (CSTR) benchmark using several machine learning models and ensemble configurations. Both feature-level and model-level hybridization improve diagnostic accuracy relative to single-source baselines, with the best model-level ensemble achieving a 2.9\% improvement over the best baseline ensemble. To assess predictive reliability, conformal prediction is applied to quantify coverage, prediction-set size, and abstention behavior. The results show that hybrid integration enhances uncertainty management, producing smaller and well-calibrated prediction sets at matched coverage levels. These findings demonstrate that lightweight physics-informed residuals, temporal augmentation, and ensemble learning can be combined effectively to improve both accuracy and decision reliability in nonlinear industrial systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Modest accuracy lift and tighter conformal sets from hybrid fusion on CSTR, but everything rests on one benchmark with no generalization checks.

read the letter

The main point is that this paper combines raw sensors, lagged features, and physics residuals from a nominal surrogate in two ways—feature augmentation and decision-level ensembles—and reports a 2.9% accuracy gain plus smaller, well-calibrated prediction sets on the CSTR simulation. The conformal prediction layer is applied cleanly to show coverage and abstention behavior at matched levels. That combination is executed in a straightforward way that gives a usable template for adding lightweight physics signals without much extra cost. The empirical comparisons against single-source baselines are presented directly, which is the part that actually works. The soft spot is the narrow scope. All quantitative claims come from a single continuous stirred-tank reactor benchmark. There are no additional processes, no cross-system validation, and no sensitivity runs on noise or fault profiles. If the residuals largely overlap with the temporal features already in the data, the reported benefit could shrink or disappear outside this specific dynamics. The abstract also leaves out statistical tests and exact data-split details, so the robustness of the 2.9% number is hard to judge from the given evidence. This paper is for applied researchers or engineers who need a ready recipe for industrial condition monitoring and are willing to adapt the hybrid steps to their own systems. It is not for readers looking for new theory or broad claims. I would send it to peer review. The methods are standard but the application is carried through consistently enough that referees can focus on tightening the generalization question and verifying the experimental controls.

Referee Report

2 major / 2 minor

Summary. The paper proposes a hybrid condition monitoring framework for industrial systems that fuses primary sensor measurements with lagged temporal features and physics-informed residuals derived from nominal surrogate models. It compares two integration strategies—feature-level augmentation of the input space and model-level ensembles of classifiers trained on different feature types—on a continuous stirred-tank reactor (CSTR) benchmark. The central empirical claims are that both strategies improve diagnostic accuracy (with the best model-level ensemble showing a 2.9% gain over the best baseline ensemble) and that hybrid integration yields smaller, well-calibrated conformal prediction sets at matched coverage levels.

Significance. If the hybridization benefits hold, the work provides evidence that lightweight physics-informed residuals can supply complementary information to raw and temporal features, improving both accuracy and uncertainty quantification in nonlinear systems. The explicit use of conformal prediction to report coverage, prediction-set size, and abstention behavior is a methodological strength that supports reliable decision-making claims.

major comments (2)

[§4] §4 (CSTR benchmark evaluation): All quantitative results, including the 2.9% accuracy improvement and the conformal prediction set-size reductions, derive from a single simulated CSTR system. No additional nonlinear processes, cross-system validation, or sensitivity analysis to fault types/noise profiles are reported, which directly limits support for the abstract's claim of effectiveness 'in nonlinear industrial systems.'
[§4.2] §4.2 (ensemble and baseline comparisons): The reported accuracy gains and uncertainty benefits lack accompanying statistical significance tests, variance across multiple random seeds or data splits, or explicit baseline definitions, making it impossible to determine whether the 2.9% margin is robust or sensitive to post-hoc configuration choices.

minor comments (2)

[Abstract] The abstract and introduction use 'CSTR' without an initial expansion; add '(continuous stirred-tank reactor)' on first use for clarity.
[Figures/Tables] Figure captions and table headers should explicitly state the coverage level (e.g., 95%) used for the conformal prediction set-size comparisons to allow direct interpretation.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive feedback. We address the major comments point by point below, indicating where revisions will be made to strengthen the manuscript.

read point-by-point responses

Referee: [§4] §4 (CSTR benchmark evaluation): All quantitative results, including the 2.9% accuracy improvement and the conformal prediction set-size reductions, derive from a single simulated CSTR system. No additional nonlinear processes, cross-system validation, or sensitivity analysis to fault types/noise profiles are reported, which directly limits support for the abstract's claim of effectiveness 'in nonlinear industrial systems.'

Authors: We agree that the evaluation is limited to a single CSTR benchmark. CSTR is a standard nonlinear process benchmark, but we acknowledge this restricts broad claims about 'nonlinear industrial systems.' In revision we will add sensitivity analysis to noise profiles and fault severities within the existing CSTR model, expand the discussion of limitations, and qualify the generalizability statements. We maintain that the hybrid fusion approach is not CSTR-specific, yet we accept that additional benchmarks would provide stronger support. revision: partial
Referee: [§4.2] §4.2 (ensemble and baseline comparisons): The reported accuracy gains and uncertainty benefits lack accompanying statistical significance tests, variance across multiple random seeds or data splits, or explicit baseline definitions, making it impossible to determine whether the 2.9% margin is robust or sensitive to post-hoc configuration choices.

Authors: We accept this criticism. The current results are from single runs without reported variance or significance testing. In the revised manuscript we will rerun all experiments across multiple random seeds and data splits, report means and standard deviations, add statistical significance tests (e.g., paired t-tests or Wilcoxon tests) between hybrid and baseline models, and provide explicit, reproducible definitions of all baselines and ensemble configurations in §4.2. revision: yes

standing simulated objections not resolved

Cross-system validation on additional real nonlinear industrial processes beyond the simulated CSTR, due to lack of access to further public or proprietary datasets.

Circularity Check

0 steps flagged

No circularity: empirical comparisons on CSTR benchmark with no derivations reducing to inputs

full rationale

The paper presents an empirical framework for hybrid condition monitoring that augments sensor data with temporal features and physics-informed residuals, then compares feature-level fusion and model-level ensembles against single-source baselines on a single CSTR simulation. All reported gains (e.g., 2.9% accuracy improvement, smaller calibrated prediction sets) are direct experimental outcomes from the same dataset and models; no equations, first-principles derivations, fitted parameters renamed as predictions, or self-citation chains are invoked to support the central claims. The results are therefore self-contained as benchmark comparisons rather than tautological by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based on the abstract alone, no explicit free parameters, axioms, or invented entities are stated; the framework relies on standard machine-learning classifiers and conformal prediction methods drawn from prior work.

pith-pipeline@v0.9.0 · 5530 in / 1179 out tokens · 45699 ms · 2026-05-10T16:51:36.626659+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

16 extracted references · 16 canonical work pages

[1]

Condition monitoring using machine learning: A review of theory, applications, and recent advances,

O. Surucu, S. A. Gadsden, and J. Yawney, “Condition monitoring using machine learning: A review of theory, applications, and recent advances,” Expert Systems with Applications, vol. 221, p. 119738, 2023

work page 2023
[2]

Model-based fault-detection and diagnosis–status and applications,

R. Isermann, “Model-based fault-detection and diagnosis–status and applications,”Annual Reviews in control, vol. 29, no. 1, pp. 71–85, 2005

work page 2005
[3]

Overview on hybrid approaches to fault detection and diagnosis: Combining data- driven, physics-based and knowledge-based models,

Y . Wilhelm, P. Reimann, W. Gauchel, and B. Mitschang, “Overview on hybrid approaches to fault detection and diagnosis: Combining data- driven, physics-based and knowledge-based models,”Procedia Cirp, vol. 99, pp. 278–283, 2021

work page 2021
[4]

Intelligent condition monitoring of industrial plants: An overview of methodologies and uncertainty management strategies,

M. Ahang, T. Charter, M. Abbasi, M. Khadivi, O. Ogunfowora, and H. Najjaran, “Intelligent condition monitoring of industrial plants: An overview of methodologies and uncertainty management strategies,” arXiv preprint arXiv:2401.10266, 2024

work page arXiv 2024
[5]

Informed machine learning–a taxonomy and survey of integrating prior knowledge into learning systems,

L. V on Rueden, S. Mayer, K. Beckh, B. Georgiev, S. Giesselbach, R. Heese, B. Kirsch, J. Pfrommer, A. Pick, R. Ramamurthy,et al., “Informed machine learning–a taxonomy and survey of integrating prior knowledge into learning systems,”IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 1, pp. 614–633, 2021

work page 2021
[6]

A generic framework for decision fusion in fault detection and diagnosis,

K. Tidriri, T. Tiplica, N. Chatti, and S. Verron, “A generic framework for decision fusion in fault detection and diagnosis,”Engineering Appli- cations of Artificial Intelligence, vol. 71, pp. 73–86, 2018

work page 2018
[7]

Fusing physics-based and deep learning models for prognostics,

M. A. Chao, C. Kulkarni, K. Goebel, and O. Fink, “Fusing physics-based and deep learning models for prognostics,”Reliability Engineering & System Safety, vol. 217, p. 107961, 2022

work page 2022
[8]

Assumption-based design of hybrid diagnosis systems: Analyzing model-based and data-driven principles,

D. Jung and M. Krysander, “Assumption-based design of hybrid diagnosis systems: Analyzing model-based and data-driven principles,” inAnnual Conference of the PHM Society, vol. 16, 2024

work page 2024
[9]

Ensemble learning: A survey,

O. Sagi and L. Rokach, “Ensemble learning: A survey,”Wiley interdis- ciplinary reviews: data mining and knowledge discovery, vol. 8, no. 4, p. e1249, 2018

work page 2018
[10]

A literature review of fault diagnosis based on ensemble learning,

Z. Mian, X. Deng, X. Dong, Y . Tian, T. Cao, K. Chen, and T. Al Jaber, “A literature review of fault diagnosis based on ensemble learning,” Engineering Applications of Artificial Intelligence, vol. 127, p. 107357, 2024

work page 2024
[11]

A tutorial on conformal prediction.,

G. Shafer and V . V ovk, “A tutorial on conformal prediction.,”Journal of Machine Learning Research, vol. 9, no. 3, 2008

work page 2008
[12]

Uncertainty quantification in machine learning for engineering design and health prognostics: A tutorial,

V . Nemani, L. Biggio, X. Huan, Z. Hu, O. Fink, A. Tran, Y . Wang, X. Zhang, and C. Hu, “Uncertainty quantification in machine learning for engineering design and health prognostics: A tutorial,”Mechanical Systems and Signal Processing, vol. 205, p. 110796, 2023

work page 2023
[13]

Uncertainty-aware fault diagnosis with conformal prediction,

A. Heddoub, A. R. Diallo, L. Homri, J.-Y . Dantan, and A. Siadat, “Uncertainty-aware fault diagnosis with conformal prediction,”IFAC- PapersOnLine, vol. 59, no. 10, pp. 536–541, 2025

work page 2025
[14]

Query based learning via conformal uncertainty for rul prediction,

H. Wu, Y . Wang, Z. Tian, and M. Zuo, “Query based learning via conformal uncertainty for rul prediction,” in2025 IEEE International Conference on Prognostics and Health Management (ICPHM), pp. 1–8, IEEE, 2025

work page 2025
[15]

Canonical variate dissimilarity analysis for process incipient fault detection,

K. E. S. Pilario and Y . Cao, “Canonical variate dissimilarity analysis for process incipient fault detection,”IEEE Transactions on Industrial Informatics, vol. 14, no. 12, pp. 5308–5315, 2018

work page 2018
[16]

Transfer learning for process fault diagnosis: Knowledge transfer from simulation to physical processes,

W. Li, S. Gu, X. Zhang, and T. Chen, “Transfer learning for process fault diagnosis: Knowledge transfer from simulation to physical processes,” Computers & Chemical Engineering, vol. 139, p. 106904, 2020

work page 2020

[1] [1]

Condition monitoring using machine learning: A review of theory, applications, and recent advances,

O. Surucu, S. A. Gadsden, and J. Yawney, “Condition monitoring using machine learning: A review of theory, applications, and recent advances,” Expert Systems with Applications, vol. 221, p. 119738, 2023

work page 2023

[2] [2]

Model-based fault-detection and diagnosis–status and applications,

R. Isermann, “Model-based fault-detection and diagnosis–status and applications,”Annual Reviews in control, vol. 29, no. 1, pp. 71–85, 2005

work page 2005

[3] [3]

Overview on hybrid approaches to fault detection and diagnosis: Combining data- driven, physics-based and knowledge-based models,

Y . Wilhelm, P. Reimann, W. Gauchel, and B. Mitschang, “Overview on hybrid approaches to fault detection and diagnosis: Combining data- driven, physics-based and knowledge-based models,”Procedia Cirp, vol. 99, pp. 278–283, 2021

work page 2021

[4] [4]

Intelligent condition monitoring of industrial plants: An overview of methodologies and uncertainty management strategies,

M. Ahang, T. Charter, M. Abbasi, M. Khadivi, O. Ogunfowora, and H. Najjaran, “Intelligent condition monitoring of industrial plants: An overview of methodologies and uncertainty management strategies,” arXiv preprint arXiv:2401.10266, 2024

work page arXiv 2024

[5] [5]

Informed machine learning–a taxonomy and survey of integrating prior knowledge into learning systems,

L. V on Rueden, S. Mayer, K. Beckh, B. Georgiev, S. Giesselbach, R. Heese, B. Kirsch, J. Pfrommer, A. Pick, R. Ramamurthy,et al., “Informed machine learning–a taxonomy and survey of integrating prior knowledge into learning systems,”IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 1, pp. 614–633, 2021

work page 2021

[6] [6]

A generic framework for decision fusion in fault detection and diagnosis,

K. Tidriri, T. Tiplica, N. Chatti, and S. Verron, “A generic framework for decision fusion in fault detection and diagnosis,”Engineering Appli- cations of Artificial Intelligence, vol. 71, pp. 73–86, 2018

work page 2018

[7] [7]

Fusing physics-based and deep learning models for prognostics,

M. A. Chao, C. Kulkarni, K. Goebel, and O. Fink, “Fusing physics-based and deep learning models for prognostics,”Reliability Engineering & System Safety, vol. 217, p. 107961, 2022

work page 2022

[8] [8]

Assumption-based design of hybrid diagnosis systems: Analyzing model-based and data-driven principles,

D. Jung and M. Krysander, “Assumption-based design of hybrid diagnosis systems: Analyzing model-based and data-driven principles,” inAnnual Conference of the PHM Society, vol. 16, 2024

work page 2024

[9] [9]

Ensemble learning: A survey,

O. Sagi and L. Rokach, “Ensemble learning: A survey,”Wiley interdis- ciplinary reviews: data mining and knowledge discovery, vol. 8, no. 4, p. e1249, 2018

work page 2018

[10] [10]

A literature review of fault diagnosis based on ensemble learning,

Z. Mian, X. Deng, X. Dong, Y . Tian, T. Cao, K. Chen, and T. Al Jaber, “A literature review of fault diagnosis based on ensemble learning,” Engineering Applications of Artificial Intelligence, vol. 127, p. 107357, 2024

work page 2024

[11] [11]

A tutorial on conformal prediction.,

G. Shafer and V . V ovk, “A tutorial on conformal prediction.,”Journal of Machine Learning Research, vol. 9, no. 3, 2008

work page 2008

[12] [12]

Uncertainty quantification in machine learning for engineering design and health prognostics: A tutorial,

V . Nemani, L. Biggio, X. Huan, Z. Hu, O. Fink, A. Tran, Y . Wang, X. Zhang, and C. Hu, “Uncertainty quantification in machine learning for engineering design and health prognostics: A tutorial,”Mechanical Systems and Signal Processing, vol. 205, p. 110796, 2023

work page 2023

[13] [13]

Uncertainty-aware fault diagnosis with conformal prediction,

A. Heddoub, A. R. Diallo, L. Homri, J.-Y . Dantan, and A. Siadat, “Uncertainty-aware fault diagnosis with conformal prediction,”IFAC- PapersOnLine, vol. 59, no. 10, pp. 536–541, 2025

work page 2025

[14] [14]

Query based learning via conformal uncertainty for rul prediction,

H. Wu, Y . Wang, Z. Tian, and M. Zuo, “Query based learning via conformal uncertainty for rul prediction,” in2025 IEEE International Conference on Prognostics and Health Management (ICPHM), pp. 1–8, IEEE, 2025

work page 2025

[15] [15]

Canonical variate dissimilarity analysis for process incipient fault detection,

K. E. S. Pilario and Y . Cao, “Canonical variate dissimilarity analysis for process incipient fault detection,”IEEE Transactions on Industrial Informatics, vol. 14, no. 12, pp. 5308–5315, 2018

work page 2018

[16] [16]

Transfer learning for process fault diagnosis: Knowledge transfer from simulation to physical processes,

W. Li, S. Gu, X. Zhang, and T. Chen, “Transfer learning for process fault diagnosis: Knowledge transfer from simulation to physical processes,” Computers & Chemical Engineering, vol. 139, p. 106904, 2020

work page 2020