Replay-guided Test-time Adaptation for Fault Diagnosis Under Unseen Operating Conditions
Pith reviewed 2026-05-22 04:25 UTC · model grok-4.3
The pith
A dual-memory replay mechanism lets fault diagnosis models adapt online to unseen loads and speeds while keeping prior knowledge.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
An integrated offline domain-generalization stage via adversarial feature alignment followed by an online dual-memory replay stage produces a model that adapts to distribution shifts from varying operating conditions while limiting catastrophic forgetting, as demonstrated by competitive results on a real-world motor dataset.
What carries the argument
The dual-memory replay mechanism stores selected high-confidence online pseudo-labeled samples and interleaves them with historical offline data during test-time updates to support adaptation and knowledge retention.
If this is right
- Models can track continuous changes in load and speed during operation rather than requiring periodic full retraining.
- Retention of offline-learned knowledge occurs alongside adaptation to new data streams.
- Industrial fault diagnosis systems become more reliable when operating conditions vary unpredictably.
Where Pith is reading between the lines
- The same replay structure might transfer to other time-series monitoring tasks such as vibration analysis in turbines or pumps.
- Performance would likely degrade if the initial offline model lacks sufficient invariance, pointing to a need for stronger domain-generalization baselines.
- Extending the memory buffer to include uncertainty estimates could further reduce confirmation of erroneous labels.
Load-bearing premise
The offline adversarial training yields a model with enough initial robustness that online replay can track new distributions without locking in wrong pseudo-labels or erasing earlier knowledge.
What would settle it
Run the online stage on a sequence of motor signals where most early pseudo-labels are deliberately incorrect and measure whether accuracy collapses or remains stable.
Figures
read the original abstract
In modern industrial systems, machinery frequently operates under dynamic environments with continuously varying loads and speeds. Consequently, deep learning-based fault diagnosis models often suffer from severe performance degradation under unseen operating conditions due to complex data distribution shifts. Since existing methods predominantly rely on static offline training, they lack the capability to dynamically adapt to these continuous variations. To address this issue, an integrated framework combining offline domain generalization (DG) and online test-time adaptation (OTTA) is proposed. Initially, a model with preliminary generalization capability is obtained offline by extracting domain-invariant features via adversarial learning. During the online phase, a dual-memory replay mechanism is developed. By selectively storing high-confidence online pseudo-labeled samples and replaying them with historical offline data, the model facilitates adaptation to changing data distributions and helps reduce forgetting of previously learned knowledge Experiments on a real-world motor dataset show that the proposed approach achieves competitive performance under the considered unseen operating conditions.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes an integrated framework for fault diagnosis under unseen operating conditions in dynamic industrial environments. It first performs offline adversarial training to learn domain-invariant features and obtain a model with preliminary generalization capability. In the online phase, a dual-memory replay mechanism selectively stores high-confidence pseudo-labeled samples from the test stream and replays them together with historical offline data to enable test-time adaptation while reducing catastrophic forgetting. The central claim, supported by experiments on a real-world motor dataset, is that the approach achieves competitive performance under the considered load and speed shifts.
Significance. If the empirical results hold with proper validation of pseudo-label quality and ablation controls, the work addresses a practically important gap between static offline domain generalization and online adaptation for fault diagnosis. The dual-memory replay idea is a reasonable way to balance adaptation and retention, and successful demonstration on real motor data would be a useful contribution to signal-processing applications in variable-condition machinery monitoring.
major comments (2)
- [Online adaptation / dual-memory replay] Online adaptation description (method section): The dual-memory replay stores only high-confidence pseudo-labeled samples, yet no measurement or analysis of pseudo-label accuracy on the online motor stream under the described severe load/speed shifts is provided. This is load-bearing for the claim that replay avoids confirmation bias and error reinforcement, because the offline adversarially trained model may still produce high-confidence but incorrect labels on unseen conditions.
- [Experiments] Experiments section: The abstract asserts competitive performance, but the manuscript supplies no quantitative metrics (accuracy, F1, etc.), no baseline comparisons, no ablation results on the replay buffer or confidence threshold, and no details on how the threshold was chosen. Without these, the central empirical claim cannot be assessed.
minor comments (2)
- [Abstract] Abstract: missing punctuation after 'knowledge' ('knowledge Experiments' should read 'knowledge. Experiments').
- [Method] Method description is entirely procedural; adding a compact loss equation for the adversarial domain-invariant term and a replay objective would improve clarity and reproducibility.
Simulated Author's Rebuttal
We sincerely thank the referee for the thoughtful and constructive comments on our manuscript. The feedback highlights important aspects for strengthening the validation of our proposed framework, particularly regarding pseudo-label quality and experimental details. We will revise the manuscript to incorporate these suggestions.
read point-by-point responses
-
Referee: [Online adaptation / dual-memory replay] Online adaptation description (method section): The dual-memory replay stores only high-confidence pseudo-labeled samples, yet no measurement or analysis of pseudo-label accuracy on the online motor stream under the described severe load/speed shifts is provided. This is load-bearing for the claim that replay avoids confirmation bias and error reinforcement, because the offline adversarially trained model may still produce high-confidence but incorrect labels on unseen conditions.
Authors: We agree with the referee that an analysis of pseudo-label accuracy is essential to support the effectiveness of the dual-memory replay in avoiding confirmation bias. Although the current manuscript focuses on the overall performance, we will add a new subsection or figure in the experiments section that evaluates the accuracy of the high-confidence pseudo-labels generated during the online phase. This analysis will be conducted using the available ground-truth labels from the motor dataset under various unseen load and speed conditions. We believe this will confirm that the confidence-based selection helps maintain label quality. revision: yes
-
Referee: [Experiments] Experiments section: The abstract asserts competitive performance, but the manuscript supplies no quantitative metrics (accuracy, F1, etc.), no baseline comparisons, no ablation results on the replay buffer or confidence threshold, and no details on how the threshold was chosen. Without these, the central empirical claim cannot be assessed.
Authors: We appreciate this observation and acknowledge that the experimental results in the submitted version are presented primarily through figures without accompanying quantitative tables or detailed ablations. In the revised manuscript, we will include a comprehensive table reporting accuracy, F1-score, and other relevant metrics for our method and several baselines across different operating condition shifts. We will also add ablation studies varying the replay buffer size and confidence threshold, along with an explanation of how the threshold was selected based on preliminary experiments to balance adaptation and reliability. These additions will allow for a clearer assessment of the central claims. revision: yes
Circularity Check
No circularity: procedural method with experimental validation only
full rationale
The paper describes an integrated framework of offline adversarial domain generalization followed by online dual-memory replay using high-confidence pseudo-labels. No equations, derivations, or mathematical claims are present in the provided text. Performance is asserted solely via experiments on a real-world motor dataset under unseen conditions. This is self-contained against external benchmarks with no reduction of any result to its own inputs by construction, no self-citation load-bearing steps, and no fitted quantities renamed as predictions. The reader's assessment of score 2.0 is consistent with the absence of any derivational chain.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Adversarial learning extracts domain-invariant features that generalize to unseen operating conditions.
- domain assumption High-confidence online pseudo-labeled samples can be safely replayed with historical data without reinforcing errors.
invented entities (1)
-
Dual-memory replay mechanism
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
an integrated framework combining offline domain generalization (DG) and online test-time adaptation (OTTA) ... dual-memory replay mechanism ... high-confidence online pseudo-labeled samples
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
S. Hu, Z. Liu, M. Li, and X. He, “Cadm+: Confusion-based learning framework with drift detection and adaptation for real-time safety assessment,”IEEE Transactions on Neural Networks and Learning Systems, vol. 36, no. 3, pp. 5126–5139, 2024
work page 2024
-
[2]
P. Han, S. Chen, Z. Liu, and X. He, “Imbalanced real-time fault diagnosis based on minority-prioritized online semi-supervised random vector functional link network,”IEEE Transactions on Instrumentation and Measurement, vol. 73, pp. 1–10, 2024
work page 2024
-
[3]
X. Lu, W. Xu, Q. Jiang, Y . Shen, F. Xu, and Q. Zhu, “Category- aware dual adversarial domain adaptation model for rolling bearings fault diagnosis under variable conditions,”Measurement Science and Technology, vol. 34, p. 095104, Sept. 2023
work page 2023
-
[4]
H. Su, X. Yang, L. Xiang, A. Hu, and Y . Xu, “A novel method based on deep transfer unsupervised learning network for bearing fault diagnosis under variable working condition of unequal quantity,”Knowledge- Based Systems, vol. 242, p. 108381, Apr. 2022
work page 2022
-
[5]
S. Han and Z. Feng, “Intelligent fault diagnosis of planetary gearboxes under time-varying conditions based on dynamic adversarial balance adaptation with multi-label information confusion,”Measurement Sci- ence and Technology, vol. 34, p. 065014, June 2023
work page 2023
-
[6]
Multi-condition fault diagnosis of dynamic systems: A survey, insights, and prospects,
P. Han, Z. Liu, X. He, S. X. Ding, and D. Zhou, “Multi-condition fault diagnosis of dynamic systems: A survey, insights, and prospects,” IEEE Transactions on Automation Science and Engineering, vol. 22, pp. 15728–15744, 2025
work page 2025
-
[7]
X. Yang, X. Yuan, T. Ye, W. Zhu, F. Zhou, and J. Jin, “PSNN- TADA: Prototype and Stochastic Neural Network-Based Twice Adver- sarial Domain Adaptation for Fault Diagnosis Under Varying Working Conditions,”IEEE Transactions on Instrumentation and Measurement, vol. 73, pp. 1–12, 2024
work page 2024
-
[8]
Y . Zhang, J. Ding, Y . Li, Z. Ren, and K. Feng, “Multi-modal data cross- domain fusion network for gearbox fault diagnosis under variable op- erating conditions,”Engineering Applications of Artificial Intelligence, vol. 133, p. 108236, July 2024
work page 2024
-
[9]
A. Choudhary, T. Mian, S. Fatima, and B. K. Panigrahi, “Passive Thermography Based Bearing Fault Diagnosis Using Transfer Learning TABLE I: DIAGNOSTIC PERFORMANCE OF ALL METHODS ACROSS VARIOUS EXPERIMENTAL SETTINGS. Injection Spot Fault Type Operation Condition Method Baseline Without Update Without ReplayProposed 30% Dynamic Eccentricity UC 20Nm 0.788±0....
work page 2060
-
[10]
Q. Qian, J. Zhou, and Y . Qin, “Relationship Transfer Domain General- ization Network for Rotating Machinery Fault Diagnosis Under Different Working Conditions,”IEEE Transactions on Industrial Informatics, vol. 19, pp. 9898–9908, Sept. 2023
work page 2023
-
[11]
Q. Li, L. Chen, L. Kong, D. Wang, M. Xia, and C. Shen, “Cross- domain augmentation diagnosis: An adversarial domain-augmented gen- eralization method for fault diagnosis under unseen working conditions,” Reliability Engineering & System Safety, vol. 234, p. 109171, June 2023
work page 2023
-
[12]
J. Chen, W. Hu, D. Cao, Z. Zhang, Z. Chen, and F. Blaabjerg, “A Meta- Learning Method for Electric Machine Bearing Fault Diagnosis Under Varying Working Conditions With Limited Data,”IEEE Transactions on Industrial Informatics, vol. 19, pp. 2552–2564, Mar. 2023
work page 2023
-
[13]
Incremental learning-enabled fault diagnosis of dynamic systems: A comprehensive review,
Z. Liu, X. He, B. Huang, and D. Zhou, “Incremental learning-enabled fault diagnosis of dynamic systems: A comprehensive review,”IEEE Transactions on Cybernetics, vol. 55, no. 12, pp. 5633–5649, 2025
work page 2025
-
[14]
W. Li, Z. Liu, P. Han, X. He, L. Wang, and T. Zhang, “A dynamic anchor- based online semi-supervised learning approach for fault diagnosis under variable operating conditions,”Neurocomputing, vol. 638, p. 130137, 2025
work page 2025
-
[15]
In search of lost online test-time adaptation: A survey,
Z. Wang, Y . Luo, L. Zheng, Z. Chen, S. Wang, and Z. Huang, “In search of lost online test-time adaptation: A survey,”International Journal of Computer Vision, vol. 133, no. 3, pp. 1106–1139, 2025
work page 2025
-
[16]
Online adaptive fault diagnosis with test-time domain adaptation,
K. Wu, J. Li, L. Meng, F. Li, and K. Lu, “Online adaptive fault diagnosis with test-time domain adaptation,”IEEE Transactions on Industrial Informatics, vol. 21, no. 1, pp. 107–117, 2025
work page 2025
-
[17]
Domain-adversarial training of neural networks,
Y . Ganin, E. Ustinova, H. Ajakan, P. Germain, H. Larochelle, F. Lavi- olette, M. March, and V . Lempitsky, “Domain-adversarial training of neural networks,”Journal of Machine Learning Research, vol. 17, no. 59, pp. 1–35, 2016
work page 2016
-
[18]
S. Chen, Z. Liu, C. Li, D. Zou, X. He, and D. Zhou, “Multi-mode fault diagnosis datasets of three-phase asynchronous motor under variable working conditions,”arXiv preprint arXiv:2601.02278, 2026
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.