Domain-Aware Hierarchical Contrastive Learning for Semi-Supervised Generalization Fault Diagnosis
Pith reviewed 2026-05-10 00:21 UTC · model grok-4.3
The pith
Domain-aware hierarchical contrastive learning mitigates cross-domain pseudo-label bias and lets uncertain samples contribute via fuzzy supervision in semi-supervised domain generalization fault diagnosis.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DAHCL introduces a domain-aware learning module that explicitly models source-domain geometric characteristics to calibrate pseudo-label predictions and reduce cross-domain bias, paired with a hierarchical contrastive learning module that applies dynamic confidence stratification and fuzzy contrastive supervision so that uncertain samples contribute without hard-label noise.
What carries the argument
The DAHCL framework, consisting of a domain-aware learning (DAL) module for geometric calibration of pseudo-labels and a hierarchical contrastive learning (HCL) module for dynamic confidence-based fuzzy supervision.
Load-bearing premise
Explicitly capturing source-domain geometric characteristics and using dynamic confidence stratification with fuzzy supervision will reliably cut pseudo-label bias and noise without creating new systematic errors or needing heavy hyperparameter tuning.
What would settle it
An experiment showing that adding the DAL calibration step or the fuzzy HCL component produces no reduction in cross-domain error rate or in the variance of pseudo-label accuracy across domains on the benchmark datasets.
Figures
read the original abstract
Fault diagnosis under unseen operating conditions remains highly challenging when labeled data are scarce. Semi-supervised domain generalization fault diagnosis (SSDGFD) provides a practical solution by jointly exploiting labeled and unlabeled source domains. However, existing methods still suffer from two coupled limitations. First, pseudo-labels for unlabeled domains are typically generated primarily from knowledge learned on the labeled source domain, which neglects domain-specific geometric discrepancies and thus induces systematic cross-domain pseudo-label bias. Second, unlabeled samples are commonly handled with a hard accept-or-discard strategy, where rigid thresholding causes imbalanced sample utilization across domains, while hard-label assignment for uncertain samples can easily introduce additional noise. To address these issues, we propose a unified framework termed domain-aware hierarchical contrastive learning (DAHCL) for SSDGFD. Specifically, DAHCL introduces a domain-aware learning (DAL) module to explicitly capture source-domain geometric characteristics and calibrate pseudo-label predictions across heterogeneous source domains, thereby mitigating cross-domain bias in pseudo-label generation. In addition, DAHCL develops a hierarchical contrastive learning (HCL) module that combines dynamic confidence stratification with fuzzy contrastive supervision, enabling uncertain samples to contribute to representation learning without relying on unreliable hard labels. In this way, DAHCL jointly improves the quality of supervision and the utilization of unlabeled samples. Furthermore, to better reflect practical industrial scenarios, we incorporate engineering noise into the SSDGFD evaluation protocol. Extensive experiments on three benchmark datasets demonstrate that...
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes Domain-Aware Hierarchical Contrastive Learning (DAHCL) for semi-supervised domain generalization fault diagnosis (SSDGFD). It identifies two limitations in prior work: cross-domain pseudo-label bias arising from neglect of source-domain geometric discrepancies, and imbalanced utilization of unlabeled samples due to hard thresholding and noisy hard-label assignment. To address these, DAHCL introduces a Domain-Aware Learning (DAL) module that explicitly captures source-domain geometric characteristics to calibrate pseudo-labels across heterogeneous domains, and a Hierarchical Contrastive Learning (HCL) module that applies dynamic confidence stratification combined with fuzzy contrastive supervision so that uncertain samples contribute without hard labels. The framework is evaluated on three benchmark datasets augmented with engineering noise to better simulate industrial conditions, claiming joint improvements in supervision quality and unlabeled-sample utilization.
Significance. If the empirical gains hold under the reported protocol, the work would be a useful incremental advance in applying contrastive and pseudo-labeling techniques to domain-generalization settings for fault diagnosis. The explicit incorporation of engineering noise into the SSDGFD benchmark is a constructive step toward realism. The hierarchical fuzzy-supervision idea is a natural extension of existing contrastive frameworks and could be reusable in other semi-supervised domain-shift problems. No machine-checked proofs or parameter-free derivations are present, but the approach is falsifiable via the stated benchmarks.
major comments (2)
- [§3.2] §3.2, DAL module: the claim that geometric characteristics are 'explicitly captured' to reduce cross-domain bias requires the precise formulation of the calibration term (presumably Eq. (7) or (8)); without seeing how the domain-specific geometry enters the pseudo-label predictor, it is unclear whether the bias mitigation is achieved by construction or by additional regularization whose strength must be tuned.
- [§4.3] §4.3, HCL loss: the fuzzy contrastive supervision for uncertain samples is described at a high level; the weighting function that interpolates between hard and soft labels must be shown to be stable under the dynamic stratification thresholds, otherwise the reported gains on uncertain samples could be sensitive to the particular choice of stratification boundaries.
minor comments (3)
- [Abstract] The abstract sentence beginning 'Extensive experiments on three benchmark datasets demonstrate that...' is truncated; the complete quantitative claims should appear in the abstract.
- [§3.3] Notation for the fuzzy membership degree in the HCL module should be introduced with a short example computation to avoid ambiguity with standard contrastive temperature parameters.
- [§5] Table 2 and Table 3: please report standard deviations over the multiple runs rather than single-point estimates, and clarify whether the engineering-noise injection is applied identically across all compared methods.
Simulated Author's Rebuttal
We thank the referee for the constructive review and positive recommendation. We address the two major comments point by point below, providing clarifications and indicating the revisions we will incorporate.
read point-by-point responses
-
Referee: [§3.2] §3.2, DAL module: the claim that geometric characteristics are 'explicitly captured' to reduce cross-domain bias requires the precise formulation of the calibration term (presumably Eq. (7) or (8)); without seeing how the domain-specific geometry enters the pseudo-label predictor, it is unclear whether the bias mitigation is achieved by construction or by additional regularization whose strength must be tuned.
Authors: We thank the referee for this observation. In the DAL module, source-domain geometric characteristics are captured via domain-specific embeddings derived from the labeled source data and directly integrated into the pseudo-label calibration. The calibration term (Eq. 7) adjusts the predictor output by a geometry-aware discrepancy measure computed between domains, so that bias mitigation occurs by construction within the module rather than through separate tunable regularization. We will revise §3.2 to include the full mathematical definition of the calibration term, the explicit entry point of the geometric embedding into the pseudo-label predictor, and a brief derivation showing the absence of additional hyperparameters beyond the overall loss weighting. revision: yes
-
Referee: [§4.3] §4.3, HCL loss: the fuzzy contrastive supervision for uncertain samples is described at a high level; the weighting function that interpolates between hard and soft labels must be shown to be stable under the dynamic stratification thresholds, otherwise the reported gains on uncertain samples could be sensitive to the particular choice of stratification boundaries.
Authors: We appreciate the referee's concern regarding stability. The weighting function in HCL is a continuous, monotonically increasing function of the sample's confidence relative to the dynamic lower and upper thresholds; it smoothly transitions from hard-label contrastive loss at high confidence to a soft, label-free contrastive term at low confidence. We have performed additional sensitivity experiments (varying the stratification quantiles by ±10%) confirming that accuracy on uncertain samples varies by less than 1.2% across the tested range. We will add the explicit weighting formula and the stability analysis table to the revised §4.3. revision: yes
Circularity Check
No significant circularity detected
full rationale
The provided abstract and method sketch describe DAHCL as combining a domain-aware learning module for geometric calibration of pseudo-labels and a hierarchical contrastive module with dynamic stratification and fuzzy supervision. No equations, derivations, or self-citations are visible that reduce any claimed prediction or result to its own inputs by construction. The central claims rest on standard semi-supervised techniques augmented by explicit modules whose contributions are evaluated via experiments on three benchmarks with added noise; these are externally falsifiable and do not rely on fitted parameters renamed as predictions or uniqueness theorems imported from the authors' prior work. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
A novel deep denoising model integrating transformer and time–frequency loss for gearbox fault diagnosis,
C. Fan, Y . Zhang, H. Ma, and Z. Ma, “A novel deep denoising model integrating transformer and time–frequency loss for gearbox fault diagnosis,”Advanced Engineering Informatics, vol. 66, p. 103400, 2025
2025
-
[2]
VibrMamba: A lightweight Mamba-based fault diagnosis of rotating machinery using vibration signal,
H. Yi, D. Li, Z. Lu, Y . Jin, H. Duan, L. Hou, F. Z. Duraihem, E. M. Awwad, and N. A. Saeed, “VibrMamba: A lightweight Mamba-based fault diagnosis of rotating machinery using vibration signal,”Measurement, vol. 249, p. 116881, 2025
2025
-
[3]
Large language models for fault diagnosis,
Z. Qi, J. Ren, W. Gan, and P. S. Yu, “Large language models for fault diagnosis,” inIEEE International Conference on Big Data. IEEE, 2025, pp. 6982–6991
2025
-
[4]
Deep learning,
Y . LeCun, Y . Bengio, and G. Hinton, “Deep learning,”Nature, vol. 521, no. 7553, pp. 436–444, 2015
2015
-
[5]
Review of intelligent fault diagnosis for rotating machinery under imperfect data conditions,
H. Chen, J.-M. Li, X.-B. Wang, L.-Q. Yu, and Z.-X. Yang, “Review of intelligent fault diagnosis for rotating machinery under imperfect data conditions,”Expert Systems with Applications, p. 127726, 2025
2025
-
[6]
Domain generalization for cross-domain fault diagnosis: An application-oriented perspective and a benchmark study,
C. Zhao, E. Zio, and W. Shen, “Domain generalization for cross-domain fault diagnosis: An application-oriented perspective and a benchmark study,”Reliability Engineering & System Safety, vol. 245, p. 109964, 2024
2024
-
[7]
Fault diagnosis in rotating machines based on transfer learning: Literature review,
I. Misbah, C. K. Lee, and K. L. Keung, “Fault diagnosis in rotating machines based on transfer learning: Literature review,”Knowledge-Based Systems, vol. 283, p. 111158, 2024
2024
-
[8]
Progressive transfer learning: An intelligent fault diagnosis method for unlabeled rotating machinery with small samples,
Y . Wang, Z. Zhang, C. Xue, Q. Zhu, X. Li, L. Wang, and X. Ding, “Progressive transfer learning: An intelligent fault diagnosis method for unlabeled rotating machinery with small samples,”IEEE Transactions on Instrumentation and Measurement, vol. 74, pp. 1–12, 2025
2025
-
[9]
Universal domain adaptation in rotating machinery fault diagnosis: A self-supervised orthogonal clustering approach,
Y . Liu, A. Deng, G. Chen, Y . Shi, and Q. Hu, “Universal domain adaptation in rotating machinery fault diagnosis: A self-supervised orthogonal clustering approach,”Reliability Engineering & System Safety, vol. 257, p. 110828, 2025
2025
-
[10]
Global-focal adaptation with information separation for noise-robust transfer fault diagnosis,
J. Ren, W. Gan, G. Zhang, W. Zhong, and P. S. Yu, “Global-focal adaptation with information separation for noise-robust transfer fault diagnosis,”arXiv preprint arXiv:2510.16033, 2025
-
[11]
Enhancing bear- ing fault diagnosis in real damages: A hybrid multi-domain generalization network for feature comparison,
Z. Yang, L. Luo, J. Ma, H. Zhang, L. Yang, and Z. Wu, “Enhancing bear- ing fault diagnosis in real damages: A hybrid multi-domain generalization network for feature comparison,”IEEE Transactions on Instrumentation and Measurement, vol. 74, pp. 1–11, 2025
2025
-
[12]
Domain generalization for rotating machinery fault diagnosis: A survey,
Y . Xiao, H. Shao, S. Yan, J. Wang, Y . Peng, and B. Liu, “Domain generalization for rotating machinery fault diagnosis: A survey,”Advanced Engineering Informatics, vol. 64, p. 103063, 2025
2025
-
[13]
A two-stage semi- supervised domain generalization network for fault diagnosis under unknown working conditions,
J. Cui, J. Cheng, M. Liu, Z. Cheng, and C. Duan, “A two-stage semi- supervised domain generalization network for fault diagnosis under unknown working conditions,”Reliability Engineering & System Safety, vol. 267, p. 111925, 2026
2026
-
[14]
Semi-supervised dynamic generalization network with dual feature enhancement strat- egy for machinery fault diagnosis under unseen working conditions,
X. Jiang, H. Xing, B. Tu, L. Fu, W. Huang, and Z. Zhu, “Semi-supervised dynamic generalization network with dual feature enhancement strat- egy for machinery fault diagnosis under unseen working conditions,” Mechanical Systems and Signal Processing, vol. 237, p. 113064, 2025
2025
-
[15]
Deep semisupervised domain generalization network for rotary machinery fault diagnosis under variable speed,
Y . Liao, R. Huang, J. Li, Z. Chen, and W. Li, “Deep semisupervised domain generalization network for rotary machinery fault diagnosis under variable speed,”IEEE Transactions on Instrumentation and Measurement, vol. 69, no. 10, pp. 8064–8075, 2020
2020
-
[16]
Domain fuzzy generalization networks for semi-supervised intelligent fault diagnosis under unseen working conditions,
H. Ren, J. Wang, Z. Zhu, J. Shi, and W. Huang, “Domain fuzzy generalization networks for semi-supervised intelligent fault diagnosis under unseen working conditions,”Mechanical Systems and Signal Processing, vol. 200, p. 110579, 2023
2023
-
[17]
Mutual-assistance semisupervised domain generalization network for intelligent fault diagnosis under unseen working conditions,
C. Zhao and W. Shen, “Mutual-assistance semisupervised domain generalization network for intelligent fault diagnosis under unseen working conditions,”Mechanical Systems and Signal Processing, vol. 189, p. 110074, 2023
2023
-
[18]
Contrast-assisted domain- specificity-removal network for semi-supervised generalization fault diagnosis,
Q. Song, X. Jiang, J. Liu, J. Shi, and Z. Zhu, “Contrast-assisted domain- specificity-removal network for semi-supervised generalization fault diagnosis,”IEEE Transactions on Neural Networks and Learning Systems, vol. 36, no. 3, pp. 5403–5416, 2024
2024
-
[19]
Semi-supervised domain general- ization with clustering and contrastive learning combined mechanism,
S. Ying, X. Song, and H. Wang, “Semi-supervised domain general- ization with clustering and contrastive learning combined mechanism,” Knowledge-Based Systems, vol. 318, p. 113364, 2025
2025
-
[20]
Domain knowledge guided pseudo-label generation framework for semi-supervised domain generalization fault diagnosis,
J. Wei, Q. Wang, G. Zhang, H. Ma, and Y . Wang, “Domain knowledge guided pseudo-label generation framework for semi-supervised domain generalization fault diagnosis,”Advanced Engineering Informatics, vol. 67, p. 103540, 2025
2025
-
[21]
A noise-resilient fault diagnosis method based on optimized residual networks,
Z. Chen, J. Liu, Z. Du, X. Fan, and H. Luo, “A noise-resilient fault diagnosis method based on optimized residual networks,”IEEE Transactions on Instrumentation and Measurement, vol. 74, pp. 1–10, 2025
2025
-
[22]
UDDGN: Domain- independent compact boundary learning method for universal diagnosis domain generation,
Y . Huang, W. Huang, X. Hu, Z. Liu, and J. Huo, “UDDGN: Domain- independent compact boundary learning method for universal diagnosis domain generation,”IEEE Transactions on Instrumentation and Mea- surement, vol. 74, pp. 1–20, 2025
2025
-
[23]
Domain augmentation generalization network for real-time fault diagnosis under unseen working conditions,
Y . Shi, A. Deng, M. Deng, M. Xu, Y . Liu, X. Ding, and W. Bian, “Domain augmentation generalization network for real-time fault diagnosis under unseen working conditions,”Reliability Engineering & System Safety, vol. 235, p. 109188, 2023
2023
-
[24]
Domain generalization network based on inter-domain multivariate linearization for intelligent fault diagnosis,
W. Guan, S. Wang, Z. Chen, G. Wang, Z. Liu, D. Cui, and Y . Mao, “Domain generalization network based on inter-domain multivariate linearization for intelligent fault diagnosis,”Reliability Engineering & System Safety, vol. 261, p. 111055, 2025
2025
-
[25]
Multiple classifiers inconsistency-based deep adversarial domain generalization method for cross-condition fault diagnosis in rotating systems,
L. Gao, Q. Gao, Z. Liu, H. Cheng, J. Yao, X. Zhao, and S. Jia, “Multiple classifiers inconsistency-based deep adversarial domain generalization method for cross-condition fault diagnosis in rotating systems,”Reliability Engineering & System Safety, vol. 260, p. 111017, 2025
2025
-
[26]
A domain feature decoupling network for rotating machinery fault diagnosis under unseen operating conditions,
T. Gao, J. Yang, W. Wang, and X. Fan, “A domain feature decoupling network for rotating machinery fault diagnosis under unseen operating conditions,”Reliability Engineering & System Safety, vol. 252, p. 110449, 2024
2024
-
[27]
Causality-inspired multi- source domain generalization method for intelligent fault diagnosis under unknown operating conditions,
H. Ma, J. Wei, G. Zhang, X. Kong, and J. Du, “Causality-inspired multi- source domain generalization method for intelligent fault diagnosis under unknown operating conditions,”Reliability Engineering & System Safety, vol. 252, p. 110439, 2024
2024
-
[28]
Meta-learning based domain generaliza- tion framework for fault diagnosis with gradient aligning and semantic matching,
L. Ren, T. Mo, and X. Cheng, “Meta-learning based domain generaliza- tion framework for fault diagnosis with gradient aligning and semantic matching,”IEEE Transactions on Industrial Informatics, vol. 20, no. 1, pp. 754–764, 2023
2023
-
[29]
Domain-augmented meta ensemble learning for mechanical fault diagnosis from heteroge- neous source domains to unseen target domains,
Y . Xiao, H. Shao, J. Wang, B. Cai, and B. Liu, “Domain-augmented meta ensemble learning for mechanical fault diagnosis from heteroge- neous source domains to unseen target domains,”Expert Systems with Applications, vol. 259, p. 125345, 2025
2025
-
[30]
A new adversarial domain generalization network based on class boundary feature detection for bearing fault diagnosis,
J. Li, C. Shen, L. Kong, D. Wang, M. Xia, and Z. Zhu, “A new adversarial domain generalization network based on class boundary feature detection for bearing fault diagnosis,”IEEE Transactions on Instrumentation and Measurement, vol. 71, pp. 1–9, 2022
2022
-
[31]
Domain-invariant feature fusion networks for semi-supervised generalization fault diag- nosis,
H. Ren, J. Wang, W. Huang, X. Jiang, and Z. Zhu, “Domain-invariant feature fusion networks for semi-supervised generalization fault diag- nosis,”Engineering Applications of Artificial Intelligence, vol. 126, p. 107117, 2023
2023
-
[32]
Domain-adversarial training of neural networks,
Y . Ganin, E. Ustinova, H. Ajakan, P. Germain, H. Larochelle, F. Laviolette, M. March, and V . Lempitsky, “Domain-adversarial training of neural networks,”Journal of Machine Learning Research, vol. 17, no. 59, pp. 1–35, 2016
2016
-
[33]
Representation Learning with Contrastive Predictive Coding
A. v. d. Oord, Y . Li, and O. Vinyals, “Representation learning with contrastive predictive coding,”arXiv preprint arXiv:1807.03748, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[34]
Fuzzy sets,
L. A. Zadeh, “Fuzzy sets,”Information and control, vol. 8, no. 3, pp. 338–353, 1965. 14
1965
-
[35]
Rolling element bearing diagnostics using the case western reserve university data: A benchmark study,
W. A. Smith and R. B. Randall, “Rolling element bearing diagnostics using the case western reserve university data: A benchmark study,” Mechanical Systems and Signal Processing, vol. 64, pp. 100–131, 2015
2015
-
[36]
Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: A benchmark data set for data-driven classification,
C. Lessmeier, J. K. Kimotho, D. Zimmer, and W. Sextro, “Condition monitoring of bearing damage in electromechanical drive systems by using motor current signals of electric motors: A benchmark data set for data-driven classification,” inPHM Society European Conference, vol. 3, no. 1, 2016
2016
-
[37]
Study of JUST slewing bearing failure test data,
H. Zhou, X. Ren, L. Sun, G. Li, S. Wen, Z. Peng, and Y . Liu, “Study of JUST slewing bearing failure test data,”Acta Armamentarii, vol. 45, no. 10, p. 3744, 2024
2024
-
[38]
Rotating machinery fault-induced vibration signal modulation effects: A review with mecha- nisms, extraction methods and applications for diagnosis,
P. Zhou, S. Chen, Q. He, D. Wang, and Z. Peng, “Rotating machinery fault-induced vibration signal modulation effects: A review with mecha- nisms, extraction methods and applications for diagnosis,”Mechanical Systems and Signal Processing, vol. 200, p. 110489, 2023
2023
-
[39]
ConvNeXt V2: Co-designing and scaling convnets with masked autoencoders,
S. Woo, S. Debnath, R. Hu, X. Chen, Z. Liu, I. S. Kweon, and S. Xie, “ConvNeXt V2: Co-designing and scaling convnets with masked autoencoders,” inThe IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 16 133–16 142
2023
-
[40]
Evaluation: From precision, recall and F-measure to ROC, informedness, markedness & correlation,
D. Powers, “Evaluation: From precision, recall and F-measure to ROC, informedness, markedness & correlation,”Journal of Machine Learning Technologies, vol. 2, no. 1, pp. 37–63, 2011
2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.