pith. sign in

arxiv: 2605.15722 · v1 · pith:TUBBULKInew · submitted 2026-05-15 · 💻 cs.LG · cs.AI· cs.CV· eess.SP

Bidirectional Fusion Guided by Cardiac Patterns for Semi-Supervised ECG Segmentation

Pith reviewed 2026-05-20 19:45 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.CVeess.SP
keywords semi-supervised segmentationECG delineationCutMixbidirectional fusioncardiac patternsdeep learningmedical signal processingwaveform segmentation
0
0 comments X

The pith

CardioMix uses cardiac pattern-guided bidirectional CutMix to improve semi-supervised ECG segmentation by exchanging information between labeled and unlabeled data while keeping samples physiologically valid.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces CardioMix to tackle the scarcity of annotated ECG data for training deep learning models in waveform segmentation. Conventional semi-supervised methods emphasize consistency on unlabeled data but miss opportunities for direct information exchange with labeled examples. CardioMix applies a bidirectional CutMix strategy steered by cardiac patterns, adding realistic unlabeled variations to the labeled set while delivering stronger supervision back to the unlabeled set. The cardiac guidance keeps every mixed sample physiologically plausible. Experiments on the SemiSegECG benchmark show the approach works as a plug-and-play addition to existing semi-supervised algorithms and beats prior CutMix variants across datasets and labeling ratios.

Core claim

CardioMix is a bidirectional CutMix strategy guided by cardiac patterns for ECG segmentation. This approach enriches the labeled set with realistic variations from unlabeled data while simultaneously applying stronger supervisory signals to the unlabeled set, as the cardiac pattern-guided mixing ensures all augmented samples remain physiologically meaningful.

What carries the argument

CardioMix, a bidirectional CutMix framework guided by cardiac patterns that performs realistic information exchange between labeled and unlabeled ECG data.

If this is right

  • CardioMix functions as a plug-and-play module that is compatible with various existing semi-supervised segmentation algorithms.
  • It consistently outperforms existing CutMix-based fusion strategies on diverse ECG datasets and across different fractions of labeled data.
  • Labeled training sets receive realistic variations drawn from unlabeled recordings.
  • Unlabeled data receive stronger supervisory signals through the same mixing process.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Domain-specific physiological constraints could guide augmentation in other biosignal tasks such as EEG or EMG segmentation.
  • The same bidirectional fusion idea might reduce annotation needs in related medical imaging domains that have clear structural priors.
  • Automatic extraction of guiding patterns, rather than relying on explicit cardiac annotations, could broaden applicability.

Load-bearing premise

The cardiac pattern-guided mixing ensures all augmented samples remain physiologically meaningful, allowing effective bidirectional information exchange between labeled and unlabeled sets without introducing invalid ECG variations.

What would settle it

A head-to-head test on SemiSegECG in which CardioMix integrated into multiple SemiSeg algorithms fails to exceed the performance of standard CutMix, or visual inspection revealing physiologically implausible waveforms in the generated mixed samples.

Figures

Figures reproduced from arXiv: 2605.15722 by Jeonghwa Lim, Minje Park, Sunghoon Joo.

Figure 1
Figure 1. Figure 1: Cardiac pattern-guided CutMix for semi-supervised ECG segmentation. Vanilla CutMix (left) can create waveforms with inconsistent cardiac patterns by pasting random segments, resulting in an unnatural sequence (e.g., a P-wave followed by a T-wave). The proposed method, CutMix guided by cardiac patterns (right), prevents this by matching an original ECG segment (query) with a reference ECG segment (key) whos… view at source ↗
Figure 2
Figure 2. Figure 2: A taxonomy of CutMix-based fusion methods. The red box is a segment to be replaced (query), and the blue box is a segment to be fused (key) in the place of the query. Ub and Lb are unlabeled and labeled mini-batches, respectively. (a) Vanilla CutMix. (b) Labeled-to￾unlabeled fusion. (c) CardioMix, a bidirectional fusion guided by cardiac patterns. Sim(ya, yb) as: Sim(ya, yb) = 1 C X C c=1 IoUc(ya, yb), IoU… view at source ↗
Figure 4
Figure 4. Figure 4: Cardiac pattern consistency ratio of fused ECGs (LUDB, 1/4 labeled ratio, MT as baseline algorithm). Consistency ratio measures the percentage of samples where all T-waves have preceding QRS complexes. (a) Consistency ratios after CutMix using ground-truth seg￾mentation labels. (b) Consistency ratios after L2U during training with pseudo-labels. cardiac cycles. The consistency ratio is computed as the prop… view at source ↗
Figure 5
Figure 5. Figure 5: Qualitative example 1. Random selection produces a T-T sequence and signal-based selection produces a P-T sequence within the fused region (red dashed box), while CardioMix preserves the natural P-QRS-T cycle. correlation between pattern preservation and segmentation performance. These properties translate into consistent performance gains in the ablation study, suggesting that the effectiveness of CardioM… view at source ↗
read the original abstract

Accurate delineation of electrocardiogram (ECG), the segmentation of meaningful waveform features, is crucial for cardiovascular diagnostics. However, the scarcity of annotated data poses a significant challenge for training deep learning models. Conventional semi-supervised semantic segmentation (SemiSeg) methods primarily focus on consistency from unlabeled data, underutilizing the information exchange possible between labeled and unlabeled sets. To address this, we introduce CardioMix, a framework built on a bidirectional CutMix strategy guided by cardiac patterns for ECG segmentation. This approach enriches the labeled set with realistic variations from unlabeled data while simultaneously applying stronger supervisory signals to the unlabeled set, as the cardiac pattern-guided mixing ensures all augmented samples remain physiologically meaningful. Our framework is designed as a plug-and-play module, demonstrating high compatibility with various SemiSeg algorithms. Extensive experiments on SemiSegECG, a public multi-dataset benchmark for ECG delineation, demonstrate that CardioMix consistently outperforms existing CutMix-based fusion strategies across diverse datasets and labeled ratios as a plug-and-play module compatible with various SemiSeg algorithms.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces CardioMix, a bidirectional CutMix strategy guided by cardiac patterns (e.g., QRS or RR landmarks) for semi-supervised ECG segmentation. The framework is positioned as a plug-and-play module that enriches labeled data with realistic variations from unlabeled data while providing stronger supervision to unlabeled samples, with the pattern guidance asserted to keep all mixes physiologically meaningful. It claims consistent outperformance over existing CutMix-based fusion strategies on the SemiSegECG benchmark across diverse datasets and labeled ratios, with high compatibility to various SemiSeg algorithms.

Significance. If the empirical gains are robust and the key assumption of physiological validity is substantiated through direct evidence, the work could offer a practical domain-informed augmentation technique for semi-supervised learning on ECG data, potentially aiding cardiovascular diagnostics in low-annotation regimes. The plug-and-play compatibility is a positive design choice that could facilitate adoption.

major comments (2)
  1. [§3] §3 (Cardiac pattern-guided mixing description): The central claim that CardioMix outperforms standard CutMix strategies because 'the cardiac pattern-guided mixing ensures all augmented samples remain physiologically meaningful' is load-bearing for the bidirectional fusion advantage, yet the manuscript provides no quantitative check such as an expert rating, waveform fidelity metric, or ablation that removes the pattern prior to isolate its contribution from generic mixing benefits.
  2. [§4] §4 (Experiments and results): The reported consistent outperformance on SemiSegECG lacks accompanying ablation results that test the physiological validity assumption (e.g., comparison to unstructured CutMix regions), which is necessary to confirm that the gains derive from the cardiac guidance rather than the bidirectional setup alone.
minor comments (1)
  1. [Abstract] The abstract states outperformance without including any specific quantitative metrics, statistical significance, or labeled-ratio details from the benchmark experiments.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and the recommendation for major revision. The comments highlight the need for stronger empirical support of the physiological validity claim, which we agree will improve the manuscript. We respond to each major comment below and will incorporate the suggested ablations and metrics in the revised version.

read point-by-point responses
  1. Referee: [§3] §3 (Cardiac pattern-guided mixing description): The central claim that CardioMix outperforms standard CutMix strategies because 'the cardiac pattern-guided mixing ensures all augmented samples remain physiologically meaningful' is load-bearing for the bidirectional fusion advantage, yet the manuscript provides no quantitative check such as an expert rating, waveform fidelity metric, or ablation that removes the pattern prior to isolate its contribution from generic mixing benefits.

    Authors: We agree that the manuscript would be strengthened by direct quantitative evidence isolating the pattern guidance. The design uses cardiac landmarks (QRS or RR intervals) to define mixing boundaries so that augmented ECGs retain valid cycle structure, unlike unstructured mixing that can create non-physiological discontinuities. To substantiate this, we will add an ablation in the revised manuscript that compares the full CardioMix against an otherwise identical bidirectional fusion using random (unstructured) mixing regions. We will also introduce a simple waveform fidelity metric, such as the fraction of mixed samples preserving physiologically plausible RR-interval ranges and QRS morphology as checked by a rule-based detector. These additions will quantify the contribution of the cardiac prior beyond generic mixing benefits. revision: yes

  2. Referee: [§4] §4 (Experiments and results): The reported consistent outperformance on SemiSegECG lacks accompanying ablation results that test the physiological validity assumption (e.g., comparison to unstructured CutMix regions), which is necessary to confirm that the gains derive from the cardiac guidance rather than the bidirectional setup alone.

    Authors: We concur that the current results do not fully disentangle the cardiac guidance from the bidirectional mechanism. While the main experiments already show gains over prior CutMix variants on the SemiSegECG benchmark, we will add a dedicated ablation study in the revised manuscript. This will include direct comparisons of (i) bidirectional fusion with cardiac-pattern guidance versus (ii) bidirectional fusion with random mixing regions, evaluated across the same datasets and labeled-data ratios. The new results will clarify whether the observed improvements stem specifically from the pattern-guided mixing rather than bidirectionality alone. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical validation is independent of inputs

full rationale

The paper proposes CardioMix as a bidirectional CutMix framework guided by cardiac patterns for semi-supervised ECG segmentation. Its central claim of consistent outperformance is positioned as arising from experimental results on the SemiSegECG benchmark across datasets and labeled ratios, not from any closed mathematical derivation, self-referential definitions, or fitted parameters renamed as predictions. The assertion that cardiac-pattern guidance ensures physiological meaningfulness is presented as a design property enabling the bidirectional exchange, but the superiority is demonstrated via external comparisons to existing CutMix strategies rather than reducing to the assumption by construction. No load-bearing self-citations, uniqueness theorems, or ansatzes imported from prior author work are invoked in the abstract or method framing. The framework is explicitly described as a plug-and-play module compatible with various SemiSeg algorithms, making the contribution self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on a domain assumption about ECG patterns being usable for valid mixing; no free parameters or new physical entities are introduced in the abstract.

axioms (1)
  • domain assumption Cardiac patterns in ECG signals can be used to guide data augmentation mixing while preserving physiological meaningfulness.
    Invoked to justify why bidirectional fusion produces useful training samples rather than artifacts.
invented entities (1)
  • CardioMix no independent evidence
    purpose: Bidirectional fusion framework for semi-supervised ECG segmentation.
    Newly introduced method without external falsifiable evidence provided in abstract.

pith-pipeline@v0.9.0 · 5716 in / 1230 out tokens · 58335 ms · 2026-05-20T19:45:01.035933+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages

  1. [1]

    Gacek and W

    A. Gacek and W. Pedrycz,ECG signal processing, classification and in- terpretation: a comprehensive framework of computational intelligence. Springer Science & Business Media, 2011

  2. [2]

    U-net architecture for the automatic detection and delineation of the electrocardiogram,

    G. Jimenez-Perez, A. Alcaine, and O. Camara, “U-net architecture for the automatic detection and delineation of the electrocardiogram,” in 2019 Computing in Cardiology (CinC). IEEE, 2019, pp. Page–1

  3. [3]

    Ecg segnet: An ecg delineation model based on the encoder-decoder structure,

    X. Liang, L. Li, Y . Liu, D. Chen, X. Wang, S. Hu, J. Wang, H. Zhang, C. Sun, and C. Liu, “Ecg segnet: An ecg delineation model based on the encoder-decoder structure,”Computers in biology and medicine, vol. 145, p. 105445, 2022

  4. [4]

    Deep learning based ecg segmentation for delineation of diverse arrhythmias,

    C. Joung, M. Kim, T. Paik, S.-H. Kong, S.-Y . Oh, W. K. Jeon, J.-h. Jeon, J.-S. Hong, W.-J. Kim, W. Kooket al., “Deep learning based ecg segmentation for delineation of diverse arrhythmias,”PloS one, vol. 19, no. 6, p. e0303178, 2024

  5. [5]

    A survey on semi- supervised semantic segmentation,

    A. Pel ´aez-Vegas, P. Mesejo, and J. Luengo, “A survey on semi- supervised semantic segmentation,”arXiv preprint arXiv:2302.09899, 2023

  6. [6]

    Augmentation matters: A simple-yet-effective approach to semi-supervised semantic segmentation,

    Z. Zhao, L. Yang, S. Long, J. Pi, L. Zhou, and J. Wang, “Augmentation matters: A simple-yet-effective approach to semi-supervised semantic segmentation,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2023, pp. 11 350–11 359

  7. [7]

    Locating noise is halfway denoising for semi-supervised segmentation,

    Y . Fang, F. Zhu, B. Cheng, L. Liu, Y . Zhao, and Y . Wei, “Locating noise is halfway denoising for semi-supervised segmentation,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 16 612–16 622

  8. [8]

    Cutmix: Reg- ularization strategy to train strong classifiers with localizable features,

    S. Yun, D. Han, S. J. Oh, S. Chun, J. Choe, and Y . Yoo, “Cutmix: Reg- ularization strategy to train strong classifiers with localizable features,” inProceedings of the IEEE/CVF international conference on computer vision, 2019, pp. 6023–6032

  9. [9]

    A wavelet-based ecg delineator: evaluation on standard databases,

    J. P. Mart ´ınez, R. Almeida, S. Olmos, A. P. Rocha, and P. Laguna, “A wavelet-based ecg delineator: evaluation on standard databases,”IEEE Transactions on biomedical engineering, vol. 51, no. 4, pp. 570–581, 2004

  10. [10]

    A comparison of the noise sensitivity of nine qrs detection algorithms,

    G. M. Friesen, T. C. Jannett, M. A. Jadallah, S. L. Yates, S. R. Quint, and H. T. Nagle, “A comparison of the noise sensitivity of nine qrs detection algorithms,”IEEE Transactions on biomedical engineering, vol. 37, no. 1, pp. 85–98, 1990

  11. [11]

    Revisiting qrs detection methodologies for portable, wearable, battery-operated, and wireless ecg systems,

    M. Elgendi, B. Eskofier, S. Dokos, and D. Abbott, “Revisiting qrs detection methodologies for portable, wearable, battery-operated, and wireless ecg systems,”PloS one, vol. 9, no. 1, p. e84018, 2014

  12. [12]

    Performance analysis of ten common qrs detectors on different ecg application cases,

    F. Liu, C. Liu, X. Jiang, Z. Zhang, Y . Zhang, J. Li, and S. Wei, “Performance analysis of ten common qrs detectors on different ecg application cases,”Journal of healthcare engineering, vol. 2018, no. 1, p. 9050812, 2018

  13. [13]

    Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals,

    A. L. Goldberger, L. A. Amaral, L. Glass, J. M. Hausdorff, P. C. Ivanov, R. G. Mark, J. E. Mietus, G. B. Moody, C.-K. Peng, and H. E. Stanley, “Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals,”circulation, vol. 101, no. 23, pp. e215–e220, 2000

  14. [14]

    Mean teachers are better role mod- els: Weight-averaged consistency targets improve semi-supervised deep learning results,

    A. Tarvainen and H. Valpola, “Mean teachers are better role mod- els: Weight-averaged consistency targets improve semi-supervised deep learning results,”Advances in neural information processing systems, vol. 30, 2017

  15. [15]

    Fixmatch: Simplifying semi- supervised learning with consistency and confidence,

    K. Sohn, D. Berthelot, N. Carlini, Z. Zhang, H. Zhang, C. A. Raffel, E. D. Cubuk, A. Kurakin, and C.-L. Li, “Fixmatch: Simplifying semi- supervised learning with consistency and confidence,”Advances in neural information processing systems, vol. 33, pp. 596–608, 2020

  16. [16]

    Semi-supervised semantic segmentation with cross pseudo supervision,

    X. Chen, Y . Yuan, G. Zeng, and J. Wang, “Semi-supervised semantic segmentation with cross pseudo supervision,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2021, pp. 2613–2622

  17. [17]

    Bootstrapping semantic segmentation with regional contrast,

    S. Liu, S. Zhi, E. Johns, and A. Davison, “Bootstrapping semantic segmentation with regional contrast,” inInternational Conference on Learning Representations, 2022. [Online]. Available: https: //openreview.net/forum?id=6u6N8WWwYSM

  18. [18]

    St++: Make self-training work better for semi-supervised semantic segmentation,

    L. Yang, W. Zhuo, L. Qi, Y . Shi, and Y . Gao, “St++: Make self-training work better for semi-supervised semantic segmentation,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 4268–4277

  19. [19]

    Semi- supervised semantic segmentation needs strong, varied perturbations,

    G. French, S. Laine, T. Aila, M. Mackiewicz, and G. Finlayson, “Semi- supervised semantic segmentation needs strong, varied perturbations,” inBritish Machine Vision Conference, no. 31, 2020

  20. [20]

    Ludb: a new open-access validation tool for electrocardiogram delineation algorithms,

    A. I. Kalyakulina, I. I. Yusipov, V . A. Moskalenko, A. V . Nikolskiy, K. A. Kosonogov, G. V . Osipov, N. Y . Zolotykh, and M. V . Ivanchenko, “Ludb: a new open-access validation tool for electrocardiogram delineation algorithms,”IEEE access, vol. 8, pp. 186 181–186 190, 2020

  21. [21]

    A database for evaluation of algorithms for measurement of qt and other waveform intervals in the ecg,

    P. Laguna, R. G. Mark, A. Goldberg, and G. B. Moody, “A database for evaluation of algorithms for measurement of qt and other waveform intervals in the ecg,” inComputers in cardiology 1997. IEEE, 1997, pp. 673–676

  22. [22]

    Isp ecg delineation dataset,

    A. Avetisyan, N. Khachaturov, A. Asatryan, S. Tigranyan, and Y . Markin, “Isp ecg delineation dataset,” 2024. [Online]. Available: https://zenodo.org/doi/10.5281/zenodo.11472366

  23. [23]

    A 12-lead ecg database to identify origins of idiopathic ventricular arrhythmia containing 334 patients,

    J. Zheng, G. Fu, K. Anderson, H. Chu, and C. Rakovski, “A 12-lead ecg database to identify origins of idiopathic ventricular arrhythmia containing 334 patients,”Scientific data, vol. 7, no. 1, p. 98, 2020

  24. [24]

    Generalis- ing electrocardiogram detection and delineation: training convolutional neural networks with synthetic data augmentation,

    G. Jimenez-Perez, J. Acosta, A. Alcaine, and O. Camara, “Generalis- ing electrocardiogram detection and delineation: training convolutional neural networks with synthetic data augmentation,”Frontiers in Cardio- vascular Medicine, vol. 11, p. 1341786, 2024

  25. [25]

    Semisegecg: A multi-dataset bench- mark for semi-supervised semantic segmentation in ecg delineation,

    M. Park, J. Lim, T. Yu, and S. Joo, “Semisegecg: A multi-dataset bench- mark for semi-supervised semantic segmentation in ecg delineation,” in Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025, pp. 5099–5104

  26. [26]

    Bat: Beat- aligned transformer for electrocardiogram classification,

    X. Li, C. Li, Y . Wei, Y . Sun, J. Wei, X. Li, and B. Qian, “Bat: Beat- aligned transformer for electrocardiogram classification,” in2021 IEEE International Conference on Data Mining (ICDM). IEEE, 2021, pp. 320–329

  27. [27]

    Guiding masked representation learning to capture spatio-temporal relationship of electrocardiogram,

    Y . Na, M. Park, Y . Tae, and S. Joo, “Guiding masked representation learning to capture spatio-temporal relationship of electrocardiogram,” inInternational Conference on Learning Representations, 2024. [Online]. Available: https://openreview.net/forum?id=WcOohbsF4H

  28. [28]

    Training data-efficient image transformers & distillation through attention,

    H. Touvron, M. Cord, M. Douze, F. Massa, A. Sablayrolles, and H. J ´egou, “Training data-efficient image transformers & distillation through attention,” inInternational conference on machine learning. PMLR, 2021, pp. 10 347–10 357

  29. [29]

    Fully convolutional networks for semantic segmentation,

    J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 3431–3440

  30. [30]

    Dropout: a simple way to prevent neural networks from over- fitting,

    N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhut- dinov, “Dropout: a simple way to prevent neural networks from over- fitting,”The journal of machine learning research, vol. 15, no. 1, pp. 1929–1958, 2014

  31. [31]

    Decoupled weight decay regularization,

    I. Loshchilov and F. Hutter, “Decoupled weight decay regularization,” in International Conference on Learning Representations, 2019. [Online]. Available: https://openreview.net/forum?id=Bkg6RiCqY7

  32. [32]

    Sgdr: Stochastic gradient descent with warm restarts,

    ——, “Sgdr: Stochastic gradient descent with warm restarts,” in International Conference on Learning Representations, 2017. [Online]. Available: https://openreview.net/forum?id=Skq89Scxx

  33. [33]

    Randaugment: Practical automated data augmentation with a reduced search space,

    E. D. Cubuk, B. Zoph, J. Shlens, and Q. V . Le, “Randaugment: Practical automated data augmentation with a reduced search space,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, 2020, pp. 702–703. LIMet al.: BIDIRECTIONAL FUSION GUIDED BY CARDIAC PATTERNS FOR SEMI-SUPERVISED ECG SEGMENTATION 11 APPENDIXI DETA...