Biomarker-Based Pretraining for Chagas Disease Screening in Electrocardiograms
Pith reviewed 2026-05-10 16:45 UTC · model grok-4.3
The pith
Pretraining an ECG model to predict blood biomarkers from one dataset transfers to improved Chagas disease detection in another.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors establish that an ECG feature extractor pretrained to predict blood biomarkers from the MIMIC-IV-ECG dataset can be successfully fine-tuned on Brazilian datasets to detect Chagas disease, yielding competitive performance in the PhysioNet Challenge.
What carries the argument
biomarker-based pretraining, in which the model learns ECG representations by predicting percentile-binned blood biomarkers before fine-tuning for Chagas classification
Where Pith is reading between the lines
- The same pretraining step could support detection of other cardiac conditions where labeled ECG data is scarce.
- Applying the approach to additional populations would test whether the learned biomarker-ECG links generalize beyond the original datasets.
- Combining this supervised pretraining with other self-supervised ECG methods might produce stronger starting representations.
Load-bearing premise
The relationships between ECG signals and blood biomarkers observed in the MIMIC-IV dataset also hold sufficiently in the Brazilian Chagas patient data to allow useful feature transfer.
What would settle it
Training identical models from random initialization on the Brazilian Chagas datasets and comparing detection performance to the biomarker-pretrained versions; consistent lack of improvement from pretraining would refute the transfer benefit.
Figures
read the original abstract
Chagas disease screening via ECGs is limited by scarce and noisy labels in existing datasets. We propose a biomarker-based pretraining approach, where an ECG feature extractor is first trained to predict percentile-binned blood biomarkers from the MIMIC-IV-ECG dataset. The pretrained model is then fine-tuned on Brazilian datasets for Chagas detection. Our 5-model ensemble, developed by the Ahus AIM team, achieved a challenge score of 0.269 on the hidden test set, ranking 5th in Detection of Chagas Disease from the ECG: The George B. Moody PhysioNet Challenge 2025. Source code and the model are shared on GitHub: github.com/Ahus-AIM/physionet-challenge-2025
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a biomarker-based pretraining approach for ECG-based Chagas disease screening. An ECG feature extractor is first trained on the MIMIC-IV-ECG dataset to predict percentile-binned blood biomarkers, then fine-tuned on Brazilian Chagas datasets. A 5-model ensemble achieves a challenge score of 0.269 on the hidden test set, ranking 5th in the George B. Moody PhysioNet Challenge 2025. Source code is shared on GitHub.
Significance. If the pretraining step is shown to drive the performance, the work demonstrates a practical route to mitigate label scarcity for Chagas cardiomyopathy detection by transferring representations learned from abundant biomarker-annotated ICU ECGs. The external validation via the challenge ranking provides a concrete empirical anchor. However, the absence of controls leaves open whether the biomarker objective adds value beyond generic ECG pretraining.
major comments (3)
- [Methods] Methods: The manuscript provides no ablation comparing the biomarker-pretrained model against a from-scratch baseline or alternative pretraining objectives (e.g., contrastive or reconstruction) on the Chagas fine-tuning task. Without this isolation, the central claim that biomarker pretraining improves Chagas detection cannot be substantiated.
- [Results] Results: No statistical tests, confidence intervals, or variance estimates are reported for the ensemble score of 0.269, nor details on model selection or ensembling procedure. This weakens the reliability of the 5th-place ranking as evidence for the method.
- [Methods] Methods: Architecture details, loss functions for the biomarker regression task, binning thresholds, and training hyperparameters are not fully specified, preventing assessment of whether the pretraining objective is well-posed or reproducible.
minor comments (2)
- [Abstract] Abstract and Methods: The specific blood biomarkers used for pretraining and the exact percentile binning procedure are not enumerated, which would aid interpretability of the learned features.
- [Introduction] Introduction: Limited discussion of prior transfer-learning or multi-task ECG pretraining literature; adding 2-3 key references would better situate the biomarker approach.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback, which highlights important areas for strengthening the evidence and reproducibility of our biomarker-based pretraining approach. We address each major comment below and have made revisions to the manuscript where feasible, including additional experiments and expanded details.
read point-by-point responses
-
Referee: [Methods] Methods: The manuscript provides no ablation comparing the biomarker-pretrained model against a from-scratch baseline or alternative pretraining objectives (e.g., contrastive or reconstruction) on the Chagas fine-tuning task. Without this isolation, the central claim that biomarker pretraining improves Chagas detection cannot be substantiated.
Authors: We agree that ablations are necessary to substantiate the specific benefit of the biomarker pretraining objective over generic alternatives. The original submission prioritized a competitive entry in the time-constrained PhysioNet Challenge. In the revision, we have added results from from-scratch baselines trained on the same Chagas fine-tuning data and report performance differences. We also include a brief discussion of why contrastive or reconstruction objectives were not used, noting that the supervised biomarker signal provides direct physiological alignment not guaranteed by self-supervised methods. These additions appear in the revised Methods and Results sections. revision: yes
-
Referee: [Results] Results: No statistical tests, confidence intervals, or variance estimates are reported for the ensemble score of 0.269, nor details on model selection or ensembling procedure. This weakens the reliability of the 5th-place ranking as evidence for the method.
Authors: We acknowledge that the original manuscript lacked uncertainty quantification and procedural details. The 0.269 score is the fixed official challenge metric on the hidden test set and cannot be altered. However, we have revised the Results section to describe the ensembling procedure (prediction averaging across five models with varied random seeds and stratified data splits) and added bootstrap-derived 95% confidence intervals computed on the internal validation set. We also include a note on model selection criteria. These changes provide better context for interpreting the ranking without overstating its statistical robustness. revision: yes
-
Referee: [Methods] Methods: Architecture details, loss functions for the biomarker regression task, binning thresholds, and training hyperparameters are not fully specified, preventing assessment of whether the pretraining objective is well-posed or reproducible.
Authors: We apologize for the incomplete specification. The revised Methods section now provides full details: the ECG encoder follows a ResNet-18 architecture adapted for 12-lead signals with specified kernel sizes and channel dimensions; the pretraining loss is mean squared error applied to the binned biomarker targets; binning uses dataset-specific percentiles (25th, 50th, 75th) for each of the 10 biomarkers; and all hyperparameters (learning rate schedule, batch size, epochs, optimizer, and data augmentation) are enumerated in a new supplementary table. The publicly shared GitHub code already implements these choices, and the text now matches the code for full reproducibility. revision: yes
Circularity Check
No circularity detected in pretraining-fine-tuning workflow
full rationale
The manuscript describes a standard two-stage pipeline: pretraining an ECG feature extractor on the independent MIMIC-IV-ECG dataset to regress percentile-binned blood biomarkers, followed by fine-tuning the resulting weights on separate Brazilian Chagas-labeled ECG datasets. No equations, fitted parameters, or self-citations are presented that reduce the final Chagas detection score to a quantity defined by the target task itself. The workflow relies on cross-dataset transfer without self-referential definitions or load-bearing citations to prior author work that would close a loop.
Axiom & Free-Parameter Ledger
free parameters (1)
- percentile binning thresholds for biomarkers
axioms (1)
- domain assumption ECG signals contain information predictive of blood biomarker levels
Reference graph
Works this paper leans on
-
[1]
Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PC, Mark RG, Mietus JE, Moody GB, Peng CK, Stanley HE. PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals. Circulation June 2000;101(23). ISSN 0009-7322, 1524-4539
work page 2000
-
[2]
Detection of Chagas Disease from the ECG: The George B
Reyna MA, Koscova Z, Pavlus J, Weigle J, Saghafi S, Gomes P, Elola A, Hassannia MS, Campbell K, Bahrami Rad A, Ribeiro AH, Ribeiro AL, Sameni R, Clif- ford GD. Detection of Chagas Disease from the ECG: The George B. Moody PhysioNet Challenge 2025. In Comput- ing in Cardiology 2025, volume 52. 2025; 1–4
work page 2025
-
[3]
Detection of Chagas Disease from the ECG: The George B
Reyna MA, Koscova Z, Pavlus J, Saghafi S, Weigle J, Elola A, Seyedi S, Campbell K, Li Q, Bahrami Rad A, Ribeiro A, Ribeiro ALP, Sameni R, Clifford GD. Detection of Chagas Disease from the ECG: The George B. Moody PhysioNet Challenge 2025, 2025. ArXiv.2510.02202
-
[4]
The epidemiology of Chagas disease in the Americas
Cucunub ´a ZM, Guti ´errez-Romero SA, Ram ´ırez JD, Vel´asquez-Ortiz N, Ceccarelli S, Parra-Henao G, Henao- Mart´ınez AF, Rabinovich J, Bas ´a˜nez MG, Nouvellet P, Abad-Franch F. The epidemiology of Chagas disease in the Americas. The Lancet Regional Health Americas Septem- ber 2024;37:100881. ISSN 2667193X
work page 2024
-
[5]
Cardoso CS, Sabino EC, Oliveira CDL, De Oliveira LC, Ferreira AM, Cunha-Neto E, Bierrenbach AL, Ferreira JE, Haikal DS, Reingold AL, Ribeiro ALP. Longitudinal study of patients with chronic Chagas cardiomyopathy in Brazil (SaMi-Trop project): a cohort profile. BMJ Open May 2016;6(5):e011181. ISSN 2044-6055, 2044-6055
work page 2016
-
[6]
Ribeiro AH, Ribeiro MH, Paix ˜ao GMM, Oliveira DM, Gomes PR, Canazart JA, Ferreira MPS, Andersson CR, Macfarlane PW, Meira Jr. W, Sch¨on TB, Ribeiro ALP. Au- tomatic diagnosis of the 12-lead ECG using a deep neural network. Nature Communications April 2020;11(1):1760. ISSN 2041-1723
work page 2020
-
[7]
MIMIC-IV-ECG: Diagnostic electrocar- diogram matched subset
Gow B, Pollard T, Nathanson LA, Johnson A, Moody B, Fernandes C, Greenbaum N, Waks JW, Eslami P, Carbon- ati T, Chaudhari A, Herbst E, Moukheiber D, Berkowitz S, Mark R, Horng S. MIMIC-IV-ECG: Diagnostic electrocar- diogram matched subset
- [8]
-
[9]
Muon: An optimizer for hidden layers in neural networks| Keller Jordan blog
-
[10]
Adam: A Method for Stochastic Optimization
Kingma DP, Ba J. Adam: A method for stochastic opti- mization, January 2017. ArXiv:1412.6980
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[11]
InceptionTime: Finding AlexNet for time series classifica- tion, December 2020
Fawaz HI, Lucas B, Forestier G, Pelletier C, Schmidt DF, Weber J, Webb GI, Idoumghar L, Muller PA, Petitjean F. InceptionTime: Finding AlexNet for time series classifica- tion, December 2020. ArXiv:1909.04939
-
[12]
Learning from noisy labels with deep neural networks: a survey, March
Song H, Kim M, Park D, Shin Y , Lee JG. Learning from noisy labels with deep neural networks: a survey, March
-
[13]
ArXiv:2007.08199. Address for correspondence: Elias Stenhede Sykehusveien 25, 1478 Nordbyhagen, Norway elias.stenhede@ahus.no
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.