CogAdapt: Transferring Clinical ECG Foundation Models to Wearable Cognitive Load Assessment via Lead Adaptation
Pith reviewed 2026-05-22 06:35 UTC · model grok-4.3
The pith
A learnable adapter lets clinical ECG foundation models assess cognitive load from 3-lead wearables by converting signals to 12-lead form.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
CogAdapt transfers clinical ECG foundation models to wearable cognitive load assessment by using LeadBridge, a learnable adapter that transforms 3-lead wearable ECG signals into anatomically consistent 12-lead representations, and ProFine, a progressive fine-tuning strategy that gradually unfreezes encoder layers while preventing catastrophic forgetting. On the CLARE and CL-Drive datasets under leave-one-subject-out cross-validation, CogAdapt achieves macro-F1 scores of 0.626 and 0.768 and substantially outperforms baselines trained from scratch.
What carries the argument
LeadBridge, a learnable adapter that transforms 3-lead wearable signals into 12-lead representations, paired with ProFine progressive fine-tuning to avoid forgetting.
If this is right
- Real-time cognitive load assessment becomes possible on consumer wearables without collecting millions of new labeled recordings.
- Subject-independent performance improves because the clinical pre-training supplies robust features that survive the lead transformation.
- Progressive unfreezing during fine-tuning lets the model retain clinical knowledge while learning the new task.
- The same adapter-plus-progressive-tuning pattern could be applied to other sensor mismatches in physiological signal processing.
Where Pith is reading between the lines
- The technique might generalize to other wearable signals such as PPG or EEG for detecting related states like fatigue or attention.
- Consumer devices could incorporate the adapted model to adjust interfaces dynamically during driving or office work.
- Future tests on larger and more diverse wearable datasets would show whether the performance gap over scratch training holds outside the two evaluated collections.
Load-bearing premise
The load-bearing premise is that a learnable adapter can turn 3-lead wearable ECG signals into 12-lead versions that still contain the features needed to tell cognitive load levels apart.
What would settle it
If a model trained from scratch on the same wearable data matches or exceeds the adapted model's macro-F1 scores on the CLARE or CL-Drive datasets under the same leave-one-subject-out protocol, the benefit of the clinical foundation model transfer would be refuted.
Figures
read the original abstract
Real-time cognitive load assessment is essential for adaptive human-computer interaction but remains challenging due to limited labeled data and poor cross-subject generalization. Recent ECG foundation models pre-trained on millions of clinical recordings offer rich representations, but cannot be directly applied to wearable devices due to sensor configuration mismatch and task differences. In this paper, we propose CogAdapt, a framework that adapts clinical ECG foundation models to wearable cognitive load assessment. CogAdapt introduces LeadBridge, a learnable adapter that transforms 3-lead wearable signals into anatomically consistent 12-lead representations, and ProFine, a progressive fine-tuning strategy that gradually unfreezes encoder layers while preventing catastrophic forgetting. Evaluations on two public datasets (CLARE and CL-Drive) under leave-one-subject-out cross-validation show that CogAdapt substantially outperforms baselines trained from scratch, achieving macro-F1 scores of 0.626 and 0.768. These results demonstrate the promise of foundation model adaptation for subject-independent cognitive load assessment from wearable sensors.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes CogAdapt, a framework for adapting clinical ECG foundation models to wearable cognitive load assessment. It introduces LeadBridge, a learnable adapter that maps 3-lead wearable ECG signals to anatomically consistent 12-lead representations, and ProFine, a progressive fine-tuning strategy that gradually unfreezes encoder layers. On the public CLARE and CL-Drive datasets, CogAdapt achieves macro-F1 scores of 0.626 and 0.768 respectively under leave-one-subject-out cross-validation, substantially outperforming baselines trained from scratch.
Significance. If the central claim holds, the work would demonstrate a practical route for transferring large-scale clinical ECG foundation models to wearable sensor configurations despite lead mismatch and task shift. The use of public datasets with leave-one-subject-out validation provides a reproducible empirical foundation for subject-independent cognitive load assessment, which could benefit adaptive human-computer interaction systems. However, the absence of mechanistic validation for the adapter limits the current significance.
major comments (2)
- The manuscript reports clear performance gains (macro-F1 0.626 on CLARE, 0.768 on CL-Drive under LOSO) but provides no description of the LeadBridge adapter architecture, the loss functions employed during adaptation, hyperparameter choices, or any statistical significance testing of the improvements over baselines. These omissions are load-bearing because they prevent independent verification of whether the reported gains are robust or reproducible.
- No waveform-level fidelity metrics, feature-space alignment measures between original and adapted leads, or ablation experiments that isolate LeadBridge from the ProFine progressive unfreezing schedule are reported. This directly affects the central claim that the adapter produces representations preserving cognitive-load-relevant features (subtle HRV and morphological patterns), as the gains could arise from fine-tuning alone rather than successful lead adaptation.
minor comments (1)
- The abstract and results sections would benefit from explicit statements on the number of subjects, class distribution, and exact baseline implementations to strengthen reproducibility.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments on our manuscript. We have carefully addressed each major comment below, providing clarifications and indicating the specific revisions made to improve the paper's reproducibility and validation of the central claims.
read point-by-point responses
-
Referee: The manuscript reports clear performance gains (macro-F1 0.626 on CLARE, 0.768 on CL-Drive under LOSO) but provides no description of the LeadBridge adapter architecture, the loss functions employed during adaptation, hyperparameter choices, or any statistical significance testing of the improvements over baselines. These omissions are load-bearing because they prevent independent verification of whether the reported gains are robust or reproducible.
Authors: We agree that these implementation details are essential for reproducibility and were insufficiently elaborated in the original submission. In the revised manuscript, we have expanded Section 3.2 to provide a complete architectural specification of LeadBridge, including its convolutional and attention-based layers, input/output dimensions, and parameter counts. We have added the exact loss functions (cross-entropy for classification combined with an L2 reconstruction term for lead adaptation), a comprehensive hyperparameter table (learning rates, batch sizes, unfreezing schedules), and statistical significance testing via paired t-tests with p-values reported for all baseline comparisons in the results tables. revision: yes
-
Referee: No waveform-level fidelity metrics, feature-space alignment measures between original and adapted leads, or ablation experiments that isolate LeadBridge from the ProFine progressive unfreezing schedule are reported. This directly affects the central claim that the adapter produces representations preserving cognitive-load-relevant features (subtle HRV and morphological patterns), as the gains could arise from fine-tuning alone rather than successful lead adaptation.
Authors: We acknowledge that the original manuscript lacked these targeted analyses, which would more directly support the role of LeadBridge in feature preservation. To address this, the revised version includes new experiments: waveform fidelity is now quantified via MSE and Pearson correlation on reconstructed leads; feature-space alignment uses cosine similarity and CCA between embeddings from the adapted and original clinical leads. We have also added ablation studies (reported in a new table) comparing full CogAdapt against variants without LeadBridge and with fixed vs. progressive unfreezing, demonstrating that the adapter contributes gains beyond ProFine alone while preserving HRV-related patterns. revision: yes
Circularity Check
No circularity; empirical results on external datasets
full rationale
The paper introduces CogAdapt with a learnable LeadBridge adapter and ProFine progressive fine-tuning to transfer clinical ECG foundation models to wearable cognitive load tasks. All reported performance numbers (macro-F1 0.626 on CLARE, 0.768 on CL-Drive under LOSO) are measured outcomes from direct evaluation on two independent public datasets against scratch-trained baselines. No equation, adapter definition, or fine-tuning schedule reduces by construction to its own fitted inputs or to a self-citation chain; the derivation chain consists of architectural choices whose validity is tested externally rather than assumed.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Clinical ECG foundation models contain transferable representations for cognitive load when input format mismatch is corrected.
invented entities (2)
-
LeadBridge
no independent evidence
-
ProFine
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Miller, Saba Emrani, Udhyaku- mar Nallasamy, and Ian Shapiro
[Abbaspourazadet al., 2024 ] Salar Abbaspourazad, Ous- sama Elachqar, Andrew C. Miller, Saba Emrani, Udhyaku- mar Nallasamy, and Ian Shapiro. Large-scale training of foundation models for wearable biosignals. InICLR,
work page 2024
-
[2]
[Angkanet al., 2024 ] Prithila Angkan, Behnam Behinaein, Zunayed Mahmud, Anubhav Bhatti, Dirk Rodenburg, Paul Hungler, and Ali Etemad. Multimodal brain–computer in- terface for in-vehicle driver cognitive load measurement: Dataset and baselines.IEEE Transactions on Intelligent Transportation Systems, 25(6):5949–5964,
work page 2024
-
[3]
[Ayreset al., 2021 ] Paul Ayres, Joy Yeonjoo Lee, Fred Paas, and Jeroen J. G. Van Merrienboer. The validity of physio- logical measures to identify differences in intrinsic cogni- tive load.Frontiers in Psychology, 12:702538,
work page 2021
-
[4]
[Bhattiet al., 2024 ] Anubhav Bhatti, Prithila Angkan, Behnam Behinaein, Zunayed Mahmud, D. Rodenburg, Heather Braund, P. Mclellan, A. Ruberto, Geoffery Harri- son, Daryl Wilson, A. Szulewski, Dan Howes, A. Etemad, and Paul Hungler. Clare: Cognitive load assessment in realtime with multimodal data.IEEE Transactions on Cognitive and Developmental Systems,
work page 2024
-
[5]
A simple frame- work for contrastive learning of visual representations
[Chenet al., 2020 ] Ting Chen, Simon Kornblith, Moham- mad Norouzi, and Geoffrey Hinton. A simple frame- work for contrastive learning of visual representations. In Hal Daum´e III and Aarti Singh, editors,Proceedings of the 37th International Conference on Machine Learning, vol- ume 119 ofProceedings of Machine Learning Research, pages 1597–1607. PMLR, 13–18 Jul
work page 2020
-
[6]
On deriving the electrocardio- gram from vectorcardiographic leads.Clinical cardiology, 3(2):87–95,
[Doweret al., 1980 ] Gordon E Dower, H Bastos Machado, and John A Osborne. On deriving the electrocardio- gram from vectorcardiographic leads.Clinical cardiology, 3(2):87–95,
work page 1980
-
[7]
[Feildet al., 2008 ] Dirk Q. Feild, Sophia H. Zhou, Eric D. Helfenbein, Richard E. Gregg, and James M. Lindauer. Technical challenges and future directions in lead recon- struction for reduced-lead systems.Journal of Electrocar- diology, 41(6):466–473,
work page 2008
-
[8]
[Gjoreskiet al., 2020 ] Martin Gjoreski, Tine Kolenik, Timo- tej Knez, Mitja Luˇstrek, Matjaˇz Gams, Hristijan Gjoreski, and Veljko Pejovi´c. Datasets for cognitive load inference using wearable sensors and psychological traits.Applied Sciences, 10(11):3843,
work page 2020
-
[9]
[Gkintoniet al., 2025 ] Evgenia Gkintoni, Hera Antonopoulou, Andrew Sortwell, and Constantinos Halkiopoulos. Challenging cognitive load theory: The role of educational neuroscience and artificial intelligence in redefining learning efficacy.Brain Sciences, 15:203, 02
work page 2025
-
[10]
Foundation models for biosignals: A survey, August
[Guet al., 2025 ] Xiao Gu, Yuxuan Shu, Jinpei Han, Yuxuan Liu, Zhangdaihong Liu, James Anibal, Veer Sangha, Ed- ward Phillips, Bradley Segal, Yuxuan Liu, Hang Yuan, Fenglin Liu, Kim Branson, Patrick Schwab, Danielle Bel- grave, Lei Clifton, Dimitris Spathis, Vasileios Lampos, A Aldo Faisal, and David A Clifton. Foundation models for biosignals: A survey, August
work page 2025
-
[11]
[Hanet al., 2024 ] Yu Han, Xiaofeng Liu, Xiang Zhang, and Cheng Ding
TechRxiv Preprint. [Hanet al., 2024 ] Yu Han, Xiaofeng Liu, Xiang Zhang, and Cheng Ding. Foundation models in electrocardiogram: A review.arXiv preprint arXiv:2410.19877,
-
[12]
[Hanet al., 2025 ] Yu Han, Vittorio Murino, Xiaofeng Liu, Xiang Zhang, and Cheng Ding. A systematic review on foundation models for electrocardiogram analysis: Initial strides and expansive horizons,
work page 2025
-
[13]
[Hassannia and Sameni, 2025] Mohammadsina Hassannia and Reza Sameni. A neural network architecture for ecg lead reconstruction: Separating shared and lead-specific ecg characteristics.Computing in Cardiology Conference (CinC),
work page 2025
-
[14]
Delving deep into rectifiers: Surpass- ing human-level performance on imagenet classification
[Heet al., 2015 ] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Delving deep into rectifiers: Surpass- ing human-level performance on imagenet classification. InProceedings of the IEEE international conference on computer vision, pages 1026–1034,
work page 2015
-
[15]
Universal language model fine-tuning for text clas- sification
[Howard and Ruder, 2018] Jeremy Howard and Sebastian Ruder. Universal language model fine-tuning for text clas- sification. InProceedings of the 56th Annual Meeting of the Association for Computational Linguistics, pages 328– 339,
work page 2018
-
[16]
[Hugheset al., 2019 ] Ashley M. Hughes, Gabriella M. Han- cock, Shannon L. Marlow, Kimberly Stowers, and Ed- uardo Salas. Cardiac measures of cognitive workload: a meta-analysis.Human factors, 61(3):393–414,
work page 2019
-
[17]
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
[Ioffe, 2015] Sergey Ioffe. Batch normalization: Accelerat- ing deep network training by reducing internal covariate shift.arXiv preprint arXiv:1502.03167,
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[18]
A database for cognitive workload classification using electrocardio- gram and respiration signal,
[Kalatzis and others, 2021] Ilias Kalatzis et al. A database for cognitive workload classification using electrocardio- gram and respiration signal,
work page 2021
-
[19]
Dataset description; please fill in the correct journal/venue, volume, pages, and DOI from the official publication. [Kimet al., 2022 ] S. Kim, S. Chon, J. K. Kim, J. Kim, Y . Gil, and S. Jung. Lightweight convolutional neural net- work for real-time arrhythmia classification on low-power wearable electrocardiograph. InProceedings of the An- nual Internat...
work page 2022
-
[20]
Clocs: Contrastive learning of car- diac signals across space, time, and patients
[Kiyassehet al., 2021 ] Dani Kiyasseh, Tingting Zhu, and David A Clifton. Clocs: Contrastive learning of car- diac signals across space, time, and patients. InInter- national Conference on Machine Learning, pages 5606–
work page 2021
-
[21]
[Kligfieldet al., 2007 ] Paul Kligfield, Leonard S Gettes, James J Bailey, Rory Childers, Barbara J Deal, E William Hancock, Gerard Van Herpen, Jan A Kors, Peter Mac- farlane, David M Mirvis, et al. Recommendations for the standardization and interpretation of the electrocardio- gram: part i: the electrocardiogram and its technology a scientific statement...
work page 2007
- [22]
-
[23]
[Kundu, 2022] Subhasis Kundu. Hci-driven emotion- adaptive uis for cognitive efficiency: Real-time adaptation to fatigue, focus, and emotional expression.International Scientific Journal of Engineering and Management, 01:1– 7, 09
work page 2022
-
[24]
David Hairston, and Kay Robbins
[Lawhernet al., 2013 ] Vernon Lawhern, W. David Hairston, and Kay Robbins. Detect: A matlab toolbox for event de- tection and identification in time series, with applications to artifact detection in eeg signals.PLOS ONE, 8(4):1–13, 04
work page 2013
-
[25]
CLAS: A database for cognitive load, affect, and stress,
[Markova and others, 2019] Veselina Markova et al. CLAS: A database for cognitive load, affect, and stress,
work page 2019
-
[26]
[McKeenet al., 2025 ] Kaden McKeen, Sameer Masood, Augustin Toma, Barry Rubin, and Bo Wang
Dataset description; please insert the correct venue/journal information from the official publication. [McKeenet al., 2025 ] Kaden McKeen, Sameer Masood, Augustin Toma, Barry Rubin, and Bo Wang. Ecg-fm: An open electrocardiogram foundation model.JAMIA open, 8(5):ooaf122,
work page 2025
-
[27]
A modern wilson’s central terminal electrocardiography database
[Moeinzadehet al., 2018 ] Hossein Moeinzadeh, Gaetano Gargiulo, Paolo Bifulco, Mario Cesarelli, A O’Loughlin, MI Shugman, and Aravinda Thiagalingam. A modern wilson’s central terminal electrocardiography database. Heart, Lung and Circulation, 27:S293–S294,
work page 2018
-
[28]
[Mvuhet al., 2024 ] Franck Lino Mvuh, Claude Odile Vanessa Ebode Ko’a, and Bertrand Bodo. Mul- tichannel high noise level ecg denoising based on adversarial deep learning.Scientific Reports, 14(1):801,
work page 2024
-
[29]
[Naismith and Cavalcanti, 2015] Laura M. Naismith and Ro- drigo B. Cavalcanti. Validity of cognitive load measures in simulation-based training: a systematic review.Academic Medicine, 90(11):S24–S35, Nov
work page 2015
-
[30]
Reconstruction of 12-lead ecg: a review of algorithms.Frontiers in Physiology, 16:1532284,
[Obianomet al., 2025 ] Ekenedirichukwu N Obianom, G Andr ´e Ng, and Xin Li. Reconstruction of 12-lead ecg: a review of algorithms.Frontiers in Physiology, 16:1532284,
work page 2025
-
[31]
[Potharajuet al., 2025 ] Saiprasad Potharaju, K. Veera Raghavulu, N. S. Pradeep Kumar, Suresh Salendra, Swapnali N. Tambe, and M. V . V . Prasad Kantipudi. Multi-resolution hybrid sliding window approach for ecg arrhythmia detection using deep learning models. Discover Applied Sciences, 7(12):1412,
work page 2025
-
[32]
[Rahmanet al., 2022 ] Saifur Rahman, Chandan Karmakar, Iynkaran Natgunanathan, John Yearwood, and Marimuthu Palaniswami. Robustness of electrocardiogram signal quality indices.Journal of The Royal Society Interface, 19(189):20220012, 04
work page 2022
-
[33]
[Razaet al., 2024 ] Muhammad Shahzad Raza, Muhammad Murtaza, C. T. Cheng, and Muhammad Asif Khan. Sys- tematic review of cognitive impairment in drivers through mental workload using physiological measures of heart rate variability.Frontiers in Computational Neuroscience, 18:1475530,
work page 2024
- [34]
-
[35]
[Shanget al., 2021 ] Zuogang Shang, Zhibin Zhao, Hui Fang, Samuel Relton, Darcy Murphy, Zoe Hancox, Ruqiang Yan, and David Wong. Deep discriminative do- main generalization with adversarial feature learning for classifying ecg signals. In2021 Computing in Cardiology (CinC), volume 48, pages 1–4, Sep
work page 2021
-
[36]
[Solhjooet al., 2019 ] S. Solhjoo, M. C. Haigney, E. McBee, and S. J. Durning. Heart rate and heart rate variabil- ity correlate with clinical reasoning performance and self- reported measures of cognitive load.Scientific Reports, 9(1):1–9,
work page 2019
-
[37]
[Suryaniet al., 2024 ] Mira Suryani, Harry Budi Santoso, Martin Schrepp, Rizal Fathoni Aji, Setiawan Hadi, Dana Indra Sensuse, Ryan Randy Suryono, et al. Role, methodology, and measurement of cognitive load in com- puter science and information systems research.IEEE Ac- cess,
work page 2024
-
[38]
[V ozda and Cerny, 2015] M. V ozda and M. Cerny. Methods for derivation of orthogonal leads from 12-lead electrocar- diogram: A review.Biomedical Signal Processing and Control, 19:23–34,
work page 2015
-
[39]
Ptb-xl, a large publicly available electrocardiography dataset.Scientific Data, 7(1):1–15,
[Wagneret al., 2020 ] Patrick Wagner, Nils Strodthoff, Ralf- Dieter Bousseljot, Dieter Kreiseler, Fatima I Lunze, Wo- jciech Samek, and Tobias Schaeffter. Ptb-xl, a large publicly available electrocardiography dataset.Scientific Data, 7(1):1–15,
work page 2020
-
[40]
Simper: Simple self-supervised learning of periodic tar- gets.arXiv preprint arXiv:2210.03115,
[Yanget al., 2022 ] Yuzhe Yang, Xin Liu, Jiang Wu, Silviu Borac, Dina Katabi, Ming-Zher Poh, and Daniel McDuff. Simper: Simple self-supervised learning of periodic tar- gets.arXiv preprint arXiv:2210.03115,
-
[41]
Appendix A Baselines We compare our approach against two baseline methods for cognitive load classification from ECG signals: ECG-LightCNNrepresents the family of lightweight con- volutional architectures for end-to-end ECG analysis [Bhatti et al., 2024]. This baseline uses a shallow CNN inspired by VGG networks with multiple convolutional and max-pooling...
-
[42]
Auto-αdenotes inverse frequency class weights
Hyperparameter K-Fold LOSO Scenario A (Frozen Encoder) Training Epochs 30 30 Batch Size 64 64 LR Head1e−3 1e−3 LR Adapter1e−4 1e−4 Loss Function Focal (γ=2) Focal (γ=2) Class Weighting Auto-αAuto-α Data Augmentation Enabled Enabled Scenario B (Partial Unfreeze) Training Epochs 20 20 Batch Size 64 64 LR Head5e−4 5e−4 LR Adapter1e−4 1e−4 LR Encoder (Top 2)1...
work page 2048
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.