pith. sign in

arxiv: 2511.01747 · v4 · pith:X3I2DME5new · submitted 2025-11-03 · 📡 eess.SP

AnyPPG: An ECG-Guided PPG Foundation Model Trained on Over 100,000 Hours of Recordings for Holistic Health Profiling

classification 📡 eess.SP
keywords anyppghealthcardiovasculardiseaseholistictasksacrossanalysis
0
0 comments X
read the original abstract

Photoplethysmography (PPG) is widely used as a non-invasive and accessible modality for continuous health monitoring. However, despite being a peripheral hemodynamic signal intrinsically coupled with systemic circulation, existing research has largely confined its scope to a narrow range of cardiovascular tasks, leaving a fundamental question underexplored: to what extent can PPG support holistic health profiling beyond traditional cardiovascular applications? To answer this question, we present AnyPPG, a foundation model-based framework designed to reveal the broader health-profiling potential of PPG. To ensure reliable performance for this investigation, AnyPPG is pretrained with ECG guidance on the most diverse PPG corpus with synchronized ECG to date, comprising over 100,000 hours of recordings from six large-scale data sources. This pretraining yields robust and physiologically grounded PPG representations that provide a reliable basis for subsequent analysis. Building upon this pretrained model, we conduct a systematic investigation into the association between PPG and holistic health through, to our knowledge, the first PPG-based phenome-wide disease detection study, spanning 1,468 disease phenotypes in more than 15,000 subjects. Our evaluation demonstrates the effectiveness of AnyPPG: across eight clinical and wearable datasets covering 15 downstream tasks, it achieves the best performance in 13 tasks. More importantly, in the phenome-wide analysis, AnyPPG exhibits meaningful discriminative capability (AUC $\ge$ 0.70) for 307 phenotypes across 16 distinct phecode chapters, including 230 non-circulatory conditions such as dementia and chronic kidney disease, many of which have rarely been explored using PPG. Collectively, these findings indicate that easily acquired PPG signals encode rich health-related information extending well beyond conventional cardiovascular assessment.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. BCG-FM: A Foundation Model for Ambient Cardiac Health Sensing

    cs.LG 2026-06 unverdicted novelty 7.0

    BCG-FM, the first foundation model for ambient BCG, achieves 3.26-year MAE on biological age estimation and discriminates 15 health conditions using frozen embeddings from participant-level contrastive pretraining on ...

  2. Physiology-Aware Masked Cross-Modal Reconstruction for Biosignal Representation Learning

    cs.LG 2026-05 unverdicted novelty 7.0

    xMAE pretrains biosignal representations via masked cross-modal reconstruction of temporally ordered signals like ECG and PPG, outperforming baselines on 15 of 19 downstream tasks including cardiovascular prediction a...

  3. WavesFM: Hierarchical Representation Learning for Longitudinal Wearable Sensor Waveforms

    cs.LG 2026-05 unverdicted novelty 6.0

    WavesFM uses hierarchical SSL to pretrain a segment encoder on short waveforms followed by a temporal encoder on multi-day sequences, outperforming prior methods on 58 tasks after training on over 12 million hours of ...

  4. PRISM-CTG: A Foundation Model for Cardiotocography Analysis with Multi-View SSL

    cs.LG 2026-04 unverdicted novelty 6.0

    PRISM-CTG is the first large-scale foundation model for cardiotocography that uses multi-view self-supervised learning on unlabeled data to learn transferable representations, outperforming baselines on seven downstre...