pith. sign in

arxiv: 2606.11556 · v1 · pith:ARTE3ZS3new · submitted 2026-06-10 · 💻 cs.CR · cs.AI· cs.LG

Privacy-Preserving Federated Autoencoder for ECG Anomaly Detection on Edge Devices

Pith reviewed 2026-06-27 09:41 UTC · model grok-4.3

classification 💻 cs.CR cs.AIcs.LG
keywords federated learningdifferential privacyautoencoderECG anomaly detectionedge computingquantizationPTB-XL
0
0 comments X

The pith

Federated learning with differential privacy and quantization matches centralized ECG anomaly detection performance on edge devices.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes an end-to-end system for unsupervised ECG anomaly detection that meets privacy, edge deployment, and detection quality requirements simultaneously. It combines federated averaging across ten simulated hospitals, client-side differential privacy via DP-SGD, and post-training 8-bit quantization. The central empirical result is that federated models match or exceed centralized baselines across autoencoder architectures, and that the performance penalties from differential privacy and quantization are independent of each other. This matters for satisfying GDPR and HIPAA constraints while enabling real-time inference on hardware such as the Raspberry Pi 4 under non-IID data conditions.

Core claim

Federated learning matches or exceeds the centralized baseline across all architectures, with ConvAE federated AUROC reaching 0.782. An epsilon sweep identifies epsilon equals 4 as the recommended clinical operating point. INT8 quantization roughly halves model size and cuts Raspberry Pi 4 latency by up to 44 percent with less than 0.12 percent AUROC loss. DP and quantization penalties are empirically independent.

What carries the argument

The composition of Flower-based FedAvg across ten clients, client-side DP-SGD with a Renyi-DP accountant, and INT8 post-training quantization applied to VanillaAE, ConvAE, and VAE architectures for reconstruction-error anomaly detection on PTB-XL.

If this is right

  • Strong privacy guarantees at epsilon equals 4 can be used without forcing a further reduction in model size or accuracy.
  • Quantization can be applied after privacy training with additive rather than multiplicative cost.
  • The same three-component pipeline can be benchmarked on other 12-lead ECG datasets to check consistency of the independence result.
  • Edge deployment on AArch64 devices becomes feasible for continuous monitoring while meeting legal privacy standards.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The observed independence between DP noise and quantization error may generalize to other time-series sensor tasks such as EEG or blood-pressure monitoring.
  • Production use would still require separate defenses against federated-specific attacks such as model poisoning that the current evaluation does not address.
  • Replacing the simulated hospital partitions with actual institutional data splits would provide a stronger test of the non-IID claim.

Load-bearing premise

The ten simulated non-IID partitions of PTB-XL adequately proxy real cross-hospital data distributions and that reconstruction error serves as a reliable proxy for clinically meaningful ECG anomalies.

What would settle it

Running the same pipeline on ECG recordings collected from multiple distinct real hospitals, with anomalies labeled by cardiologists rather than derived from reconstruction error on simulated partitions.

Figures

Figures reproduced from arXiv: 2606.11556 by Aydin Abadi, Ghadah Albalawi, Ghouse Ibrahim Kaleelullah, Hilal Tutus, Jakub Kacper Szel\k{a}g, Kaan Arda Akyol, Maha Alghamdi, Mujeeb Ahmed, Rehmat Ullah, Sarah Al Subaiei, Shardul Kapse, Syed Mohammed Raheeb.

Figure 1
Figure 1. Figure 1: Proposed pipeline. PTB-XL ECGs are filtered, normalized, and split across 10 non-IID federated clients. Local autoencoders, with [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Detection quality versus model footprint for the two convolu [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Privacy and utility trade-off. The elbow between [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Per-class AUROC across diagnostic categories under varying [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
read the original abstract

Continuous electrocardiography (ECG) monitoring could surface rhythm abnormalities before they escalate into cardiovascular events. However, a deployable system must satisfy three requirements simultaneously: legal-grade privacy (GDPR, HIPAA), real-time inference on constrained edge hardware, and detection quality under non-IID cross-hospital data. We design and evaluate an end-to-end federated system addressing all three for unsupervised 12-lead ECG anomaly detection on PTB-XL dataset, combining three autoencoder families (VanillaAE, ConvAE, VAE), Flower-based federated averaging (FedAvg) across ten simulated hospitals, client-side differentially private SGD (DP-SGD) with a R\'enyi-DP accountant, and 8-bit integer (INT8) post-training quantization with Raspberry Pi 4 benchmarking. Our main contributions are: an empirical characterization of how these mechanisms compose, practical DP-specific recommendations, and technical and security insights for a clinically sensitive setting. Federated learning matches or exceeds the centralized baseline across all architectures (ConvAE federated area under the ROC curve, AUROC, $0.782$), and an $\varepsilon$ sweep identifies $\varepsilon=4$ as the recommended clinical operating point. INT8 quantization roughly halves model size and cuts Pi 4 latency by up to $44%$ with $<0.12%$ AUROC loss. Crucially, DP and quantization penalties are empirically independent, so practitioners need not trade a strong privacy guarantee for a compact edge footprint. To our knowledge, this is the first system combining federated learning, formal $(\varepsilon,\delta)$-DP, unsupervised reconstruction-based detection, and quantized AArch64 deployment.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript describes an end-to-end federated system for unsupervised 12-lead ECG anomaly detection using autoencoders (VanillaAE, ConvAE, VAE) on the PTB-XL dataset. It integrates Flower-based FedAvg across ten simulated hospitals, client-side DP-SGD with a Rényi-DP accountant, and 8-bit post-training quantization, with Raspberry Pi 4 benchmarking. Central empirical claims are that federated ConvAE achieves AUROC 0.782 matching or exceeding the centralized baseline, DP and quantization penalties are independent, quantization halves model size with up to 44% latency reduction and <0.12% AUROC loss, and ε=4 is the recommended clinical operating point.

Significance. If the reported measurements hold under the described experimental conditions, the work supplies concrete, reproducible metrics showing that federated averaging, formal (ε,δ)-DP, and INT8 quantization can be composed for edge deployment in a medical setting without forcing a privacy-efficiency trade-off. The empirical independence finding and the explicit ε sweep provide actionable guidance beyond abstract privacy claims.

major comments (2)
  1. [Experimental Setup] The ten simulated non-IID partitions of PTB-XL (described in the experimental setup) are load-bearing for the non-IID performance claim, yet the manuscript supplies insufficient detail on the exact partitioning procedure, per-client label or feature distributions, and any validation that these partitions approximate real cross-hospital heterogeneity; without this, the assertion that federated learning 'matches or exceeds' centralized performance cannot be fully assessed for robustness.
  2. [Results] Results tables reporting AUROC values (e.g., federated ConvAE at 0.782) and latency reductions omit standard deviations across random seeds, confidence intervals, or statistical significance tests comparing federated versus centralized runs; this weakens the quantitative claim that penalties from DP and quantization are independent and that performance 'matches or exceeds' the baseline.
minor comments (3)
  1. [Abstract] The abstract states that 'an ε sweep identifies ε=4 as the recommended clinical operating point' without stating the precise selection criterion (e.g., maximum allowable AUROC drop or a specific (ε,δ) target); this should be clarified in the main text.
  2. Hyperparameters (local epochs, learning rate, batch size, DP noise multiplier, clipping norm) are referenced but not collected in a single reproducibility table; adding one would improve clarity.
  3. Figure captions for the Raspberry Pi 4 latency and model-size plots should explicitly note the number of inference runs averaged and any temperature or power constraints used during benchmarking.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and the recommendation for minor revision. We address each major comment below and indicate the revisions that will be incorporated.

read point-by-point responses
  1. Referee: [Experimental Setup] The ten simulated non-IID partitions of PTB-XL (described in the experimental setup) are load-bearing for the non-IID performance claim, yet the manuscript supplies insufficient detail on the exact partitioning procedure, per-client label or feature distributions, and any validation that these partitions approximate real cross-hospital heterogeneity; without this, the assertion that federated learning 'matches or exceeds' centralized performance cannot be fully assessed for robustness.

    Authors: We agree that additional detail on the partitioning procedure is warranted to support reproducibility and robustness assessment. In the revised manuscript we will expand the experimental setup section with: (i) the exact algorithm used to create the ten non-IID partitions (including any patient-ID or demographic stratification rules), (ii) summary tables or figures of per-client feature and label distributions, and (iii) a short discussion of how the simulated heterogeneity relates to documented cross-hospital ECG variability in the literature. These additions will allow readers to evaluate the non-IID performance claims more rigorously. revision: yes

  2. Referee: [Results] Results tables reporting AUROC values (e.g., federated ConvAE at 0.782) and latency reductions omit standard deviations across random seeds, confidence intervals, or statistical significance tests comparing federated versus centralized runs; this weakens the quantitative claim that penalties from DP and quantization are independent and that performance 'matches or exceeds' the baseline.

    Authors: We acknowledge that reporting variability measures would strengthen the quantitative claims. In the revision we will (i) rerun the primary configurations with at least three random seeds, (ii) add standard deviations and 95% confidence intervals to the AUROC and latency tables, and (iii) include paired statistical tests (e.g., Wilcoxon signed-rank) between federated and centralized results. These changes will provide clearer support for the reported performance matching and the empirical independence of the DP and quantization penalties. revision: yes

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

This is an empirical evaluation paper reporting measured AUROC, latency, and model-size results from federated training, DP-SGD, and INT8 quantization on PTB-XL partitions. All headline claims (FL matching centralized baseline, empirical independence of DP and quantization penalties) are direct experimental outcomes on held-out data; no equation, ansatz, or uniqueness theorem is invoked that reduces the reported quantities to the inputs by construction. No self-citation chain or fitted-parameter renaming appears in the derivation of the central results.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The evaluation rests on the domain assumption that PTB-XL partitions simulate realistic hospital non-IID conditions and on the choice of epsilon=4 as the operating point; no new entities are postulated.

free parameters (2)
  • epsilon = 4
    Selected from sweep as recommended clinical operating point
  • number_of_clients = 10
    Fixed simulation size for federated averaging
axioms (1)
  • domain assumption PTB-XL dataset can be partitioned to represent non-IID cross-hospital distributions
    Invoked to justify the federated evaluation setup

pith-pipeline@v0.9.1-grok · 5899 in / 1275 out tokens · 23716 ms · 2026-06-27T09:41:09.605914+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

45 extracted references · 8 canonical work pages · 2 internal anchors

  1. [1]

    Deep Learning with Differential Privacy

    M. Abadi et al. “Deep Learning with Differential Privacy”. In:ACM CCS. 2016, pp. 308–318

  2. [2]

    Federated Learning and Differential Privacy Tech- niques on Multi-hospital Population-scale Electrocardiogram Data

    V . Agrawal et al. “Federated Learning and Differential Privacy Tech- niques on Multi-hospital Population-scale Electrocardiogram Data”. In: ICMHI. 2024, pp. 143–152

  3. [3]

    ECG De-Anonymization: Real-World Risks and a Privacy-by-Design Mitigation Strategy

    H. Aguelal and P. Palmieri. “ECG De-Anonymization: Real-World Risks and a Privacy-by-Design Mitigation Strategy”. In:CBMS. 2025, pp. 449–456

  4. [4]

    Unsupervised Transformer-Based Anomaly Detection in ECG Signals

    A. Alamr and A. Artoli. “Unsupervised Transformer-Based Anomaly Detection in ECG Signals”. In:Algorithms16 (2023), p. 152

  5. [5]

    https : / / anonymous

    Anonymous.ECG-Federated-AE: Anonymous Source Code Reposi- tory. https : / / anonymous . 4open . science / r / ecg - federated - ae - DB18. 2026

  6. [6]

    Cryptology ePrint Archive, Paper 2020/704

    Bell et al.Secure Single-Server Aggregation with (Poly)Logarithmic Overhead. Cryptology ePrint Archive, Paper 2020/704. 2020

  7. [7]

    Federated Disentangled Representation Learning for Unsupervised Brain Anomaly Detection

    C. I. Bercea et al. “Federated Disentangled Representation Learning for Unsupervised Brain Anomaly Detection”. In:Nat. Mach. Intell.4 (2022), pp. 685–695

  8. [8]

    Fusion of Personalized Federated Learning with Differential Privacy for Diagnosis of Arrhythmia Disease

    S. M. Bokhari et al. “Fusion of Personalized Federated Learning with Differential Privacy for Diagnosis of Arrhythmia Disease”. In:PLOS ONE20 (2025), e0327108

  9. [9]

    Federated Learning Strategies for Atrial Fibrillation Detection

    W. Chorney and S. H. Ling. “Federated Learning Strategies for Atrial Fibrillation Detection”. In:J. Exp. Theor. Anal.3 (2025), p. 23

  10. [10]

    Dwork and A

    C. Dwork and A. Roth.The Algorithmic Foundations of Differential Privacy. V ol. 9. Found. Trends Theor. Comput. Sci., 2014

  11. [11]

    Our Data, Ourselves: Privacy Via Distributed Noise Generation

    C. Dwork et al. “Our Data, Ourselves: Privacy Via Distributed Noise Generation”. In:EUROCRYPT. Springer, 2006, pp. 486–503

  12. [12]

    Elmir et al.Federated Learning with Gramian Angular Fields for Privacy-Preserving ECG Classification on Heterogeneous IoT Devices

    Y . Elmir et al.Federated Learning with Gramian Angular Fields for Privacy-Preserving ECG Classification on Heterogeneous IoT Devices. arXiv:2511.03753. 2025

  13. [13]

    Fu et al.Differentially Private Federated Learning: A Systematic Review

    J. Fu et al.Differentially Private Federated Learning: A Systematic Review. arXiv:2405.08299. 2025

  14. [14]

    Garreta Basora and M

    M. Garreta Basora and M. O. Mulayim.An Attention-Augmented VAE- BiLSTM Framework for Anomaly Detection in 12-Lead ECG Signals. arXiv:2510.05919. 2025

  15. [15]

    Ghazarian et al.Privacy-Preserving ECG Data Analysis with Differential Privacy: A Literature Review and A Case Study

    A. Ghazarian et al.Privacy-Preserving ECG Data Analysis with Differential Privacy: A Literature Review and A Case Study. arXiv:2406.13880. 2024

  16. [16]

    PhysioBank, PhysioToolkit, and PhysioNet

    A. L. Goldberger et al. “PhysioBank, PhysioToolkit, and PhysioNet”. In:Circulation101 (2000), e215–e220

  17. [17]

    Application of Federated Learning Tech- niques for Arrhythmia Classification Using 12-Lead ECG Signals

    D. M. J. Gutierrez et al. “Application of Federated Learning Tech- niques for Arrhythmia Classification Using 12-Lead ECG Signals”. In:ALGOCLOUD. Springer, 2023, pp. 38–65

  18. [18]

    Reliable ECG Anomaly Detection on Edge Devices for Internet of Medical Things Applications

    M. Hizem et al. “Reliable ECG Anomaly Detection on Edge Devices for Internet of Medical Things Applications”. In:Sensors25 (2025), p. 2496

  19. [19]

    Federated Learning-Based Privacy-Preserving ECG Signal Classification for IoMT

    F. Jiang et al. “Federated Learning-Based Privacy-Preserving ECG Signal Classification for IoMT”. In:ICCC. 2025, pp. 1–6

  20. [20]

    Secure, Privacy-Preserving and Federated Machine Learning in Medical Imaging

    G. A. Kaissis et al. “Secure, Privacy-Preserving and Federated Machine Learning in Medical Imaging”. In:Nat. Mach. Intell.2 (2020), pp. 305– 311

  21. [21]

    Cross-device Federated Unsupervised Learning for the Detection of Anomalies in Single-Lead Electrocardio- gram Signals

    M. Kapsecker and S. M. Jonas. “Cross-device Federated Unsupervised Learning for the Detection of Anomalies in Single-Lead Electrocardio- gram Signals”. In:PLOS Digit. Health4 (2025)

  22. [22]

    D. P. Kingma and M. Welling.Auto-Encoding Variational Bayes. arXiv:1312.6114. 2013

  23. [23]

    Quantizing deep convolutional networks for efficient inference: A whitepaper

    R. Krishnamoorthi.Quantizing Deep Convolutional Networks for Ef- ficient Inference: A Whitepaper. arXiv:1806.08342. 2018

  24. [24]

    Enhanced Federated Anomaly Detection through Autoencoders Using Summary Statistics-Based Thresholding

    S. Laridi et al. “Enhanced Federated Anomaly Detection through Autoencoders Using Summary Statistics-Based Thresholding”. In:Sci. Rep.14 (2024), p. 26704

  25. [25]

    Communication-Efficient Learning of Deep Net- works from Decentralized Data

    B. McMahan et al. “Communication-Efficient Learning of Deep Net- works from Decentralized Data”. In:AISTATS. 2017, pp. 1273–1282

  26. [26]

    Learning Differentially Private Recurrent Language Models

    H. B. McMahan et al. “Learning Differentially Private Recurrent Language Models”. In:ICLR. 2018

  27. [27]

    Rényi Differential Privacy

    I. Mironov. “Rényi Differential Privacy”. In:CSF. 2017, pp. 263–275

  28. [28]

    Nardi et al.Anomaly Detection through Unsupervised Federated Learning

    M. Nardi et al.Anomaly Detection through Unsupervised Federated Learning. arXiv:2209.04184. 2022

  29. [29]

    Unsupervised ECG Analysis: A Review

    K. Nezamabadi et al. “Unsupervised ECG Analysis: A Review”. In: IEEE Rev. Biomed. Eng.16 (2023), pp. 208–224

  30. [30]

    ECG-DPSHAP: An Approach towards Privacy- Preserving SHAP-Based Explainable AI for 12-Lead ECG Classifica- tion Model

    S. Nuannimnoi et al. “ECG-DPSHAP: An Approach towards Privacy- Preserving SHAP-Based Explainable AI for 12-Lead ECG Classifica- tion Model”. In:IC4e. 2025

  31. [31]

    PyTorch: An Imperative Style, High-Performance Deep Learning Library

    A. Paszke et al. “PyTorch: An Imperative Style, High-Performance Deep Learning Library”. In:NeurIPS. 2019, pp. 8024–8035

  32. [32]

    A Secure Multi-Modal Federated Transfer Learning System for ECG Classification

    N. Phan. “A Secure Multi-Modal Federated Transfer Learning System for ECG Classification”. In:Neurocomputing(2026)

  33. [33]

    How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy

    N. Ponomareva et al. “How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy”. In:JAIR77 (2023), pp. 1113–1201

  34. [34]

    Handling Data Heterogeneity in Federated Learning for Medical Imaging: A Review

    L. Qu et al. “Handling Data Heterogeneity in Federated Learning for Medical Imaging: A Review”. In:Med. Image Anal.82 (2022), p. 102570

  35. [35]

    Designing ECG Monitoring Healthcare System with Federated Transfer Learning and Explainable AI

    A. Raza et al. “Designing ECG Monitoring Healthcare System with Federated Transfer Learning and Explainable AI”. In:Knowl.-Based Syst.236 (2022), p. 107763

  36. [36]

    ECG-based Real-Time Arrhythmia Monitoring Using Quantized Deep Neural Networks: A Feasibility Study

    H. Ribeiro et al. “ECG-based Real-Time Arrhythmia Monitoring Using Quantized Deep Neural Networks: A Feasibility Study”. In:Comput. Biol. Med.143 (2022), p. 105249

  37. [37]

    Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL

    N. Strodthoff et al. “Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL”. In:IEEE J. Biomed. Health Inform.25 (2021), pp. 1519–1528

  38. [38]

    FEEL-ECG: Federated Edge Learning for Person- alized and Explainable ECG Monitoring with Adaptive Compression and Preprocessing

    K. Swetha et al. “FEEL-ECG: Federated Edge Learning for Person- alized and Explainable ECG Monitoring with Adaptive Compression and Preprocessing”. In:INDISCON. 2025, pp. 1–7

  39. [39]

    PTB-XL, a Large Publicly Available Electrocardio- graphy Dataset

    P. Wagner et al. “PTB-XL, a Large Publicly Available Electrocardio- graphy Dataset”. In:Sci. Data7 (2020), p. 154

  40. [40]

    Linkage Attacks Expose Identity Risks in Public ECG Data Sharing

    Z. Wang et al. “Linkage Attacks Expose Identity Risks in Public ECG Data Sharing”. In:IEEE EMBC. 2025

  41. [41]

    Federated Learning With Deep Neural Networks: A Privacy-Preserving Approach to Enhanced ECG Classification

    K. Weimann and T. O. F. Conrad. “Federated Learning With Deep Neural Networks: A Privacy-Preserving Approach to Enhanced ECG Classification”. In:IEEE J. Biomed. Health Inform.28 (2024)

  42. [42]

    World Health Organization.Cardiovascular Diseases (CVDs) – Fact Sheet. WHO. 2021

  43. [43]

    Group Normalization

    Y . Wu and K. He. “Group Normalization”. In:ECCV. 2018, pp. 3–19

  44. [44]

    Yousefpour et al.Opacus: User-Friendly Differential Privacy Li- brary in PyTorch

    A. Yousefpour et al.Opacus: User-Friendly Differential Privacy Li- brary in PyTorch. arXiv:2109.12298. 2021

  45. [45]

    A Two-Stage Differential Privacy Scheme for FL Based on Edge Intelligence

    L. Zhang et al. “A Two-Stage Differential Privacy Scheme for FL Based on Edge Intelligence”. In:IEEE J. Biomed. Health Inform.28 (2024), pp. 3349–3360