Recognition: unknown
Towards Real-Time ECG and EMG Modeling on μNPUs
Pith reviewed 2026-05-10 05:52 UTC · model grok-4.3
The pith
PhysioLite matches Transformer accuracy on ECG and EMG signals while fitting in under 400KB for microcontroller NPUs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PhysioLite is a lightweight, NPU-compatible model architecture and training framework for ECG/EMG signal analysis. Using learnable wavelet filter banks, CPU-offloaded positional encoding, and hardware-aware layer design, PhysioLite reaches performance comparable to state-of-the-art Transformer-based foundation models on ECG and EMG benchmarks, while being <10% of the size (~370KB with 8-bit quantization). The authors also profile its component-wise latency and resource consumption on the MAX78000 and HX6538 μNPUs to show its viability for signal analysis on constrained, battery-powered hardware.
What carries the argument
Learnable wavelet filter banks combined with CPU-offloaded positional encoding and hardware-aware layers that replace dynamic attention to enable efficient inference on μNPUs.
If this is right
- Enables near-real-time, offline physiological signal analysis directly on wearable hardware.
- Supports privacy-preserving inference by keeping all processing local without data offload.
- Reduces model footprint to ~370KB after 8-bit quantization while matching benchmark scores.
- Demonstrates concrete latency and resource profiles on MAX78000 and HX6538 μNPUs.
Where Pith is reading between the lines
- The same wavelet-plus-hardware design might transfer to other time-series bio-signals such as EEG.
- Hardware vendors could add native support for wavelet banks to further speed up similar models.
- If the accuracy holds under patient-specific noise, it could support always-on monitoring features in consumer devices.
- Releasing the models and training code allows independent checks on additional public datasets.
Load-bearing premise
The combination of learnable wavelet banks, CPU-offloaded encoding, and hardware-aware layers will preserve accuracy across real-world noisy signals and varying hardware without dynamic attention.
What would settle it
A test set of noisy real-world ECG or EMG recordings where PhysioLite's accuracy falls substantially below that of the compared Transformer models.
Figures
read the original abstract
The miniaturisation of neural processing units (NPUs) and other low-power accelerators has enabled their integration into microcontroller-scale wearable hardware, supporting near-real-time, offline, and privacy-preserving inference. Yet physiological signal analysis has remained infeasible on such hardware; recent Transformer-based models show state-of-the-art performance but are prohibitively large for resource- and power-constrained hardware and incompatible with $\mu$NPUs due to their dynamic attention operations. We introduce PhysioLite, a lightweight, NPU-compatible model architecture and training framework for ECG/EMG signal analysis. Using learnable wavelet filter banks, CPU-offloaded positional encoding, and hardware-aware layer design, PhysioLite reaches performance comparable to state-of-the-art Transformer-based foundation models on ECG and EMG benchmarks, while being <10% of the size ($\sim$370KB with 8-bit quantization). We also profile its component-wise latency and resource consumption on both the MAX78000 and HX6538 WE2 $\mu$NPUs, demonstrating its viability for signal analysis on constrained, battery-powered hardware. We release our model(s) and training framework at: https://github.com/j0shmillar/physiolite.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces PhysioLite, a lightweight model architecture and training framework for ECG and EMG signal analysis on microcontroller-scale NPUs. It combines learnable wavelet filter banks, CPU-offloaded positional encoding, and hardware-aware layer design to claim performance comparable to state-of-the-art Transformer-based foundation models, while achieving a model size of ~370KB under 8-bit quantization and demonstrating viability through latency and resource profiling on the MAX78000 and HX6538 μNPUs. The models and framework are released publicly.
Significance. If the performance claims hold, the work is significant for enabling real-time, offline physiological signal processing on battery-powered wearables with strong privacy guarantees. The explicit hardware profiling on two specific μNPUs and the public release of models plus training code are clear strengths that support reproducibility and practical deployment.
major comments (1)
- [Abstract] Abstract: the claim that PhysioLite 'reaches performance comparable to state-of-the-art Transformer-based foundation models on ECG and EMG benchmarks' is load-bearing for the central contribution yet provides no quantitative metrics, named datasets, baseline scores, or statistical tests; without these the size-performance tradeoff cannot be evaluated.
minor comments (1)
- [Abstract] Abstract: the phrase 'near-real-time' is used without a concrete latency target (e.g., ms per inference) that would allow readers to judge the hardware profiling results.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and positive recommendation for minor revision. We address the single major comment below.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim that PhysioLite 'reaches performance comparable to state-of-the-art Transformer-based foundation models on ECG and EMG benchmarks' is load-bearing for the central contribution yet provides no quantitative metrics, named datasets, baseline scores, or statistical tests; without these the size-performance tradeoff cannot be evaluated.
Authors: We agree that the abstract would benefit from explicit quantitative support for the central claim. The manuscript body already contains the supporting experimental results, including performance metrics on named ECG and EMG benchmarks with direct comparisons to Transformer baselines. For the revision we will expand the abstract to include key quantitative metrics, dataset names, and baseline scores (while remaining within length limits) so that the size-performance tradeoff is immediately evaluable from the abstract alone. No changes to the underlying results or claims are required. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper introduces PhysioLite as an engineering architecture combining learnable wavelet filter banks, CPU-offloaded positional encoding, and hardware-aware layers to achieve compact size and μNPU compatibility. All performance claims are asserted via direct comparison to external Transformer baselines on standard ECG/EMG benchmarks, with code release for reproduction. No equations, derivations, or first-principles results are presented that reduce by construction to fitted inputs or self-citations; the central claims remain empirically grounded rather than tautological.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Manal Alghieth. 2025. DeepECG-Net: a hybrid transformer-based deep learning model for real-time ECG anomaly detection.Scientific Reports15, 1 (2025), 20714. doi:10.1038/s41598-025-07781-1
-
[2]
Analog Devices Inc. 2024. ai8x-synthesis: Quantization and Synthesis for MAX78000/MAX78002 Edge AI Devices. https://github.com/analogdevicesinc/ ai8x-synthesis. Accessed: 2026-04-02
2024
-
[3]
Arm Ltd. 2024. Arm Ethos-U85 NPU. https://www.arm.com/products/silicon-ip- cpu/ethos/ethos-u85. Accessed: 2025-11-08
2024
-
[4]
and NXP Semiconductors
Arm Ltd. and NXP Semiconductors. 2024. ethos-u-vela: Neural Network Compiler for Arm Ethos-U NPUs. https://github.com/nxp-imx/ethos-u-vela. Accessed: 2026-04-02
2024
-
[5]
Attia, Suraj Kapa, Francisco Lopez-Jimenez, Paul M
Zachi I. Attia, Suraj Kapa, Francisco Lopez-Jimenez, Paul M. McKie, Dorothy J. Ladewig, Gaurav Satam, Patricia A. Pellikka, Maurice Enriquez-Sarano, Peter A. Noseworthy, Thomas M. Munger, Samuel J. Asirvatham, Christopher G. Scott, Rickey E. Carter, and Paul A. Friedman. 2019. Screening for cardiac contractile dysfunction using an artificial intelligence–...
-
[6]
Benalcázar, Lorena Barona, Leonardo Valdivieso, Xavier Aguas, and Jonathan Zea
Marco E. Benalcázar, Lorena Barona, Leonardo Valdivieso, Xavier Aguas, and Jonathan Zea. 2020. EMG-EPN-612 Dataset [Dataset]. Zenodo. doi:10.5281/ zenodo.4421500
2020
-
[7]
Marco E. Benalcázar, Andrés G. Jaramillo, Jonathan, A. Zea, Andrés Páez, and Víctor Hugo Andaluz. 2017. Hand gesture recognition using machine learning and the Myo armband. In2017 25th European Signal Processing Conference (EUSIPCO). 1040–1044. doi:10.23919/EUSIPCO.2017.8081366
- [8]
- [9]
-
[10]
Subject-aware contrastive learning for biosignals
Joseph Y. Cheng, Hanlin Goh, Kaan Dogrusoz, Oncel Tuzel, and Er- drin Azemi. 2020. Subject-Aware Contrastive Learning for Biosignals. arXiv:2007.04871 [cs.LG] https://arxiv.org/abs/2007.04871
-
[11]
Romain Cosentino and Behnaam Aazhang. 2020. Learnable Group Transform for Time-Series. InProceedings of the 37th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 119). PMLR, 2164–2173. https://proceedings.mlr.press/v119/cosentino20a.html
2020
-
[12]
Robert David, Jared Duke, Advait Jain, Vijay Janapa Reddi, Nat Jeffries, Jian Li, Nick Kreeger, Ian Nappier, Meghna Natraj, Shlomi Regev, Rocky Rhodes, Tiezhen Wang, and Pete Warden. 2021. TensorFlow Lite Micro: Embedded Machine Learning on TinyML Systems. arXiv:2010.08678 [cs.LG] https://arxiv.org/abs/ 2010.08678
-
[13]
De Luca, L
Carlo J. De Luca, L. Donald Gilmore, Mikhail Kuznetsov, and Serge H. Roy
-
[14]
Filtering the surface EMG signal: Movement artifact and baseline noise contamination.Journal of Biomechanics43, 8 (2010), 1573–1579. doi:10.1016/j. jbiomech.2010.01.027
work page doi:10.1016/j 2010
-
[15]
Alberto Dequino, Luca Bompani, Luca Benini, and Francesco Conti. 2025. Op- timizing BFloat16 Deployment of Tiny Transformers on Ultra-Low Power Ex- treme Edge SoCs.Journal of Low Power Electronics and Applications15, 1 (2025). doi:10.3390/jlpea15010008
-
[16]
Ethan Eddy, Erik J Scheme, and Scott Bateman. 2023. A Framework and Call to Action for the Future Development of EMG-Based Input in HCI. InProceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI ’23). ACM, 1–23. doi:10.1145/3544548.3580962
-
[17]
Adams, Matthew Mattina, and Paul N
Igor Fedorov, Ryan P. Adams, Matthew Mattina, and Paul N. Whatmough. 2019. SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Micro- controllers. arXiv:1905.12107 [cs.LG] https://arxiv.org/abs/1905.12107
-
[18]
Sebastian Frey, Marco Guermandi, Simone Benatti, Victor Kartsch, Andrea Cosset- tini, and Luca Benini. 2023. BioGAP: A 10-Core FP-Capable Ultra-Low Power IoT Processor with Medical-Grade AFE and BLE Connectivity for Wearable Biosignal Processing.arXiv preprint arXiv:2307.01619(2023)
- [19]
-
[20]
Brian Gow, Tom Pollard, Larry A Nathanson, Alistair Johnson, Benjamin Moody, Chrystinne Fernandes, Nathaniel Greenbaum, Jonathan W Waks, Parastou Eslami, Tanner Carbonati, Ashish Chaudhari, Elizabeth Herbst, Dana Moukheiber, Seth Berkowitz, Roger Mark, and Steven Horng. 2023. MIMIC-IV-ECG: Diagnostic Electrocardiogram Matched Subset.PhysioNet(Sept. 2023)....
-
[21]
Ahmed Imtiaz Humayun, Shabnam Ghaffarzadegan, Zhe Feng, and Taufiq Hasan
-
[22]
Learning Front-end Filter-bank Parameters using Convolutional Neural Networks for Abnormal Heart Sound Detection. In2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, 1408–1411. doi:10.1109/embc.2018.8512578
- [23]
-
[24]
Bhabesh Kalita, Nabamita Deb, and Daisy Das. 2024. AnEEG: leveraging deep learning for effective artifact removal in EEG data.Scientific Reports14, 1 (2024), 24234. doi:10.1038/s41598-024-75091-z
-
[25]
Victor Javier Kartsch, Simone Benatti, Fiorenzo Artoni, Silvestro Micera, and Luca Benini. 2025. Open-source RISC-V platforms for embedded medical grade EMG processing: Are we there yet?Microprocessors and Microsystems(2025), 105239
2025
-
[26]
Albert, Joel Xue, Aarya Parekh, Reza Sameni, Matthew A
Zuzana Koscova, Qiao Li, Chad Robichaux, Valdery Moura Junior, Manohar Ghanta, Aditya Gupta, Jonathan Rosand, Aaron Aguirre, Shenda Hong, David E. Albert, Joel Xue, Aarya Parekh, Reza Sameni, Matthew A. Reyna, M. Brandon Westover, and Gari D. Cli- ford. 2024. The Harvard-Emory ECG Database.medRxiv(2024). arXiv:https://www.medrxiv.org/content/early/2024/10...
-
[27]
N. Krilova, I. Kastalskiy, V. Kazantsev, V. Makarov, and S. Lobov. 2018. EMG Data for Gestures [Dataset]. UCI Machine Learning Repository. doi:10.24432/C5ZP5C
-
[28]
Jun Li, Aaron Aguirre, Junior Moura, Che Liu, Lanhai Zhong, Chenxi Sun, Gari Clifford, Brandon Westover, and Shenda Hong. 2025. An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains. arXiv:2410.04133 [cs.LG] https://arxiv.org/abs/2410. 04133
-
[29]
Yinan Liang, Ziwei Wang, Xiuwei Xu, Yansong Tang, Jie Zhou, and Jiwen Lu
-
[30]
arXiv:2310.16898 [cs.CV] https://arxiv.org/abs/2310.16898
MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory. arXiv:2310.16898 [cs.CV] https://arxiv.org/abs/2310.16898
-
[31]
Edgar Liberis, Łukasz Dudziak, and Nicholas D. Lane. 2021. 𝜇NAS: Constrained Neural Architecture Search for Microcontrollers. InProceedings of the 1st Work- shop on Machine Learning and Systems(Online, United Kingdom)(EuroML- Sys ’21). Association for Computing Machinery, New York, NY, USA, 70–79. doi:10.1145/3437984.3458836
-
[32]
Cheng Liu, E.Y.-K. Ng, et al. 2018. An Open Access Database for Evaluating the Algorithms of the 1st China Physiological Signal Challenge 2018. Proceedings of the 1st China Physiological Signal Challenge (CPSC) 2018, 7th International Conference on Biomedical Engineering and Biotechnology (ICBEB 2018). doi:10. 1166/jmihi.2018.2442
- [33]
-
[34]
Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. arXiv:1711.05101 [cs.LG] https://arxiv.org/abs/1711.05101
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[35]
2021.MAX78000 Microcontroller with Ultra-Low-Power Con- volutional Neural Network Accelerator
Maxim Integrated. 2021.MAX78000 Microcontroller with Ultra-Low-Power Con- volutional Neural Network Accelerator. Maxim Integrated. https://www.analog. com/media/en/technical-documentation/data-sheets/MAX78000.pdf Data Sheet Rev 1; 19-100868; 5/21
2021
- [36]
-
[37]
Gabriel Michau, Gaetan Frusque, and Olga Fink. 2022. Fully learnable deep wavelet transform for unsupervised monitoring of high-frequency time series. Proceedings of the National Academy of Sciences119, 8 (Feb. 2022). doi:10.1073/ pnas.2106598119
2022
-
[38]
Josh Millar, Yushan Huang, Sarab Sethi, Hamed Haddadi, and Anil Madhavapeddy
-
[39]
arXiv:2503.22567 [cs.LG] https: //arxiv.org/abs/2503.22567
Benchmarking Ultra-Low-Power 𝜇NPUs. arXiv:2503.22567 [cs.LG] https: //arxiv.org/abs/2503.22567
-
[40]
Monsoon Solutions Inc. 2023. High Voltage Power Monitor. https://www.msoon. com/high-voltage-power-monitor Accessed: 2026-04-02
2023
-
[41]
Arthur Moss, Hyunjong Lee, Lei Xun, Chulhong Min, Fahim Kawsar, and Alessandro Montanari. 2023. Ultra-Low Power DNN Accelerators for IoT: Re- source Characterization of the MAX78000. InProceedings of the 20th ACM Con- ference on Embedded Networked Sensor Systems(Boston, Massachusetts)(Sen- Sys ’22). Association for Computing Machinery, New York, NY, USA, ...
-
[42]
Oliver and F
N. Oliver and F. Flores-Mangas. 2006. HealthGear: a real-time wearable system for monitoring and analyzing physiological signals. InInternational Workshop on Towards Real-Time ECG and EMG Modeling onµNPUs SenSys ’26, May 11–14, 2026, Saint Malo, France Wearable and Implantable Body Sensor Networks (BSN’06). 4 pp.–64. doi:10.1109/ BSN.2006.27
2006
-
[43]
Stefano Pizzolato, Luca Tagliapietra, Matteo Cognolato, Monica Reggiani, Hen- ning Müller, and Manfredo Atzori. 2017. Comparison of six electromyography acquisition setups on hand movement classification tasks.PLOS ONE12, 10 (2017), e0186132. doi:10.1371/journal.pone.0186132
-
[44]
C. Rajashekar Reddy, Vivian Dsouza, Ashok Samraj Thangarajan, Przemysław Pawełczak, Fahim Kawsar, and Alessandro Montanari. 2025. BioPulse: Towards Enabling Perpetual Vital Signs Monitoring Using a Body Patch. InProceedings of the 26th International Workshop on Mobile Computing Systems and Applications (HotMobile ’25). Association for Computing Machinery,...
-
[45]
Matthew A Reyna, Nadi Sadr, Erick A Perez Alday, Annie Gu, Amit J Shah, Chad Robichaux, Ali Bahrami Rad, Andoni Elola, Salman Seyedi, Sardar Ansari, Hamid Ghanbari, Qiao Li, Ashish Sharma, and Gari D Clifford. 2021. Will Two Do? Varying Dimensions in Electrocardiography: The PhysioNet/Computing in Cardiology Challenge 2021. In2021 Computing in Cardiology ...
-
[46]
Manuele Rusci, Alessandro Capotondi, and Luca Benini. 2020. Memory-Driven Mixed Low Precision Quantization for Enabling Deep Network Inference on Microcontrollers. InProceedings of Machine Learning and Systems, I. Dhillon, D. Papailiopoulos, and V. Sze (Eds.), Vol. 2. 326–335. https://proceedings.mlsys. org/paper_files/paper/2020/file/0c8abcf158ed12d0dd94...
2020
-
[47]
Swapnil Sayan Saha, Sandeep Singh Sandha, and Mani Srivastava. 2022. Machine Learning for Microcontroller-Class Hardware: A Review.IEEE Sensors Journal 22, 22 (2022), 21362–21390. doi:10.1109/JSEN.2022.3210773
- [48]
-
[49]
Jianlin Su, Yu Lu, Shengfeng Pan, Ahmed Murtadha, Bo Wen, and Yunfeng Liu. 2023. RoFormer: Enhanced Transformer with Rotary Position Embedding. arXiv:2104.09864 [cs.CL] https://arxiv.org/abs/2104.09864
work page internal anchor Pith review arXiv 2023
-
[50]
Vivienne Sze, Yu-Hsin Chen, Tien-Ju Yang, and Joel S. Emer. 2020. How to Evaluate Deep Neural Network Processors: TOPS/W (Alone) Considered Harmful. IEEE Solid-State Circuits Magazine12, 3 (2020), 28–41. doi:10.1109/MSSC.2020. 3002140
-
[51]
Jack Tchimino, Rehne Lessmann Hansen, Peter Holmberg Jørgensen, Jakob Dideriksen, and Strahinja Dosen. 2024. Application of EMG feedback for hand prosthesis control in high-level amputation: a case study.Scientific Reports14, 1 (2024), 31676. doi:10.1038/s41598-024-80828-x
-
[52]
GreenWaves Technologies. 2021. GAP9 Product Brief
2021
-
[53]
Himax Technologies. 2025. WiseEye2 AI Processor. https://www.himax.com. tw/products/wiseeye-ai-sensing/wiseeye2-ai-processor/ Accessed: 2025-03-12
2025
-
[54]
Asher Trockman and J. Zico Kolter. 2022. Patches Are All You Need? arXiv:2201.09792 [cs.CV] https://arxiv.org/abs/2201.09792
-
[55]
Patrick Wagner, Nils Strodthoff, Ralf-Dieter Bousseljot, Wojciech Samek, and Tobias Schaeffter. 2022. PTB-XL, a large publicly available electrocardiography dataset (version 1.0.3). PhysioNet. doi:10.13026/kfzx-aw45 RRID:SCR_007345
- [56]
-
[57]
Jianwei Zheng, Hangyuan Guo, and Huimin Chu. 2022. A large scale 12-lead electrocardiogram database for arrhythmia study (version 1.0.0). PhysioNet. doi:10.13026/wgex-er52 RRID:SCR_007345
-
[58]
Norbert Ádám, Dávid Val’ko, Zoltán Balogh, Branislav Madoš, and Ján Hurtuk
-
[59]
Comparative evaluation of filtration techniques for ECG signal denoising with emphasis on stationary wavelet transform.Scientific Reports15, 1 (2025), 42514. doi:10.1038/s41598-025-26476-1 A Ablation Study We conduct ablation studies to quantify the contribution of key architectural and training components, including knowledge dis- tillation, positional e...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.