Towards Green Wearable Computing: A Physics-Aware Spiking Neural Network for Energy-Efficient IMU-based Human Activity Recognition
Pith reviewed 2026-05-10 16:16 UTC · model grok-4.3
The pith
A physics-aware spiking network matches state-of-the-art accuracy in wearable activity recognition while cutting dynamic energy use by up to 98 percent.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PAS-Net is a fully multiplier-free spiking architecture that spatially enforces human-joint physical constraints via an adaptive symmetric topology mixer and temporally adapts to non-stationary movement rhythms with an O(1)-memory causal neuromodulator. These elements replace dense floating-point operations with sparse 0.1 pJ integer accumulations and unlock a flexible early-exit mechanism through a temporal spike error objective, achieving state-of-the-art accuracy and up to 98 percent reduction in dynamic energy consumption on seven diverse IMU datasets.
What carries the argument
The adaptive symmetric topology mixer that enforces joint physical constraints together with the O(1)-memory causal neuromodulator that produces context-aware dynamic threshold neurons, enabling event-driven sparse computations and early exits in a spiking network for biomechanical signals.
If this is right
- Battery-constrained wearables can run continuous activity monitoring without fixed processing windows or heavy buffering.
- Neuromorphic chips can implement the sparse integer operations directly for further power reductions in real hardware.
- Physical joint constraints improve handling of real biomechanical data streams compared with generic network designs.
- The early-exit approach supports flexible, confidence-based decisions on streaming sensor input.
- The overall design provides a template for ultra-low-power always-on sensing in edge devices.
Where Pith is reading between the lines
- The topology mixer could be adapted to other multi-sensor body models or full kinematic chains for broader tracking tasks.
- Combining the early-exit mechanism with additional compression techniques might yield further gains in non-spiking networks.
- Hardware measurements on actual neuromorphic processors would clarify how close the reported 0.1 pJ accumulations come to practical device limits.
- The method may transfer to other temporal sensor problems where physical constraints and non-stationary patterns appear.
Load-bearing premise
The adaptive symmetric topology mixer successfully enforces human-joint physical constraints and the O(1)-memory causal neuromodulator adapts to non-stationary rhythms without losing accuracy when all floating-point operations are replaced by integer accumulations.
What would settle it
If PAS-Net shows accuracy below state-of-the-art levels or energy savings below 80 percent when tested on a new large IMU dataset with varied subjects and activities, the central performance claims would be falsified.
Figures
read the original abstract
Wearable IMU-based Human Activity Recognition (HAR) relies heavily on Deep Neural Networks (DNNs), which are burdened by immense computational and buffering demands. Their power-hungry floating-point operations and rigid requirement to process complete temporal windows severely cripple battery-constrained edge devices. While Spiking Neural Networks (SNNs) offer extreme event-driven energy efficiency, standard architectures struggle with complex biomechanical topologies and temporal gradient degradation. To bridge this gap, we propose the Physics-Aware Spiking Neural Network (PAS-Net), a fully multiplier-free architecture explicitly tailored for Green HAR. Spatially, an adaptive symmetric topology mixer enforces human-joint physical constraints. Temporally, an $O(1)$-memory causal neuromodulator yields context-aware dynamic threshold neurons, adapting actively to non-stationary movement rhythms. Furthermore, we leverage a temporal spike error objective to unlock a flexible early-exit mechanism for continuous IMU streams. Evaluated across seven diverse datasets, PAS-Net achieves state-of-the-art accuracy while replacing dense operations with sparse 0.1 pJ integer accumulations. Crucially, its confidence-driven early-exit capability drastically reduces dynamic energy consumption by up to 98\%. PAS-Net establishes a robust, ultra-low-power neuromorphic standard for always-on wearable sensing. The source code and pre-trained models are publicly available at https://github.com/zhengnaichuan2022/PAS-Net.git.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes PAS-Net, a fully multiplier-free spiking neural network for IMU-based human activity recognition. It introduces an adaptive symmetric topology mixer to enforce human-joint physical constraints, an O(1)-memory causal neuromodulator with dynamic thresholds for adapting to non-stationary temporal rhythms, and a temporal spike error objective that enables confidence-driven early-exit on continuous streams. The authors evaluate the model on seven diverse datasets and claim state-of-the-art accuracy together with up to 98% reduction in dynamic energy consumption via replacement of dense floating-point operations by sparse 0.1 pJ integer accumulations.
Significance. If the performance claims and the attribution to the physics-aware components hold under rigorous validation, the work would represent a meaningful advance in energy-efficient neuromorphic sensing for battery-constrained wearables. The public release of code and pre-trained models strengthens reproducibility and potential impact.
major comments (3)
- [§3.2] §3.2 (Adaptive Symmetric Topology Mixer): The description asserts enforcement of human-joint physical constraints yet supplies no explicit mechanism (hard constraint, symmetry loss term, graph Laplacian, or biomechanical prior injection) and no validation against ground-truth joint kinematics; without this, the contribution of the mixer to the reported SOTA accuracy cannot be isolated from standard SNN training.
- [§3.3] §3.3 (O(1)-memory causal neuromodulator): The claim that dynamic thresholds adapt to non-stationary movement rhythms without accuracy degradation lacks a stability analysis, gradient-flow derivation, or ablation on real IMU streams; if the neuromodulator introduces hidden temporal bias or vanishing gradients, the 98% early-exit energy saving cannot be attributed to the physics-aware design.
- [§5] §5 (Experimental Results): The abstract and results claim SOTA accuracy plus 98% energy reduction across seven datasets, but the provided text supplies no quantitative tables, baseline comparisons, error bars, or component-wise ablations; without these, the central attribution of gains to the novel components remains unsubstantiated.
minor comments (2)
- [§3.4] Notation for the temporal spike error objective is introduced without an equation number or explicit loss formulation, making reproduction difficult.
- [Figure 4] Figure captions for energy-consumption plots should explicitly state the measurement methodology (e.g., hardware platform, power model) rather than referring only to 'dynamic energy'.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback on our manuscript. We have reviewed the major comments carefully and will make revisions to improve the clarity and substantiation of our claims. Our point-by-point responses are provided below.
read point-by-point responses
-
Referee: [§3.2] §3.2 (Adaptive Symmetric Topology Mixer): The description asserts enforcement of human-joint physical constraints yet supplies no explicit mechanism (hard constraint, symmetry loss term, graph Laplacian, or biomechanical prior injection) and no validation against ground-truth joint kinematics; without this, the contribution of the mixer to the reported SOTA accuracy cannot be isolated from standard SNN training.
Authors: We agree with the referee that the description of the Adaptive Symmetric Topology Mixer in §3.2 requires more explicit details to demonstrate how it enforces human-joint physical constraints. In the revised manuscript, we will add a precise mathematical definition of the mixer, explaining the adaptive symmetric operations that preserve joint topology without requiring additional loss terms or external priors. We will also include an ablation study to isolate its contribution to the overall accuracy. Regarding validation against ground-truth joint kinematics, our study is based on standard IMU datasets which do not provide such kinematic ground truth; therefore, we will instead emphasize performance-based validation through ablations on the HAR tasks. revision: partial
-
Referee: [§3.3] §3.3 (O(1)-memory causal neuromodulator): The claim that dynamic thresholds adapt to non-stationary movement rhythms without accuracy degradation lacks a stability analysis, gradient-flow derivation, or ablation on real IMU streams; if the neuromodulator introduces hidden temporal bias or vanishing gradients, the 98% early-exit energy saving cannot be attributed to the physics-aware design.
Authors: We thank the referee for highlighting this important aspect. The revised version of the manuscript will incorporate a stability analysis for the dynamic thresholds in the causal neuromodulator, along with a derivation of the gradient flow to ensure no vanishing gradients or hidden biases are introduced. Furthermore, we will add ablations specifically on real IMU streams to demonstrate the adaptation to non-stationary rhythms and confirm the attribution of the energy savings to this component. revision: yes
-
Referee: [§5] §5 (Experimental Results): The abstract and results claim SOTA accuracy plus 98% energy reduction across seven datasets, but the provided text supplies no quantitative tables, baseline comparisons, error bars, or component-wise ablations; without these, the central attribution of gains to the novel components remains unsubstantiated.
Authors: We apologize for any lack of clarity in the experimental section. Although the original submission includes results on seven datasets, we acknowledge that comprehensive tables, baseline comparisons, error bars, and ablations may not have been sufficiently detailed. In the revision, we will expand §5 with full quantitative tables showing accuracy and energy metrics, comparisons to state-of-the-art methods, standard deviations from repeated experiments, and component-wise ablations for the topology mixer, neuromodulator, and early-exit mechanism. This will better substantiate the claims made in the abstract and results. revision: yes
Circularity Check
No circularity: claims rest on novel architectural components and empirical evaluation
full rationale
The paper's abstract and description introduce PAS-Net via two new components (adaptive symmetric topology mixer and O(1)-memory causal neuromodulator) plus a temporal spike error objective for early-exit. No equations, derivations, or self-citations are present that reduce the stated accuracy or energy gains to fitted parameters, self-definitions, or prior author results by construction. The central claims are framed as outcomes of the proposed architecture evaluated across seven datasets, with no load-bearing step that collapses to tautology or renamed input. This is the expected non-finding for an architecture paper whose novelty lies in component design rather than a closed mathematical derivation.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Spiking neural networks can perform event-driven computation with lower energy than dense DNNs on temporal sensor data.
- domain assumption Human joint movements impose symmetric physical constraints that can be encoded into network topology.
invented entities (3)
-
Adaptive symmetric topology mixer
no independent evidence
-
O(1)-memory causal neuromodulator
no independent evidence
-
Temporal spike error objective
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Kerem Altun, Billur Barshan, and Orkun Tunçel. 2010. Comparative study on classifying human activities with miniature inertial and magnetic sensors.Pattern Recognition43, 10 (2010), 3605–3620
work page 2010
-
[2]
Davide Anguita, Alessandro Ghio, Luca Oneto, Xavier Parra, Jorge Luis Reyes-Ortiz, et al. 2013. A public domain dataset for human activity recognition using smartphones.. InEsann, Vol. 3. 3–4
work page 2013
-
[3]
Marc Bachlin, Meir Plotnik, Daniel Roggen, Inbal Maidan, Jeffrey M Hausdorff, Nir Giladi, and Gerhard Troster. 2009. Wearable assistant for Parkinson’s disease patients with the freezing of gait symptom.IEEE Transactions on Information Technology in Biomedicine14, 2 (2009), 436–446
work page 2009
-
[4]
Guillaume Bellec, Darjan Salaj, Anand Subramoney, Robert Legenstein, and Wolfgang Maass. 2018. Long short-term memory and learning-to-learn in networks of spiking neurons.Advances in neural information processing systems31 (2018)
work page 2018
-
[5]
Andreas Bulling, Ulf Blanke, and Bernt Schiele. 2014. A tutorial on human activity recognition using body-worn inertial sensors.ACM Computing Surveys (CSUR)46, 3 (2014), 1–33
work page 2014
-
[6]
Roman Chereshnev and Attila Kertész-Farkas. 2017. Hugadb: Human gait database for activity recognition from wearable inertial sensor networks. InInternational Conference on Analysis of Images, Social Networks and Texts. Springer, 131–141
work page 2017
- [7]
-
[8]
Nidhi Dua, Shiva Nand Singh, Vijay Bhaskar Semwal, and Sravan Kumar Challa. 2023. Inception inspired CNN-GRU hybrid network for human activity recognition.Multimedia Tools and Applications82, 4 (2023), 5369–5403
work page 2023
-
[9]
Yu Enokibori. 2024. rTsfNet: A DNN Model with Multi-head 3D Rotation and Time Series Feature Extraction for IMU-based Human Activity Recognition.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies8, 4 (2024), 1–26
work page 2024
-
[10]
Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, and Ishan Misra. 2023. Imagebind: One embedding space to bind them all. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition. 15180–15190
work page 2023
-
[11]
Harish Haresamudram, Chi Ian Tang, Sungho Suh, Paul Lukowicz, and Thomas Ploetz. 2025. Past, present, and future of sensor-based human activity recognition using wearables: A surveying tutorial on a still challenging task.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies9, 2 (2025), 1–44. 18•Trovato et al
work page 2025
-
[12]
Jun He, Qian Zhang, Liqun Wang, and Ling Pei. 2018. Weakly supervised human activity recognition from wearable sensors by recurrent attention learning.IEEE Sensors Journal19, 6 (2018), 2287–2297
work page 2018
-
[13]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. InProceedings of the IEEE conference on computer vision and pattern recognition. 770–778
work page 2016
-
[14]
Mark Horowitz. 2014. 1.1 computing’s energy problem (and what we can do about it). In2014 IEEE international solid-state circuits conference digest of technical papers (ISSCC). IEEE, 10–14
work page 2014
-
[15]
Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. InProceedings of the IEEE conference on computer vision and pattern recognition. 7132–7141
work page 2018
-
[16]
Andrey Ignatov. 2018. Real-time human activity recognition from accelerometer data using convolutional neural networks.Applied Soft Computing62 (2018), 915–922
work page 2018
-
[17]
Wenchao Jiang and Zhaozheng Yin. 2015. Human activity recognition using wearable sensors by deep convolutional neural networks. InProceedings of the 23rd ACM international conference on Multimedia. 1307–1310
work page 2015
-
[18]
Zanobya N Khan and Jamil Ahmad. 2021. Attention induced multi-head convolutional neural network for human activity recognition. Applied soft computing110 (2021), 107671
work page 2021
-
[19]
D. Lee, Y. Li, Y. Kim, S. Xiao, and P. Panda. 2025. Spiking transformer with spatial-temporal attention. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 13948–13958
work page 2025
-
[20]
Zikang Leng, Amitrajit Bhattacharjee, Hrudhai Rajasekhar, Lizhe Zhang, Elizabeth Bruda, Hyeokhyen Kwon, and Thomas Plötz. 2024. Imugpt 2.0: Language-based cross modality transfer for sensor-based human activity recognition.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies8, 3 (2024), 1–32
work page 2024
-
[21]
Haobo Li, Aman Shrestha, Hadi Heidari, Julien Le Kernec, and Francesco Fioranelli. 2019. Bi-LSTM network for multimodal continuous human activity recognition and fall detection.IEEE Sensors Journal20, 3 (2019), 1191–1201
work page 2019
-
[22]
Zechen Li, Shohreh Deldari, Linyao Chen, Hao Xue, and Flora D Salim. 2025. Sensorllm: Aligning large language models with motion sensors for human activity recognition. InProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 354–379
work page 2025
-
[23]
Runze Liu, Taiting Lu, Shengming Yuan, Hao Zhou, and Mahanth Gowda. 2024. SmartDampener: an open source platform for sport analytics in tennis.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies8, 3 (2024), 1–30
work page 2024
- [24]
-
[25]
L. Lu, C. Zhang, K. Cao, T. Deng, and Q. Yang. 2022. A multichannel CNN-GRU model for human activity recognition.IEEE Access10 (2022), 66797–66810
work page 2022
-
[26]
Changze Lv, Dongqi Han, Yansen Wang, Xiaoqing Zheng, Xuanjing Huang, and Dongsheng Li. 2024. Advancing spiking neural networks for sequential modeling with central pattern generators.Advances in Neural Information Processing Systems37 (2024), 26915–26940
work page 2024
- [27]
- [28]
-
[29]
Wolfgang Maass. 1997. Networks of spiking neurons: the third generation of neural network models.Neural networks10, 9 (1997), 1659–1671
work page 1997
-
[30]
Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Aparajita Saraf, Amy Bearman, and Babak Damavandi. 2023. IMU2CLIP: language- grounded motion sensor translation with multimodal contrastive learning. InFindings of the Association for Computational Linguistics: EMNLP 2023. 13246–13253
work page 2023
-
[31]
Francisco Javier Ordóñez and Daniel Roggen. 2016. Deep convolutional and lstm recurrent neural networks for multimodal wearable activity recognition.Sensors16, 1 (2016), 115
work page 2016
- [32]
-
[33]
Shyamal Patel, Hyung Park, Paolo Bonato, Leighton Chan, and Mary Rodgers. 2012. A review of wearable sensors and systems with application in rehabilitation.Journal of neuroengineering and rehabilitation9, 1 (2012), 21
work page 2012
-
[34]
Valentin Radu, Catherine Tong, Sourav Bhattacharya, Nicholas D Lane, Cecilia Mascolo, Mahesh K Marina, and Fahim Kawsar. 2018. Multimodal deep learning for activity and context recognition.Proceedings of the ACM on interactive, mobile, wearable and ubiquitous technologies1, 4 (2018), 1–27
work page 2018
-
[35]
Daniele Ravi, Charence Wong, Benny Lo, and Guang-Zhong Yang. 2016. Deep learning for human activity recognition: A resource efficient implementation on low-power devices. In2016 IEEE 13th international conference on wearable and implantable body sensor networks (BSN). IEEE, 71–76
work page 2016
-
[36]
Attila Reiss and Didier Stricker. 2012. Introducing a new benchmarked dataset for activity monitoring. In2012 16th international symposium on wearable computers. IEEE, 108–109. Towards Green Wearable Computing: A Physics-Aware Spiking Neural Network for Energy-Efficient IMU-based Human Activity Recognition•19
work page 2012
-
[37]
Mutegeki Ronald, Alwin Poulose, and Dong Seog Han. 2021. iSPLInception: an inception-ResNet deep learning architecture for human activity recognition.IEEE Access9 (2021), 68985–69001
work page 2021
-
[38]
Charissa Ann Ronao and Sung-Bae Cho. 2016. Human activity recognition with smartphone sensors using deep learning neural networks.Expert systems with applications59 (2016), 235–244
work page 2016
-
[39]
Kaushik Roy, Akhilesh Jaiswal, and Priyadarshini Panda. 2019. Towards spike-based machine intelligence with neuromorphic computing. Nature575, 7784 (2019), 607–617
work page 2019
-
[40]
Roy Schwartz, Jesse Dodge, Noah A Smith, and Oren Etzioni. 2020. Green ai.Commun. ACM63, 12 (2020), 54–63
work page 2020
-
[41]
Chi Ian Tang, Ignacio Perez-Pozuelo, Dimitris Spathis, Soren Brage, Nicholas Wareham, and Cecilia Mascolo. 2021. Selfhar: Improving human activity recognition through self-training with unlabeled data.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies5, 1 (2021), 1–30
work page 2021
-
[42]
Astrid Ustad, Aleksej Logacjov, Stine Øverengen Trollebø, Pernille Thingstad, Beatrix Vereijken, Kerstin Bach, and Nina Skjæret Maroni
-
[43]
Validation of an activity type recognition model classifying daily physical behavior in older adults: the HAR70+ model.Sensors23, 5 (2023), 2368
work page 2023
-
[44]
Yan Wang, Xin Wang, Hongmei Yang, Yingrui Geng, Hongnian Yu, Ge Zheng, and Liang Liao. 2023. MhaGNN: A novel framework for wearable sensor-based human activity recognition combining multi-head attention and graph neural networks.IEEE Transactions on Instrumentation and Measurement72 (2023), 1–14
work page 2023
-
[45]
Yujie Wu, Lei Deng, Guoqi Li, Jun Zhu, and Luping Shi. 2018. Spatio-temporal backpropagation for training high-performance spiking neural networks.Frontiers in neuroscience12 (2018), 331
work page 2018
-
[46]
Kun Xia, Jianguang Huang, and Hanyu Wang. 2020. LSTM-CNN architecture for human activity recognition.Ieee Access8 (2020), 56855–56866
work page 2020
-
[47]
Hongjia Xu, Pengzhan Zhou, Rui Tan, and Mo Li. 2023. Practically adopting human activity recognition. InProceedings of the 29th Annual International Conference on Mobile Computing and Networking. 1–15
work page 2023
-
[48]
Huatao Xu, Pengfei Zhou, Rui Tan, Mo Li, and Guobin Shen. 2021. Limu-bert: Unleashing the potential of unlabeled data for imu sensing applications. InProceedings of the 19th ACM Conference on Embedded Networked Sensor Systems. 220–233
work page 2021
-
[49]
Yan Yan, Dali Chen, Yushi Liu, Jinjin Zhao, Bo Wang, Xuankun Wu, Xiaohao Jiao, Yuqian Chen, Huihui Li, and Xuchao Ren. 2021. TNDA-HAR. doi:10.21227/4epb-pg26
- [50]
-
[51]
Man Yao, Jiakui Hu, Zhaokun Zhou, Liane Yuan, Yonghong Tian, Bo Xu, and Guoqi Li. 2023. Spike-driven transformer. InAdvances in Neural Information Processing Systems, Vol. 36. 64043–64058
work page 2023
-
[52]
Bojian Yin, Federico Corradi, and Sander M Bohté. 2021. Accurate and efficient time-domain classification with adaptive spiking recurrent neural networks.Nature Machine Intelligence3, 10 (2021), 905–913
work page 2021
-
[53]
Ming Zeng, Le T Nguyen, Bo Yu, Ole J Mengshoel, Jiang Zhu, Pang Wu, and Joy Zhang. 2014. Convolutional neural networks for human activity recognition using mobile sensors. In6th international conference on mobile computing, applications and services. IEEE, 197–205
work page 2014
-
[54]
Mi Zhang and Alexander A Sawchuk. 2012. USC-HAD: A daily activity dataset for ubiquitous activity recognition using wearable sensors. InProceedings of the 2012 ACM conference on ubiquitous computing. 1036–1043
work page 2012
-
[55]
Yiming Zhang, Lin Wang, Hui Chen, Ao Tian, Shibo Zhou, and Yike Guo. 2022. IF-ConvTransformer: A framework for human activity recognition using IMU fusion and ConvTransformer.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 6, 2 (2022), 1–26
work page 2022
-
[56]
Chenlin Zhou, Han Zhang, Zhaokun Zhou, Liutao Yu, Liyuan Huang, Xiaopeng Fan, et al . 2024. Qkformer: Hierarchical spiking transformer using qk attention. InAdvances in Neural Information Processing Systems, Vol. 37. 13074–13098
work page 2024
- [57]
-
[58]
Zulun Zhu, Jiaying Peng, Jintang Li, Liang Chen, Qi Yu, and Siqiang Luo. 2022. Spiking graph convolutional networks.arXiv preprint arXiv:2205.02767(2022). A Detailed Network Architecture and Hyperparameters To ensure the rigorous reproducibility of our proposed framework, we provide the explicit internal tensor transformations and dataset-specific hyperpa...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.