KAN-MLP-Mixer: A comprehensive investigation of the usage of Kolmogorov-Arnold Networks (KANs) for improving IMU-based Human Activity Recognition
Pith reviewed 2026-06-30 18:12 UTC · model grok-4.3
The pith
A hybrid model using KAN for input embedding and classification with MLP layers in between improves IMU-based human activity recognition accuracy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that replacing all MLP components with KANs degrades accuracy and efficiency on noisy IMU data, but a selective hybrid architecture that uses a KAN-based input embedding layer, retains MLP layers for intermediate feature mixing, and adds a specialized LarctanKAN module for final classification yields consistent gains. On eight public HAR datasets the hybrid model delivers a 5.33 percent average relative improvement in macro F1 score over the pure-MLP baseline and outperforms both standalone KAN and MLP models. Applying the identical hybrid pattern to other state-of-the-art HAR networks likewise improves their results, showing that careful orchestration of KAN and MLP com
What carries the argument
The hybrid KAN-MLP architecture that places KAN modules only at the input embedding layer and as a LarctanKAN classifier while retaining MLP layers for intermediate feature mixing.
If this is right
- The hybrid strategy can be added to other existing HAR architectures to raise their accuracy without redesigning the full network.
- Selective use of KAN components preserves noise robustness while adding precision that pure MLP models lack on IMU signals.
- The approach yields more accurate and robust models for real-world wearable activity recognition tasks.
- Careful placement of KAN modules matters more than blanket replacement of MLP layers.
Where Pith is reading between the lines
- The same input-and-output KAN placement pattern may transfer to other noisy time-series classification problems beyond activity recognition.
- Efficiency comparisons between the hybrid and pure models on edge devices could reveal whether the accuracy gain comes at an acceptable compute cost.
- Further ablations that isolate the LarctanKAN classifier from the input embedding layer would clarify which component drives most of the observed lift.
Load-bearing premise
The performance gains arise specifically from the chosen placement of KAN modules rather than from differences in hyperparameter search effort or dataset-specific tuning.
What would settle it
Evaluating the same hybrid model and pure-MLP baseline on a new IMU HAR dataset with comparable noise levels and checking whether the 5.33 percent relative macro F1 improvement is reproduced.
Figures
read the original abstract
Kolmogorov-Arnold Networks (KANs) have demonstrated an exceptional ability to learn complex functions on clean, low-dimensional data but struggle to maintain performance on noisy and imperfect real-world datasets. In contrast, conventional multi-layer perceptrons (MLPs) are far more tolerant to noise and computationally efficient. Replacing all MLP components with KANs in HAR models often degrades accuracy and computation efficiency, highlighting an open challenge: how to combine KANs' precision with MLPs' noise robustness and efficiency. To address this, we systematically explore various placements of KAN modules within deep HAR networks and propose a hybrid architecture that strategically synergizes the strengths of both paradigms, which uses a KAN-based input embedding layer, retains MLP layers for intermediate feature mixing, and introduces a specialized LarctanKAN module for final activity classification. Across eight public HAR datasets, the hybrid KAN-MLP model achieves an average macro F1 score relative improvement of 5.33\% compared pure-MLP model, significantly outperforming standalone KAN and MLP baselines. Furthermore, integrating this hybrid strategy into other state-of-the-art HAR architectures consistently boosts their performance. Our findings demonstrate that a carefully orchestrated combination of KAN, MLP, or other conventional neural components yields more robust and accurate HAR models for real-world wearable sensing environments.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a hybrid KAN-MLP architecture for IMU-based human activity recognition that places a KAN module in the input embedding layer, retains MLP layers for feature mixing, and uses a specialized LarctanKAN classifier. It reports that this hybrid yields a 5.33% average relative improvement in macro F1 over pure-MLP baselines across eight public datasets, outperforms standalone KAN and MLP models, and can be integrated into other SOTA HAR architectures to improve their performance.
Significance. If the reported gains prove robust to hyperparameter matching and statistical controls, the work would demonstrate a practical way to combine KAN precision with MLP noise tolerance in real-world sensor data, potentially guiding hybrid designs for other noisy, high-dimensional tasks in wearable sensing.
major comments (2)
- [Abstract] Abstract: the central claim of a 5.33% average macro-F1 relative improvement is presented without per-dataset breakdowns, error bars, or statistical significance tests, which is load-bearing for the assertion that the specific KAN-MLP placement and LarctanKAN choice are the causal drivers rather than incidental factors.
- [Abstract] Abstract: the description of systematic architecture exploration does not reference ablation tables or controls that isolate the effect of KAN input embedding plus LarctanKAN classifier from differences in hyperparameter search budget or search space applied to the pure-MLP baselines.
minor comments (1)
- [Abstract] Abstract: the phrase "compared pure-MLP model" is missing the preposition "to".
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on the abstract. We address each comment below and will revise the manuscript to strengthen the presentation of results and controls.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim of a 5.33% average macro-F1 relative improvement is presented without per-dataset breakdowns, error bars, or statistical significance tests, which is load-bearing for the assertion that the specific KAN-MLP placement and LarctanKAN choice are the causal drivers rather than incidental factors.
Authors: The manuscript reports per-dataset macro-F1 scores, standard deviations across five random seeds, and paired statistical tests in the results section and supplementary tables. The abstract summarizes the average gain as the primary finding. We will revise the abstract to note the consistency of gains and statistical support, e.g., by adding a parenthetical reference to the detailed tables. revision: yes
-
Referee: [Abstract] Abstract: the description of systematic architecture exploration does not reference ablation tables or controls that isolate the effect of KAN input embedding plus LarctanKAN classifier from differences in hyperparameter search budget or search space applied to the pure-MLP baselines.
Authors: The full manuscript includes ablation studies that apply identical hyperparameter search budgets and spaces to all model variants. We will revise the abstract to explicitly reference these controlled ablations. revision: yes
Circularity Check
No circularity: empirical performance comparison on public datasets
full rationale
The paper conducts an empirical investigation of KAN placements in HAR models, reporting macro F1 improvements on eight public datasets without any derivation chain, equations, or self-citations that reduce claims to inputs by construction. The central result is a set of experimental comparisons whose validity rests on dataset benchmarks and baselines rather than tautological definitions or fitted-parameter renamings. No load-bearing self-citation or ansatz smuggling is present.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption KANs excel on clean low-dimensional data but degrade on noisy real-world sensor streams while MLPs are more robust and efficient
invented entities (1)
-
LarctanKAN module
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Sidra Abbas, Shtwai Alsubai, Muhammad Ibrar Ul Haque, Gabriel Avelino Sampedro, Ahmad Almadhor, Abdullah Al Hejaili, and Iryna Ivanochko
-
[2]
Active machine learning for heterogeneity activity recognition through smartwatch sensors.IEEE Access12 (2024), 22595–22607
2024
-
[3]
Reem Abdel-Salam, Rana Mostafa, and Mayada Hadhood. 2021. Human activity recognition using wearable sensors: review, challenges, evaluation benchmark. InInternational workshop on deep learning for human activity recognition. Springer, 1–15
2021
- [4]
-
[5]
Vladimir I. Arnold. 1963. On Functions of Three Variables.Doklady Akademii Nauk SSSR148 (1963), 9–12
1963
-
[6]
Vladimir I Arnold. 2009. On functions of three variables.Collected Works: Representations of Functions, Celestial Mechanics and KAM Theory, 1957–1965(2009), 5–8
2009
-
[7]
Marc Bachlin, Meir Plotnik, Daniel Roggen, Inbal Maidan, Jeffrey M Hausdorff, Nir Giladi, and Gerhard Troster. 2009. Wearable assistant for Parkinson’s disease patients with the freezing of gait symptom.IEEE Transactions on Information Technology in Biomedicine14, 2 (2009), 436–446
2009
-
[8]
Oresti Banos, Rafael Garcia, Juan A Holgado-Terriza, Miguel Damas, Hector Pomares, Ignacio Rojas, Alejandro Saez, and Claudia Villalonga. 2014. mHealthDroid: A novel framework for agile development of mobile health applications.Ambient Assisted Living and Daily Activities8868, 14 (2014), 91–98
2014
-
[9]
Sizhen Bian, Mengxi Liu, Bo Zhou, and Paul Lukowicz. 2022. The state-of-the-art sensing techniques in human activity recognition: A survey. Sensors22, 12 (2022), 4596
2022
- [10]
- [11]
- [12]
-
[13]
Blealtan Cao. 2024. An Efficient Implementation of Kolmogorov-Arnold Network. https://github.com/Blealtan/efficient-kan. Accessed: 2025-04-10
2024
-
[14]
Ricardo Chavarriaga, Hesam Sagha, Alberto Calatroni, Sundara Tejaswi Digumarti, Gerhard Tröster, José del R Millán, and Daniel Roggen. 2013. The Opportunity challenge: A benchmark database for on-body sensor-based activity recognition.Pattern Recognition Letters34, 15 (2013), 2033–2042
2013
-
[15]
Kaixuan Chen, Dalin Zhang, Lina Yao, Bin Guo, Zhiwen Yu, and Yunhao Liu. 2021. Deep learning for sensor-based human activity recognition: Overview, challenges, and opportunities.ACM Computing Surveys (CSUR)54, 4 (2021), 1–40
2021
- [16]
- [17]
-
[18]
Iveta Dirgová Luptáková, Martin Kubovčík, and Jiří Pospíchal. 2022. Wearable sensor-based human activity recognition with transformer model. Sensors22, 5 (2022), 1911
2022
- [19]
-
[20]
Yu Enokibori. 2024. rTsfNet: a DNN model with Multi-head 3D Rotation and Time Series Feature Extraction for IMU-based Human Activity Recognition.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies8, 4 (2024), 1–26
2024
-
[21]
Changjun Fan and Fei Gao. 2021. Enhanced human activity recognition using wearable sensors via a hybrid feature selection method.Sensors21, 19 (2021), 6434
2021
- [22]
-
[23]
Ahmed Dawod Mohammed Ibrahum, Zhengyu Shang, and Jang-Eui Hong. 2024. How Resilient Are Kolmogorov–Arnold Networks in Classification Tasks? A Robustness Investigation.Applied Sciences14, 22 (2024), 10173
2024
-
[24]
Petr Ivashkov, Po-Wei Huang, Kelvin Koor, Lirandë Pira, and Patrick Rebentrost. 2026. QKAN: quantum Kolmogorov-Arnold networks with applications in machine learning and multivariate state preparation.npj Quantum Information12, 1 (11 Mar 2026), 73. doi:10.1038/s41534-026-01202-5
-
[25]
Ali Jamali, Swalpa Kumar Roy, Danfeng Hong, Bing Lu, and Pedram Ghamisi. 2024. How to learn more? Exploring Kolmogorov–Arnold networks for hyperspectral image classification.Remote Sensing16, 21 (2024), 4015
2024
-
[26]
Zanobya N Khan and Jamil Ahmad. 2021. Attention induced multi-head convolutional neural network for human activity recognition.Applied soft computing110 (2021), 107671
2021
-
[27]
Benjamin C Koenig, Suyong Kim, and Sili Deng. 2024. KAN-ODEs: Kolmogorov–Arnold network ordinary differential equations for learning dynamical systems and hidden physics.Computer Methods in Applied Mechanics and Engineering432 (2024), 117397
2024
-
[28]
Andrei Nikolaevich Kolmogorov. 1957. On the representations of continuous functions of many variables by superposition of continuous functions of one variable and addition. InDokl. Akad. Nauk USSR, Vol. 114. 953–956. Manuscript submitted to ACM KAN-MLP-Mixer 23
1957
-
[29]
Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima, et al . 2024. Exploring the limitations of kolmogorov-arnold networks in classification: Insights to software training and hardware implementation. In2024 Twelfth International Symposium on Computing and Networking Workshops (CANDARW). IE...
2024
- [30]
-
[31]
Hanxiao Liu, Zihang Dai, David So, and Quoc V Le. 2021. Pay attention to mlps.Advances in neural information processing systems34 (2021), 9204–9215
2021
-
[32]
Mengxi Liu, Daniel Geißler, Dominique Nshimyimana, Sizhen Bian, Bo Zhou, and Paul Lukowicz. 2024. Initial investigation of kolmogorov-arnold networks (kans) as feature extractors for imu based human activity recognition. InCompanion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing. 500–506
2024
- [33]
-
[34]
Ziming Liu, Yixuan Wang, Sachin Vaidya, Fabian Ruehle, James Halverson, Marin Soljačić, Thomas Y Hou, and Max Tegmark. 2024. Kan: Kolmogorov-arnold networks.arXiv preprint arXiv:2404.19756(2024)
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[35]
Mohammad Malekzadeh, Richard Clegg, Andrea Cavallaro, and Hamed Haddadi. 2021. Dana: Dimension-adaptive neural architecture for multivariate sensor data.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies5, 3 (2021), 1–27
2021
-
[36]
Takeru Miyoshi, Makoto Koshino, and Hidetaka Nambo. 2025. Applying MLP-Mixer and gMLP to Human Activity Recognition.Sensors25, 2 (2025), 311
2025
-
[37]
Kamsiriochukwu Ojiako and Katayoun Farrahi. 2023. MLPs Are All You Need for Human Activity Recognition.Applied Sciences13, 20 (2023), 11154
2023
-
[38]
Ordóñez and Daniel Roggen
Francisco J. Ordóñez and Daniel Roggen. 2016. Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition.Sensors16, 1 (2016), 115
2016
-
[39]
Allan Pinkus. 1999. Approximation theory of the MLP model in neural networks.Acta numerica8 (1999), 143–195
1999
-
[40]
Eleonora Poeta, Flavio Giobergia, Eliana Pastor, Tania Cerquitelli, and Elena Baralis. 2024. A benchmarking study of kolmogorov-arnold networks on tabular data. In2024 IEEE 18th International Conference on Application of Information and Communication Technologies (AICT). IEEE, 1–6
2024
-
[41]
Attila Reiss and Didier Stricker. 2012. Introducing a new benchmarked dataset for activity monitoring. In2012 16th international symposium on wearable computers. IEEE, 108–109
2012
-
[42]
Reyes-Ortiz, Davide Anguita, Alessandro Ghio, Luca Oneto, and Xavier Parra
Jorge L. Reyes-Ortiz, Davide Anguita, Alessandro Ghio, Luca Oneto, and Xavier Parra. 2013.Human Activity Recognition Using Smartphones. doi:10.24432/C54S4K
-
[43]
Yoli Shavit and Itzik Klein. 2021. Boosting inertial-based human activity recognition with transformers.IEEE Access9 (2021), 53540–53547
2021
-
[44]
Haoran Shen, Chen Zeng, Jiahui Wang, and Qiao Wang. 2025. Reduced effectiveness of kolmogorov-arnold networks on functions with noise. In ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1–5
2025
- [45]
-
[46]
Yueyuan Sui, Minghui Zhao, Junxi Xia, Xiaofan Jiang, and Stephen Xia. 2024. Tramba: A hybrid transformer and mamba architecture for practical audio and bone conduction speech super resolution and enhancement on mobile and wearable platforms.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies8, 4 (2024), 1–29
2024
-
[47]
Ilya O Tolstikhin, Neil Houlsby, Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Andreas Steiner, Daniel Keysers, Jakob Uszkoreit, et al. 2021. Mlp-mixer: An all-mlp architecture for vision.Advances in neural information processing systems34 (2021), 24261–24272
2021
-
[48]
Juan Diego Toscano, Vivek Oommen, Alan John Varghese, Zongren Zou, Nazanin Ahmadi Daryakenari, Chenxi Wu, and George Em Karniadakis
-
[49]
From pinns to pikans: Recent advances in physics-informed machine learning.Machine Learning for Computational Science and Engineering1, 1 (2025), 1–43
2025
-
[50]
Yu-Hsuan Tseng and Chih-Yu Wen. 2023. Hybrid Learning Models for IMU-Based HAR with Feature Analysis and Data Correction.Sensors23, 18 (2023), 7802
2023
-
[51]
Yannick Werner, Akash Malemath, Mengxi Liu, Vitor Fortes Rey, Nikolaos Palaiodimopoulos, Paul Lukowicz, and Maximilian Kiefer-Emmanouilidis
-
[52]
doi:10.1038/s41598-025-22705-9
QuKAN: A Quantum Circuit Born Machine Approach to Quantum Kolmogorov Arnold Networks.Scientific Reports15, 1 (09 Oct 2025), 35239. doi:10.1038/s41598-025-22705-9
- [53]
- [54]
-
[55]
Yafeng Yin, Lei Xie, Zhiwei Jiang, Fu Xiao, Jiannong Cao, and Sanglu Lu. 2024. A systematic review of human activity recognition based on mobile devices: overview, progress and trends.IEEE Communications Surveys & Tutorials26, 2 (2024), 890–929
2024
-
[56]
Piero Zappi, Clemens Lombriser, Thomas Stiefmeier, Elisabetta Farella, Daniel Roggen, Luca Benini, and Gerhard Tröster. 2008. Activity recognition from on-body sensors: accuracy-power trade-off by dynamic sensor selection. InWireless Sensor Networks: 5th European Conference, EWSN 2008, Bologna, Italy, January 30-February 1, 2008. Proceedings. Springer, 17–33
2008
-
[57]
Licheng Zhang, Xihong Wu, and Dingsheng Luo. 2015. Recognizing human activities from raw accelerometer data using deep neural networks. In 2015 IEEE 14th International conference on machine learning and applications (ICMLA). IEEE, 865–870. Manuscript submitted to ACM 24 Liu et al
2015
-
[58]
Ye Zhang, Longguang Wang, Huiling Chen, Aosheng Tian, Shilin Zhou, and Yulan Guo. 2022. IF-ConvTransformer: A framework for human activity recognition using IMU fusion and ConvTransformer.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies6, 2 (2022), 1–26
2022
-
[59]
Yexu Zhou, Tobias King, Haibin Zhao, Yiran Huang, Till Riedel, and Michael Beigl. 2024. MLP-HAR: Boosting Performance and Efficiency of HAR Models on Edge Devices with Purely Fully Connected Layers. InProceedings of the 2024 ACM International Symposium on Wearable Computers. 133–139
2024
-
[60]
Yexu Zhou, Haibin Zhao, Yiran Huang, Till Riedel, Michael Hefenbrock, and Michael Beigl. 2022. Tinyhar: A lightweight deep learning model designed for human activity recognition. InProceedings of the 2022 ACM International Symposium on Wearable Computers. 89–93. Received 20 February 2007; revised 12 March 2009; accepted 5 June 2009 Manuscript submitted to ACM
2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.