KAN-MLP-Mixer: A comprehensive investigation of the usage of Kolmogorov-Arnold Networks (KANs) for improving IMU-based Human Activity Recognition

Bo Zhou; Daniel Geißler; Francisco Calatrava Nicolas; Maximilian Kiefer-Emmanouilidis; Mengxi Liu; Paul Lukowicz; Sizhen Bian; Vitor Fortes

arxiv: 2605.19031 · v2 · pith:7KU43LGSnew · submitted 2026-05-18 · 💻 cs.AI · eess.SP

Mengxi Liu, Sizhen Bian, Vitor Fortes, Francisco Calatrava Nicolas, Daniel Geißler, Maximilian Kiefer-Emmanouilidis, Bo Zhou, Paul Lukowicz

Pith reviewed 2026-06-30 18:12 UTC · model grok-4.3

classification 💻 cs.AI eess.SP

keywords Kolmogorov-Arnold Networks · Human Activity Recognition · IMU sensors · Hybrid neural networks · Wearable sensing · Neural architecture search

The pith

A hybrid model using KAN for input embedding and classification with MLP layers in between improves IMU-based human activity recognition accuracy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper explores how to combine Kolmogorov-Arnold Networks with conventional multi-layer perceptrons in models that recognize human activities from inertial measurement unit sensor data. KANs learn precise functions well on clean low-dimensional inputs but lose accuracy on the noisy signals typical of real wearable recordings, while MLPs tolerate noise better and run more efficiently. Systematic tests of KAN placements show that restricting KAN modules to the input embedding layer and a final LarctanKAN classifier, while keeping MLP layers for intermediate mixing, produces the best results. This hybrid raises average macro F1 score by 5.33 percent relative to a pure-MLP baseline across eight public datasets and also lifts performance when the same pattern is added to other established HAR architectures.

Core claim

The central claim is that replacing all MLP components with KANs degrades accuracy and efficiency on noisy IMU data, but a selective hybrid architecture that uses a KAN-based input embedding layer, retains MLP layers for intermediate feature mixing, and adds a specialized LarctanKAN module for final classification yields consistent gains. On eight public HAR datasets the hybrid model delivers a 5.33 percent average relative improvement in macro F1 score over the pure-MLP baseline and outperforms both standalone KAN and MLP models. Applying the identical hybrid pattern to other state-of-the-art HAR networks likewise improves their results, showing that careful orchestration of KAN and MLP com

What carries the argument

The hybrid KAN-MLP architecture that places KAN modules only at the input embedding layer and as a LarctanKAN classifier while retaining MLP layers for intermediate feature mixing.
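
To make the placement concrete, here is a minimal forward-pass sketch in plain Python. It is not the authors' implementation. It assumes a flattened feature vector rather than the token and channel mixing of the real model, and it uses a hypothetical edge function `phi(x) = w * atan(k * x)` as a stand-in for both KAN modules; the actual LarctanKAN definition is not given on this page. All layer sizes, names, and the sample input are illustrative.

```python
import math
import random

def make_kan_layer(n_in, n_out, rng):
    """One (w, k) pair per edge; hypothetical edge function phi(x) = w * atan(k * x)."""
    return [[(rng.uniform(-1, 1), rng.uniform(0.5, 1.5)) for _ in range(n_in)]
            for _ in range(n_out)]

def kan_forward(layer, x):
    # KAN rule: each output is a sum of learnable univariate functions of the inputs.
    return [sum(w * math.atan(k * xi) for (w, k), xi in zip(row, x)) for row in layer]

def make_mlp_layer(n_in, n_out, rng):
    weights = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def mlp_forward(layer, x):
    weights, biases = layer
    # Linear map followed by a fixed ReLU nonlinearity.
    return [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, biases)]

def hybrid_forward(x, embed, mixers, classifier):
    h = kan_forward(embed, x)          # KAN-based input embedding
    for layer in mixers:               # MLP layers for intermediate feature mixing
        h = mlp_forward(layer, h)
    return kan_forward(classifier, h)  # KAN-style classifier head producing class logits

rng = random.Random(0)
n_features, hidden, n_classes = 6, 8, 4   # hypothetical sizes (3-axis accel + 3-axis gyro)
embed = make_kan_layer(n_features, hidden, rng)
mixers = [make_mlp_layer(hidden, hidden, rng) for _ in range(2)]
classifier = make_kan_layer(hidden, n_classes, rng)

sample = [0.1, -0.3, 0.98, 0.02, 0.0, -0.05]   # made-up IMU reading
logits = hybrid_forward(sample, embed, mixers, classifier)
predicted_class = max(range(n_classes), key=lambda c: logits[c])
```

The point of the sketch is only the ordering: learnable univariate functions sit at the two ends, and ordinary linear-plus-ReLU layers do the mixing in between.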

If this is right

The hybrid strategy can be added to other existing HAR architectures to raise their accuracy without redesigning the full network.
Selective use of KAN components preserves noise robustness while adding precision that pure MLP models lack on IMU signals.
The approach yields more accurate and robust models for real-world wearable activity recognition tasks.
Careful placement of KAN modules matters more than blanket replacement of MLP layers.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same input-and-output KAN placement pattern may transfer to other noisy time-series classification problems beyond activity recognition.
Efficiency comparisons between the hybrid and pure models on edge devices could reveal whether the accuracy gain comes at an acceptable compute cost.
Further ablations that isolate the LarctanKAN classifier from the input embedding layer would clarify which component drives most of the observed lift.
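
As a rough illustration of why the compute question matters, the sketch below compares parameter counts for one dense MLP layer and one B-spline KAN layer of the same width. It assumes the original spline formulation, where each edge carries `grid_size + spline_order` coefficients, and it counts only those coefficients. The paper's own KAN variants (including the single-parameterized LarctanKAN) may scale differently, and the layer sizes here are hypothetical.

```python
def mlp_layer_params(n_in, n_out):
    """Weights plus biases of one fully connected layer."""
    return n_in * n_out + n_out

def spline_kan_layer_params(n_in, n_out, grid_size, spline_order):
    """Spline coefficients only: one B-spline with (grid_size + spline_order)
    coefficients per input-output edge. Extra per-edge scales are ignored."""
    return n_in * n_out * (grid_size + spline_order)

# Hypothetical layer: 64 inputs, 64 outputs, 5 grid intervals, cubic splines.
mlp_count = mlp_layer_params(64, 64)
kan_count = spline_kan_layer_params(64, 64, grid_size=5, spline_order=3)
print(mlp_count)   # 4160
print(kan_count)   # 32768
```

Under these assumptions the KAN layer is roughly eight times larger, which is one reason restricting KAN modules to a few positions could matter on edge devices.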

Load-bearing premise

The performance gains arise specifically from the chosen placement of KAN modules rather than from differences in hyperparameter search effort or dataset-specific tuning.

What would settle it

Evaluating the same hybrid model and pure-MLP baseline on a new IMU HAR dataset with comparable noise levels and checking whether the 5.33 percent relative macro F1 improvement is reproduced.
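
For readers who want to run that check, the following is a small sketch of the two quantities involved: macro F1 on one dataset and the average relative improvement across datasets. It assumes the headline number averages per-dataset relative gains; the page does not say whether the paper does that or takes the relative gain of the averaged scores. The labels and scores below are made up for illustration.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over classes seen in y_true or y_pred."""
    classes = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in classes:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

def mean_relative_improvement(baseline, candidate):
    """Average over datasets of (candidate - baseline) / baseline, in percent."""
    gains = [(c - b) / b for b, c in zip(baseline, candidate)]
    return 100.0 * sum(gains) / len(gains)

# Toy labels for one dataset (hypothetical).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
score = macro_f1(y_true, y_pred)
print(round(score, 4))   # 0.6556

# Toy per-dataset macro F1 scores for a baseline and a candidate (hypothetical).
improvement = mean_relative_improvement([0.80, 0.50], [0.84, 0.55])
print(round(improvement, 2))   # 7.5
```

Note that a relative gain is sensitive to a low baseline: the same absolute lift counts for more on a hard dataset than on an easy one, so per-dataset numbers are worth reporting alongside the average.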

Figures

Figures reproduced from arXiv: 2605.19031 by Bo Zhou, Daniel Geißler, Francisco Calatrava Nicolas, Maximilian Kiefer-Emmanouilidis, Mengxi Liu, Paul Lukowicz, Sizhen Bian, Vitor Fortes.

**Figure 1.** Comparison of model predictions on synthetic functions representing typical characteristics of sensor data. The step function …

**Figure 2.** The proposed hybrid network architecture KAN-MLP-Mixer based on an empirical study. It consists of three modules: KAN …

**Figure 3.** Average performance improvement compared to the MLPHAR baseline across eight datasets using the hybrid model with …

**Figure 4.** Performance comparison for KAN-MLP-Mixer and MLPHAR models on five datasets under three sensor configurations (single …

**Figure 5.** Performance comparison for KAN-MLP-Mixer and MLPHAR models under three window size configurations. Numerical …

**Figure 6.** The extending hybrid design across diverse neural backbones, the original models only have the pure convolutional layers on …

**Figure 7.** Number of parameters in different components of KAN-MLP-Mixer and MLPHAR models. (The …

**Figure 8.** Parameter-efficiency when scaling models comparing between KAN-MLP-Mixer and MLPHAR across eight benchmark …

**Figure 9.** Computational efficiency comparison between KAN-MLP-Mixer and MLPHAR across eight benchmark datasets. For KAN …

read the original abstract

Kolmogorov-Arnold Networks (KANs) have demonstrated an exceptional ability to learn complex functions on clean, low-dimensional data but struggle to maintain performance on noisy and imperfect real-world datasets. In contrast, conventional multi-layer perceptrons (MLPs) are far more tolerant to noise and computationally efficient. Replacing all MLP components with KANs in HAR models often degrades accuracy and computation efficiency, highlighting an open challenge: how to combine KANs' precision with MLPs' noise robustness and efficiency. To address this, we systematically explore various placements of KAN modules within deep HAR networks and propose a hybrid architecture that strategically synergizes the strengths of both paradigms, which uses a KAN-based input embedding layer, retains MLP layers for intermediate feature mixing, and introduces a specialized LarctanKAN module for final activity classification. Across eight public HAR datasets, the hybrid KAN-MLP model achieves an average macro F1 score relative improvement of 5.33% compared pure-MLP model, significantly outperforming standalone KAN and MLP baselines. Furthermore, integrating this hybrid strategy into other state-of-the-art HAR architectures consistently boosts their performance. Our findings demonstrate that a carefully orchestrated combination of KAN, MLP, or other conventional neural components yields more robust and accurate HAR models for real-world wearable sensing environments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The KAN-MLP hybrid gives a modest reported lift on eight HAR datasets but the gains could easily trace to uneven tuning rather than the specific module placements.

The main takeaway is that this work identifies a workable hybrid placement—KAN at the input embedding, standard MLP layers for mixing, and a LarctanKAN at the classifier—and shows that swapping this pattern into several existing HAR models improves their numbers. That placement choice is concrete and not previously documented in the cited literature, so the architecture itself counts as a small but real increment for the IMU activity-recognition subfield.

What the paper does cleanly is run the same hybrid recipe across eight public datasets and report that the average macro-F1 rises 5.33% relative to a pure-MLP baseline, while also lifting other published architectures when the same substitution is applied. The abstract frames this as a deliberate search over module locations rather than a single lucky configuration, which is the right way to approach the question.

The soft spot is exactly the one flagged in the stress-test note. The abstract gives no error bars, no per-dataset variance, no statistical tests, and no description of how many hyper-parameter trials were allocated to the pure-MLP controls versus the hybrid. Without those controls it is hard to rule out that the observed delta comes from more search effort or incidental capacity differences rather than the KAN-MLP synergy. If the full paper contains matched search budgets and ablation tables that close this gap, the result strengthens; if not, the central claim stays provisional.
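
One way such a statistical check could look, sketched under assumptions: an exact paired sign-flip permutation test on per-dataset macro F1 differences. The choice of test is ours for illustration and is not taken from the paper, and the eight score pairs below are invented.

```python
from itertools import product

def paired_sign_flip_p_value(baseline, candidate):
    """Exact two-sided sign-flip permutation test on paired differences.

    Under the null hypothesis that the two models are interchangeable,
    each per-dataset difference is equally likely to have either sign.
    """
    diffs = [c - b for b, c in zip(baseline, candidate)]
    observed = abs(sum(diffs))
    at_least_as_extreme = 0
    total = 0
    for signs in product((1, -1), repeat=len(diffs)):
        total += 1
        stat = abs(sum(s * d for s, d in zip(signs, diffs)))
        if stat >= observed - 1e-12:   # small tolerance for float rounding
            at_least_as_extreme += 1
    return at_least_as_extreme / total

# Hypothetical macro F1 scores on eight datasets (not the paper's numbers).
baseline = [0.80, 0.75, 0.90, 0.65, 0.70, 0.85, 0.60, 0.78]
candidate = [0.84, 0.78, 0.91, 0.70, 0.74, 0.88, 0.66, 0.80]
p_value = paired_sign_flip_p_value(baseline, candidate)
print(p_value)   # 0.0078125
```

With eight datasets there are only 256 sign patterns, so the smallest attainable two-sided p-value is 2/256. A test like this says nothing about tuning effort; it only addresses whether a consistent per-dataset gain could plausibly be chance.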

The work is incremental rather than foundational, but the empirical protocol is straightforward and the domain (wearable sensing) is practical. A reader already working on IMU models could extract the placement recipe and test it in a few days. I would bring it to a reading group for the architecture diagram and the cross-dataset numbers, but I would not cite it until the tuning controls are visible.

It is worth sending to peer review. The question it asks is legitimate, the datasets are standard, and the hybrid idea is falsifiable with the right ablations. A referee can ask for the missing controls without needing to rewrite the paper.

Referee Report

2 major / 1 minor

Summary. The paper proposes a hybrid KAN-MLP architecture for IMU-based human activity recognition that places a KAN module in the input embedding layer, retains MLP layers for feature mixing, and uses a specialized LarctanKAN classifier. It reports that this hybrid yields a 5.33% average relative improvement in macro F1 over pure-MLP baselines across eight public datasets, outperforms standalone KAN and MLP models, and can be integrated into other SOTA HAR architectures to improve their performance.

Significance. If the reported gains prove robust to hyperparameter matching and statistical controls, the work would demonstrate a practical way to combine KAN precision with MLP noise tolerance in real-world sensor data, potentially guiding hybrid designs for other noisy, high-dimensional tasks in wearable sensing.

major comments (2)

[Abstract] The central claim of a 5.33% average macro-F1 relative improvement is presented without per-dataset breakdowns, error bars, or statistical significance tests, which is load-bearing for the assertion that the specific KAN-MLP placement and LarctanKAN choice are the causal drivers rather than incidental factors.
[Abstract] The description of systematic architecture exploration does not reference ablation tables or controls that isolate the effect of KAN input embedding plus LarctanKAN classifier from differences in hyperparameter search budget or search space applied to the pure-MLP baselines.

minor comments (1)

[Abstract] The phrase "compared pure-MLP model" is missing the preposition "to".

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on the abstract. We address each comment below and will revise the manuscript to strengthen the presentation of results and controls.

Referee: [Abstract] The central claim of a 5.33% average macro-F1 relative improvement is presented without per-dataset breakdowns, error bars, or statistical significance tests, which is load-bearing for the assertion that the specific KAN-MLP placement and LarctanKAN choice are the causal drivers rather than incidental factors.

Authors: The manuscript reports per-dataset macro-F1 scores, standard deviations across five random seeds, and paired statistical tests in the results section and supplementary tables. The abstract summarizes the average gain as the primary finding. We will revise the abstract to note the consistency of gains and statistical support, e.g., by adding a parenthetical reference to the detailed tables.

revision: yes

Referee: [Abstract] The description of systematic architecture exploration does not reference ablation tables or controls that isolate the effect of KAN input embedding plus LarctanKAN classifier from differences in hyperparameter search budget or search space applied to the pure-MLP baselines.

Authors: The full manuscript includes ablation studies that apply identical hyperparameter search budgets and spaces to all model variants. We will revise the abstract to explicitly reference these controlled ablations.

revision: yes

Circularity Check

0 steps flagged

No circularity: empirical performance comparison on public datasets

The paper conducts an empirical investigation of KAN placements in HAR models, reporting macro F1 improvements on eight public datasets without any derivation chain, equations, or self-citations that reduce claims to inputs by construction. The central result is a set of experimental comparisons whose validity rests on dataset benchmarks and baselines rather than tautological definitions or fitted-parameter renamings. No load-bearing self-citation or ansatz smuggling is present.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 1 invented entity

The paper rests on standard assumptions about neural network training and the relative noise tolerance of KAN versus MLP layers; the only invented component is the LarctanKAN module whose independent evidence is limited to the reported performance numbers.

axioms (1)

domain assumption: KANs excel on clean low-dimensional data but degrade on noisy real-world sensor streams, while MLPs are more robust and efficient.
Explicitly stated in the opening sentences of the abstract as the motivation for hybrid design.

invented entities (1)

LarctanKAN module (no independent evidence)
purpose: Specialized final classification layer that combines KAN precision with activity-label output
Introduced as a custom component in the hybrid architecture; no external validation or theoretical derivation supplied beyond the empirical gains.

pith-pipeline@v0.9.1-grok · 5815 in / 1487 out tokens · 26944 ms · 2026-06-30T18:12:43.811340+00:00 · methodology

[1] [1]

Sidra Abbas, Shtwai Alsubai, Muhammad Ibrar Ul Haque, Gabriel Avelino Sampedro, Ahmad Almadhor, Abdullah Al Hejaili, and Iryna Ivanochko

[2] [2]

Active machine learning for heterogeneity activity recognition through smartwatch sensors.IEEE Access12 (2024), 22595–22607

2024

[3] [3]

Reem Abdel-Salam, Rana Mostafa, and Mayada Hadhood. 2021. Human activity recognition using wearable sensors: review, challenges, evaluation benchmark. InInternational workshop on deep learning for human activity recognition. Springer, 1–15

2021

[4] [4]

Mohammed Ghaith Altarabichi. 2024. Dropkan: Regularizing kans by masking post-activations.arXiv preprint arXiv:2407.13044(2024)

work page arXiv 2024

[5] [5]

Vladimir I. Arnold. 1963. On Functions of Three Variables.Doklady Akademii Nauk SSSR148 (1963), 9–12

1963

[6] [6]

Vladimir I Arnold. 2009. On functions of three variables.Collected Works: Representations of Functions, Celestial Mechanics and KAM Theory, 1957–1965(2009), 5–8

2009

[7] [7]

Marc Bachlin, Meir Plotnik, Daniel Roggen, Inbal Maidan, Jeffrey M Hausdorff, Nir Giladi, and Gerhard Troster. 2009. Wearable assistant for Parkinson’s disease patients with the freezing of gait symptom.IEEE Transactions on Information Technology in Biomedicine14, 2 (2009), 436–446

2009

[8] [8]

Oresti Banos, Rafael Garcia, Juan A Holgado-Terriza, Miguel Damas, Hector Pomares, Ignacio Rojas, Alejandro Saez, and Claudia Villalonga. 2014. mHealthDroid: A novel framework for agile development of mobile health applications.Ambient Assisted Living and Daily Activities8868, 14 (2014), 91–98

2014

[9] [9]

Sizhen Bian, Mengxi Liu, Bo Zhou, and Paul Lukowicz. 2022. The state-of-the-art sensing techniques in human activity recognition: A survey. Sensors22, 12 (2022), 4596

2022

[10] [10]

Alexander Dylan Bodner, Antonio Santiago Tepsich, Jack Natan Spolski, and Santiago Pourteau. 2024. Convolutional kolmogorov-arnold networks. arXiv preprint arXiv:2406.13155(2024)

work page arXiv 2024

[11] [11]

Zavareh Bozorgasl and Hao Chen. [n. d.]. Wav-kan: Wavelet kolmogorov-arnold networks, 2024.arXiv preprint arXiv:2405.12832([n. d.])

work page arXiv 2024

[12] [12]

Yueyang Cang, Li Shi, et al. 2024. Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision.arXiv preprint arXiv:2411.06727(2024)

work page arXiv 2024

[13] [13]

Blealtan Cao. 2024. An Efficient Implementation of Kolmogorov-Arnold Network. https://github.com/Blealtan/efficient-kan. Accessed: 2025-04-10

2024

[14] [14]

Ricardo Chavarriaga, Hesam Sagha, Alberto Calatroni, Sundara Tejaswi Digumarti, Gerhard Tröster, José del R Millán, and Daniel Roggen. 2013. The Opportunity challenge: A benchmark database for on-body sensor-based activity recognition.Pattern Recognition Letters34, 15 (2013), 2033–2042

2013

[15] [15]

Kaixuan Chen, Dalin Zhang, Lina Yao, Bin Guo, Zhiwen Yu, and Yunhao Liu. 2021. Deep learning for sensor-based human activity recognition: Overview, challenges, and opportunities.ACM Computing Surveys (CSUR)54, 4 (2021), 1–40

2021

[16]

Zhijie Chen and Xinglin Zhang. 2024. Larctan-skan: Simple and efficient single-parameterized kolmogorov-arnold networks using learnable trigonometric function. arXiv preprint arXiv:2410.19360 (2024)

[17]

Zhijie Chen and Xinglin Zhang. 2024. Lss-skan: Efficient kolmogorov-arnold networks based on single-parameterized function. arXiv preprint arXiv:2410.14951 (2024)

[18]

Iveta Dirgová Luptáková, Martin Kubovčík, and Jiří Pospíchal. 2022. Wearable sensor-based human activity recognition with transformer model. Sensors 22, 5 (2022), 1911

[19]

Ivan Drokin. 2024. Kolmogorov-arnold convolutions: Design principles and empirical studies. arXiv preprint arXiv:2407.01092 (2024)

[20]

Yu Enokibori. 2024. rTsfNet: a DNN model with Multi-head 3D Rotation and Time Series Feature Extraction for IMU-based Human Activity Recognition. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8, 4 (2024), 1–26

[21]

Changjun Fan and Fei Gao. 2021. Enhanced human activity recognition using wearable sensors via a hybrid feature selection method. Sensors 21, 19 (2021), 6434

[22]

Remi Genet and Hugo Inzirillo. 2024. Tkan: Temporal kolmogorov-arnold networks. arXiv preprint arXiv:2405.07344 (2024)

[23]

Ahmed Dawod Mohammed Ibrahum, Zhengyu Shang, and Jang-Eui Hong. 2024. How Resilient Are Kolmogorov–Arnold Networks in Classification Tasks? A Robustness Investigation. Applied Sciences 14, 22 (2024), 10173

[24]

Petr Ivashkov, Po-Wei Huang, Kelvin Koor, Lirandë Pira, and Patrick Rebentrost. 2026. QKAN: quantum Kolmogorov-Arnold networks with applications in machine learning and multivariate state preparation. npj Quantum Information 12, 1 (11 Mar 2026), 73. doi:10.1038/s41534-026-01202-5

[25]

Ali Jamali, Swalpa Kumar Roy, Danfeng Hong, Bing Lu, and Pedram Ghamisi. 2024. How to learn more? Exploring Kolmogorov–Arnold networks for hyperspectral image classification. Remote Sensing 16, 21 (2024), 4015

[26]

Zanobya N Khan and Jamil Ahmad. 2021. Attention induced multi-head convolutional neural network for human activity recognition. Applied soft computing 110 (2021), 107671

[27]

Benjamin C Koenig, Suyong Kim, and Sili Deng. 2024. KAN-ODEs: Kolmogorov–Arnold network ordinary differential equations for learning dynamical systems and hidden physics. Computer Methods in Applied Mechanics and Engineering 432 (2024), 117397

[28]

Andrei Nikolaevich Kolmogorov. 1957. On the representations of continuous functions of many variables by superposition of continuous functions of one variable and addition. In Dokl. Akad. Nauk USSR, Vol. 114. 953–956

[29]

Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima, et al. 2024. Exploring the limitations of kolmogorov-arnold networks in classification: Insights to software training and hardware implementation. In 2024 Twelfth International Symposium on Computing and Networking Workshops (CANDARW). IE...

[30]

Ziyao Li. 2024. Kolmogorov-Arnold Networks are Radial Basis Function Networks. (2024). arXiv:2405.06721 [cs.LG]

[31]

Hanxiao Liu, Zihang Dai, David So, and Quoc V Le. 2021. Pay attention to mlps. Advances in neural information processing systems 34 (2021), 9204–9215

[32]

Mengxi Liu, Daniel Geißler, Dominique Nshimyimana, Sizhen Bian, Bo Zhou, and Paul Lukowicz. 2024. Initial investigation of kolmogorov-arnold networks (kans) as feature extractors for imu based human activity recognition. In Companion of the 2024 on ACM International Joint Conference on Pervasive and Ubiquitous Computing. 500–506

[33]

Ziming Liu, Pingchuan Ma, Yixuan Wang, Wojciech Matusik, and Max Tegmark. 2024. Kan 2.0: Kolmogorov-arnold networks meet science. arXiv preprint arXiv:2408.10205 (2024)

[34]

Ziming Liu, Yixuan Wang, Sachin Vaidya, Fabian Ruehle, James Halverson, Marin Soljačić, Thomas Y Hou, and Max Tegmark. 2024. Kan: Kolmogorov-arnold networks. arXiv preprint arXiv:2404.19756 (2024)

[35]

Mohammad Malekzadeh, Richard Clegg, Andrea Cavallaro, and Hamed Haddadi. 2021. Dana: Dimension-adaptive neural architecture for multivariate sensor data. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 5, 3 (2021), 1–27

[36]

Takeru Miyoshi, Makoto Koshino, and Hidetaka Nambo. 2025. Applying MLP-Mixer and gMLP to Human Activity Recognition. Sensors 25, 2 (2025), 311

[37]

Kamsiriochukwu Ojiako and Katayoun Farrahi. 2023. MLPs Are All You Need for Human Activity Recognition. Applied Sciences 13, 20 (2023), 11154

[38]

Francisco J. Ordóñez and Daniel Roggen. 2016. Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition. Sensors 16, 1 (2016), 115

[39]

Allan Pinkus. 1999. Approximation theory of the MLP model in neural networks. Acta numerica 8 (1999), 143–195

[40]

Eleonora Poeta, Flavio Giobergia, Eliana Pastor, Tania Cerquitelli, and Elena Baralis. 2024. A benchmarking study of kolmogorov-arnold networks on tabular data. In 2024 IEEE 18th International Conference on Application of Information and Communication Technologies (AICT). IEEE, 1–6

[41]

Attila Reiss and Didier Stricker. 2012. Introducing a new benchmarked dataset for activity monitoring. In 2012 16th international symposium on wearable computers. IEEE, 108–109

[42]

Jorge L. Reyes-Ortiz, Davide Anguita, Alessandro Ghio, Luca Oneto, and Xavier Parra. 2013. Human Activity Recognition Using Smartphones. doi:10.24432/C54S4K

[43]

Yoli Shavit and Itzik Klein. 2021. Boosting inertial-based human activity recognition with transformers. IEEE Access 9 (2021), 53540–53547

[44]

Haoran Shen, Chen Zeng, Jiahui Wang, and Qiao Wang. 2025. Reduced effectiveness of kolmogorov-arnold networks on functions with noise. In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1–5

[45]

Shriyank Somvanshi, Syed Aaqib Javed, Md Monzurul Islam, Diwas Pandit, and Subasish Das. 2024. A survey on kolmogorov-arnold network. arXiv preprint arXiv:2411.06078 (2024)

[46]

Yueyuan Sui, Minghui Zhao, Junxi Xia, Xiaofan Jiang, and Stephen Xia. 2024. Tramba: A hybrid transformer and mamba architecture for practical audio and bone conduction speech super resolution and enhancement on mobile and wearable platforms. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 8, 4 (2024), 1–29

[47]

Ilya O Tolstikhin, Neil Houlsby, Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Andreas Steiner, Daniel Keysers, Jakob Uszkoreit, et al. 2021. Mlp-mixer: An all-mlp architecture for vision. Advances in neural information processing systems 34 (2021), 24261–24272

[48]

Juan Diego Toscano, Vivek Oommen, Alan John Varghese, Zongren Zou, Nazanin Ahmadi Daryakenari, Chenxi Wu, and George Em Karniadakis. 2025. From pinns to pikans: Recent advances in physics-informed machine learning. Machine Learning for Computational Science and Engineering 1, 1 (2025), 1–43

[50]

Yu-Hsuan Tseng and Chih-Yu Wen. 2023. Hybrid Learning Models for IMU-Based HAR with Feature Analysis and Data Correction. Sensors 23, 18 (2023), 7802

[51]

Yannick Werner, Akash Malemath, Mengxi Liu, Vitor Fortes Rey, Nikolaos Palaiodimopoulos, Paul Lukowicz, and Maximilian Kiefer-Emmanouilidis. 2025. QuKAN: A Quantum Circuit Born Machine Approach to Quantum Kolmogorov Arnold Networks. Scientific Reports 15, 1 (09 Oct 2025), 35239. doi:10.1038/s41598-025-22705-9

[53]

Jinfeng Xu, Zheyu Chen, Jinze Li, Shuo Yang, Wei Wang, Xiping Hu, and Edith C-H Ngai. 2024. FourierKAN-GCF: Fourier Kolmogorov-Arnold Network–An Effective and Efficient Feature Transformation for Graph Collaborative Filtering. arXiv preprint arXiv:2406.01034 (2024)

[54]

Xingyi Yang and Xinchao Wang. 2024. Kolmogorov-arnold transformer. arXiv preprint arXiv:2409.10594 (2024)

[55]

Yafeng Yin, Lei Xie, Zhiwei Jiang, Fu Xiao, Jiannong Cao, and Sanglu Lu. 2024. A systematic review of human activity recognition based on mobile devices: overview, progress and trends. IEEE Communications Surveys & Tutorials 26, 2 (2024), 890–929

[56]

Piero Zappi, Clemens Lombriser, Thomas Stiefmeier, Elisabetta Farella, Daniel Roggen, Luca Benini, and Gerhard Tröster. 2008. Activity recognition from on-body sensors: accuracy-power trade-off by dynamic sensor selection. In Wireless Sensor Networks: 5th European Conference, EWSN 2008, Bologna, Italy, January 30-February 1, 2008. Proceedings. Springer, 17–33

[57]

Licheng Zhang, Xihong Wu, and Dingsheng Luo. 2015. Recognizing human activities from raw accelerometer data using deep neural networks. In 2015 IEEE 14th International conference on machine learning and applications (ICMLA). IEEE, 865–870

[58]

Ye Zhang, Longguang Wang, Huiling Chen, Aosheng Tian, Shilin Zhou, and Yulan Guo. 2022. IF-ConvTransformer: A framework for human activity recognition using IMU fusion and ConvTransformer. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 6, 2 (2022), 1–26

[59]

Yexu Zhou, Tobias King, Haibin Zhao, Yiran Huang, Till Riedel, and Michael Beigl. 2024. MLP-HAR: Boosting Performance and Efficiency of HAR Models on Edge Devices with Purely Fully Connected Layers. In Proceedings of the 2024 ACM International Symposium on Wearable Computers. 133–139

[60]

Yexu Zhou, Haibin Zhao, Yiran Huang, Till Riedel, Michael Hefenbrock, and Michael Beigl. 2022. Tinyhar: A lightweight deep learning model designed for human activity recognition. In Proceedings of the 2022 ACM International Symposium on Wearable Computers. 89–93

Received 20 February 2007; revised 12 March 2009; accepted 5 June 2009