A curvature penalty for KANs, derived to respect compositional effects and equipped with a proven upper bound on full-model curvature, produces smoother activations while preserving accuracy.
Kolmogorov-Arnold Transformer
8 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 2polarities
background 2representative citing papers
A new framework called ERR decomposes UHD image restoration into three frequency stages with specialized sub-networks and introduces the LSUHDIR benchmark dataset of over 82,000 images.
A new tensor framework for multi-layer decoupling of multivariate functions is proposed via ParaTuck decompositions and bilevel optimization.
A hybrid KAN-MLP model for IMU-based human activity recognition achieves 5.33% relative macro F1 improvement over pure MLPs on eight datasets by placing KANs at input embedding and classification stages.
AAND is a two-stage anomaly detection method that advances a pre-trained teacher via residual anomaly amplification and applies hard knowledge distillation in reverse distillation to achieve SOTA results on MVTecAD, VisA, and MVTec3D-RGB.
FMC-DETR proposes a frequency-decoupled fusion framework with WeKat backbone, MDFC coordination, and CPF fusion modules that claims state-of-the-art results on remote sensing object detection benchmarks.
P1-KAN introduces a new KAN architecture with theoretical approximation guarantees that outperforms MLPs and prior KAN variants on irregular functions while matching spline KAN accuracy on smooth ones, demonstrated on hydraulic optimization.
A systematic review of Kolmogorov-Arnold Networks that maps their relation to Kolmogorov superposition theory, MLPs, and kernels, examines basis-function design choices, summarizes performance advances, and supplies a practitioner's selection guide plus open challenges.
citing papers explorer
-
KANs need curvature: penalties for compositional smoothness
A curvature penalty for KANs, derived to respect compositional effects and equipped with a proven upper bound on full-model curvature, produces smoother activations while preserving accuracy.
-
From Zero to Detail: A Progressive Spectral Decoupling Paradigm for UHD Image Restoration with New Benchmark
A new framework called ERR decomposes UHD image restoration into three frequency stages with specialized sub-networks and introduces the LSUHDIR benchmark dataset of over 82,000 images.
-
Tensor-based Multi-layer Decoupling
A new tensor framework for multi-layer decoupling of multivariate functions is proposed via ParaTuck decompositions and bilevel optimization.
-
KAN-MLP-Mixer: A comprehensive investigation of the usage of Kolmogorov-Arnold Networks (KANs) for improving IMU-based Human Activity Recognition
A hybrid KAN-MLP model for IMU-based human activity recognition achieves 5.33% relative macro F1 improvement over pure MLPs on eight datasets by placing KANs at input embedding and classification stages.
-
Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection
AAND is a two-stage anomaly detection method that advances a pre-trained teacher via residual anomaly amplification and applies hard knowledge distillation in reverse distillation to achieve SOTA results on MVTecAD, VisA, and MVTec3D-RGB.
-
FMC-DETR: Frequency-Decoupled Multi-Domain Coordination for Aerial-View Object Detection
FMC-DETR proposes a frequency-decoupled fusion framework with WeKat backbone, MDFC coordination, and CPF fusion modules that claims state-of-the-art results on remote sensing object detection benchmarks.
-
P1-KAN: an effective Kolmogorov-Arnold network with application to hydraulic valley optimization
P1-KAN introduces a new KAN architecture with theoretical approximation guarantees that outperforms MLPs and prior KAN variants on irregular functions while matching spline KAN accuracy on smooth ones, demonstrated on hydraulic optimization.
-
A Practitioner's Guide to Kolmogorov-Arnold Networks
A systematic review of Kolmogorov-Arnold Networks that maps their relation to Kolmogorov superposition theory, MLPs, and kernels, examines basis-function design choices, summarizes performance advances, and supplies a practitioner's selection guide plus open challenges.