pith. sign in

arxiv: 2410.03801 · v5 · submitted 2024-10-04 · 💻 cs.LG · cs.NE· stat.ML

P1-KAN: an effective Kolmogorov-Arnold network with application to hydraulic valley optimization

Pith reviewed 2026-05-23 19:53 UTC · model grok-4.3

classification 💻 cs.LG cs.NEstat.ML
keywords Kolmogorov-Arnold networkP1-KANfunction approximationerror boundsuniversal approximationhydraulic optimizationneural networksmachine learning
0
0 comments X

The pith

P1-KAN approximates high-dimensional irregular functions with error bounds and outperforms other KANs and MLPs in accuracy and speed.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper proposes P1-KAN, a new Kolmogorov-Arnold network for approximating potentially irregular functions in high dimensions. It derives error bounds under the assumption that the Kolmogorov-Arnold expansion functions are sufficiently smooth and establishes universal approximation theorems when the target function is only continuous. The network shows better accuracy and faster convergence than multilayer perceptrons. On irregular functions it outperforms several other KAN variants while achieving similar accuracy to the original spline-based KAN on smooth functions. The approach is demonstrated on the optimization of a French hydraulic valley.

Core claim

P1-KAN is an effective Kolmogorov-Arnold network that provides approximation error bounds assuming smooth expansion functions and universal approximation theorems for continuous functions. It demonstrates superior accuracy and convergence speed over multilayer perceptrons, outperforms competing KAN networks on irregular functions, and matches the original KAN on smooth functions, with an application to hydraulic valley optimization.

What carries the argument

P1-KAN architecture, a Kolmogorov-Arnold network variant that uses specific basis expansions to handle irregular high-dimensional functions.

If this is right

  • Hydraulic valley optimization yields improved results through more accurate function approximation.
  • KANs become applicable to a wider range of non-smooth high-dimensional problems.
  • Training converges faster than multilayer perceptrons while reaching comparable or better accuracy.
  • Similar performance to the original spline KAN is retained on smooth target functions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The architecture may extend to other engineering optimization tasks that involve irregular objective landscapes.
  • Combining P1-KAN with existing networks could address datasets containing both smooth and irregular regions.
  • If the smoothness assumption is verified in applications, the bounds could support more reliable error estimates in deployed models.

Load-bearing premise

The error bounds assume that the Kolmogorov-Arnold expansion functions are sufficiently smooth.

What would settle it

An irregular test function where P1-KAN fails to exceed the accuracy of the compared KAN networks or where measured errors violate the stated bounds.

Figures

Figures reproduced from arXiv: 2410.03801 by Xavier Warin.

Figure 1
Figure 1. Figure 1: Uniform P1 basis functions on [0, 1] with P = 5. where (a l,k,j p )p=0,P are trainable variables and (Ψl,j p )p=0,P is the basis of the shape function Ψl,j p with compact support in each interval [ˆx l j,p−1 , xˆ l j,p+1] for p = 1, . . . , P − 1 and defined as: Ψ l,j p (x) =    x−xˆ l j,p−1 xˆ l j,p−xˆ l j,p−1 x ∈ [ˆx l j,p−1 , xˆ l j,p] xˆ l j,p+1−x xˆ l j,p+1−xˆ l j,p x ∈ [ˆx l j,p, xˆ l j,p+1] suc… view at source ↗
Figure 2
Figure 2. Figure 2: Reference function and approximation with or without adaptation. Vertices on the x-axis. P Error without adaptation Error with adaptation 5 0.1545 0.00492 10 0.0737 3.55E-4 20 4.7E-4 5.22E-5 [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Function A in 2D [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: To approximate a function f with a neural network κ, we use the classical quadratic loss function defined as: L = E[(f(X) − κ(X))2 ], where X is a uniform random variable on [0, 1]n. Using a stochastic gradient algorithm with the ADAM optimizer, a learning rate of 10−3 , and a batch size of 1000, we minimize the loss L. The MLPs use a ReLU activation function, with either 2 layers with 10, 20, 40 neurons f… view at source ↗
Figure 5
Figure 5. Figure 5: Results in dimension 6 for function A [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Results in dimension 12 for function A. As these graphs are obtained with one run, we provide in [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Results in dimension 2 for function B [PITH_FULL_IMAGE:figures/full_fig_p013_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Results in dimension 5 for function B Results in [PITH_FULL_IMAGE:figures/full_fig_p014_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Structure of the valley [PITH_FULL_IMAGE:figures/full_fig_p018_9.png] view at source ↗
read the original abstract

A new Kolmogorov-Arnold network (KAN) is proposed to approximate potentially irregular functions in high dimensions. We provide error bounds for this approximation, assuming that the Kolmogorov-Arnold expansion functions are sufficiently smooth. When the function is only continuous, we also provide universal approximation theorems. We show that it outperforms multilayer perceptrons in terms of accuracy and convergence speed. We also compare it with several proposed KAN networks: it outperforms all networks for irregular functions and achieves similar accuracy to the original spline-based KAN network for smooth functions. Finally, we compare some of the KAN networks in optimizing a French hydraulic valley.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes P1-KAN, a Kolmogorov-Arnold network variant for high-dimensional approximation of potentially irregular functions. It derives approximation error bounds under the assumption that the Kolmogorov-Arnold expansion functions are sufficiently smooth, provides universal approximation theorems when the target is merely continuous, and reports empirical results claiming outperformance over MLPs in accuracy and convergence speed, outperformance over other KAN variants on irregular functions, and comparable accuracy to the original spline-based KAN on smooth functions. The work concludes with an application to optimization of a French hydraulic valley.

Significance. If the empirical comparisons prove robust, P1-KAN could supply a practical architecture for high-dimensional function approximation in engineering contexts. The combination of explicit error bounds (under smoothness) and a UAT for the continuous case is a constructive element; the hydraulic-valley application supplies a concrete, falsifiable use case.

major comments (2)
  1. [Theory / error bounds derivation] Theory section on error bounds: the stated bounds are derived only under the explicit assumption that the Kolmogorov-Arnold expansion functions are sufficiently smooth. The central empirical claim, however, concerns superior performance on irregular (likely non-smooth) functions, for which the smoothness hypothesis is violated and only the rate-free UAT remains. This leaves the theoretical rationale for the headline performance advantage on irregular targets unsupported by quantitative rates.
  2. [Experiments / comparison tables] Experimental section (comparisons with other KANs and MLPs): the manuscript does not supply sufficient detail on parameter-count matching, optimizer hyper-parameters, or basis-function choices to allow independent verification that the reported accuracy and convergence advantages on irregular functions are not artifacts of unequal experimental conditions.
minor comments (1)
  1. [Application / hydraulic valley] In the hydraulic-valley optimization subsection, confirm that all reported objective values include the number of independent runs and standard deviations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed review. We address each major comment below and indicate the revisions we will make.

read point-by-point responses
  1. Referee: [Theory / error bounds derivation] Theory section on error bounds: the stated bounds are derived only under the explicit assumption that the Kolmogorov-Arnold expansion functions are sufficiently smooth. The central empirical claim, however, concerns superior performance on irregular (likely non-smooth) functions, for which the smoothness hypothesis is violated and only the rate-free UAT remains. This leaves the theoretical rationale for the headline performance advantage on irregular targets unsupported by quantitative rates.

    Authors: We agree that the quantitative error bounds require the smoothness assumption on the expansion functions and therefore do not apply to irregular targets. The manuscript provides a rate-free universal approximation theorem for the continuous case. The reported advantages on irregular functions are empirical. We will revise the theory and discussion sections to explicitly separate the smooth-case bounds from the continuous-case UAT and to avoid any implication that quantitative rates are available for non-smooth targets. revision: partial

  2. Referee: [Experiments / comparison tables] Experimental section (comparisons with other KANs and MLPs): the manuscript does not supply sufficient detail on parameter-count matching, optimizer hyper-parameters, or basis-function choices to allow independent verification that the reported accuracy and convergence advantages on irregular functions are not artifacts of unequal experimental conditions.

    Authors: We acknowledge that the current experimental description lacks the level of detail needed for independent verification. In the revised manuscript we will add explicit tables or text specifying how parameter counts were matched across architectures, the full set of optimizer hyperparameters and training protocols, and the precise basis-function choices used for each KAN variant. revision: yes

Circularity Check

0 steps flagged

Derivation chain is self-contained; no reductions to inputs by construction

full rationale

The paper states error bounds under an explicit smoothness assumption on Kolmogorov-Arnold expansion functions and supplies universal approximation theorems when the target is merely continuous. Performance claims rest on direct empirical comparisons against external baselines (MLPs and other published KAN variants) rather than any fitted parameter renamed as a prediction or any self-citation chain. No self-definitional, fitted-input, or load-bearing self-citation patterns appear; the derivation therefore remains independent of its own outputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

Abstract-only review prevents exhaustive ledger; the central claims rest on standard approximation-theory assumptions plus the new network definition itself.

axioms (1)
  • domain assumption Kolmogorov-Arnold expansion functions are sufficiently smooth for the error bounds to hold
    Explicitly invoked in the abstract when stating the error bounds.
invented entities (1)
  • P1-KAN network architecture no independent evidence
    purpose: Approximate irregular high-dimensional functions with improved accuracy and speed
    New proposed architecture whose definition and performance are the paper's contribution; no independent evidence outside the paper is described.

pith-pipeline@v0.9.0 · 5628 in / 1249 out tokens · 15805 ms · 2026-05-23T19:53:49.500290+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Hyper Input Convex Neural Networks for Shape Constrained Learning and Optimal Transport

    cs.LG 2026-04 unverdicted novelty 7.0

    HyCNNs are a new architecture that learns convex functions with exponentially fewer parameters than ICNNs and outperforms them in convex regression and high-dimensional optimal transport on synthetic and single-cell RNA data.

Reference graph

Works this paper leans on

30 extracted references · 30 canonical work pages · cited by 1 Pith paper · 2 internal anchors

  1. [1]

    Deepokan: Deep operator network based on kolmogorov arnold networks for mechanics problems

    Diab W Abueidda, Panos Pantidis, and Mostafa E Mobasher. Deepokan: Deep operator network based on kolmogorov arnold networks for mechanics problems. arXiv preprint arXiv:2405.19143 , 2024

  2. [2]

    Dynamic programming and stochastic control processes

    Richard Bellman. Dynamic programming and stochastic control processes. Information and control, 1(3):228–239, 1958

  3. [3]

    Convolutional kolmogorov- arnold networks

    Alexander Dylan Bodner, Antonio Santiago Tepsich, Jack Natan Spolski, and Santiago Pourteau. Convolutional kolmogorov- arnold networks. arXiv preprint arXiv:2406.13155 , 2024

  4. [4]

    Bozorgasl and H

    Zavareh Bozorgasl and Hao Chen. Wav-kan: Wavelet kolmogorov-arnold networks. arXiv preprint arXiv:2405.12832, 2024

  5. [5]

    B-splines

    Arindam Chaudhuri. B-splines. arXiv preprint arXiv:2108.06617 , 2021

  6. [6]

    Demonstrating the efficacy of kolmogorov-arnold networks in vision tasks.arXiv preprint arXiv:2406.14916, 2024

    Minjong Cheon. Demonstrating the efficacy of kolmogorov-arnold networks in vision tasks.arXiv preprint arXiv:2406.14916, 2024

  7. [7]

    Tkan: Temporal kolmogorov-arnold networks

    Remi Genet and Hugo Inzirillo. Tkan: Temporal kolmogorov-arnold networks. arXiv preprint arXiv:2405.07344 , 2024

  8. [8]

    arXiv:2101.08068 (2021)

    Maximilien Germain, Huyˆ en Pham, Xavier Warin, et al. Neural networks-based algorithms for stochastic control and pdes in finance. arXiv preprint arXiv:2101.08068 , 2021

  9. [9]

    STochastic OPTimization library in C++

    Hugo Gevret, Nicolas Langren´ e, Jerome Lelong, Rafael D Lobato, Thomas Ouillon, Xavier Warin, and Aditya Maheshwari. STochastic OPTimization library in C++ . PhD thesis, EDF Lab, 2018

  10. [10]

    Representation properties of networks: Kolmogorov’s theorem is irrelevant

    Federico Girosi and Tomaso Poggio. Representation properties of networks: Kolmogorov’s theorem is irrelevant. Neural Computation, 1(4):465–469, 1989

  11. [11]

    Gaussian Error Linear Units (GELUs)

    Dan Hendrycks and Kevin Gimpel. Gaussian error linear units (gelus). arXiv preprint arXiv:1606.08415 , 2016

  12. [12]

    Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks

    Kurt Hornik, Maxwell Stinchcombe, and Halbert White. Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural networks, 3(5):551–560, 1990

  13. [13]

    Coxkan: Kolmogorov-arnold networks for interpretable, high-performance survival analysis

    William Knottenbelt, Zeyu Gao, Rebecca Wray, Woody Zhidong Zhang, Jiashuai Liu, and Mireia Crispin-Ortuzar. Coxkan: Kolmogorov-arnold networks for interpretable, high-performance survival analysis. arXiv preprint arXiv:2409.04290, 2024

  14. [14]

    On the representation of continuous functions of several variables by superpositions of continuous functions of a smaller number of variables

    Andrei Nikolaevich Kolmogorov. On the representation of continuous functions of several variables by superpositions of continuous functions of a smaller number of variables . American Mathematical Society, 1961

  15. [15]

    Exploring the limitations of kolmogorov-arnold networks in classification: Insights to software training and hardware implementation

    Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima, et al. Exploring the limitations of kolmogorov-arnold networks in classification: Insights to software training and hardware implementation. arXiv preprint arXiv:2407.17790 , 2024

  16. [16]

    U-kan makes strong backbone for medical image segmentation and generation

    Chenxin Li, Xinyu Liu, Wuyang Li, Cheng Wang, Hengyu Liu, and Yixuan Yuan. U-kan makes strong backbone for medical image segmentation and generation. arXiv preprint arXiv:2406.02918 , 2024

  17. [17]

    Kolmogorov-arnold networks are radial basis function networks

    Ziyao Li. Kolmogorov-arnold networks are radial basis function networks. 2024

  18. [18]

    KAN: Kolmogorov-Arnold Networks

    Ziming Liu, Yixuan Wang, Sachin Vaidya, Fabian Ruehle, James Halverson, Marin Soljaˇ ci´ c, Thomas Y Hou, and Max Tegmark. Kan: Kolmogorov-arnold networks. arXiv preprint arXiv:2404.19756 , 2024

  19. [19]

    Theoretical issues in deep networks

    Tomaso Poggio, Andrzej Banburski, and Qianli Liao. Theoretical issues in deep networks. Proceedings of the National Academy of Sciences, 117(48):30039–30045, 2020

  20. [20]

    Relu-kan: New kolmogorov-arnold networks that only need matrix addition, dot multiplication, and relu

    Qi Qiu, Tao Zhu, Helin Gong, Liming Chen, and Huansheng Ning. Relu-kan: New kolmogorov-arnold networks that only need matrix addition, dot multiplication, and relu. arXiv preprint arXiv:2406.02075 , 2024

  21. [21]

    M´ ethodes Num´ eriques: Algorithmes, analyse et applications

    Alfio Maria Quarteroni, Riccardo Sacco, and Fausto Saleri. M´ ethodes Num´ eriques: Algorithmes, analyse et applications. Springer Science & Business Media, 2007

  22. [22]

    Reduced effectiveness of kolmogorov-arnold networks on functions with noise

    Haoran Shen, Chen Zeng, Jiahui Wang, and Qiao Wang. Reduced effectiveness of kolmogorov-arnold networks on functions with noise. arXiv preprint arXiv:2407.14882 , 2024

  23. [23]

    Higher-order-relu-kans (hrkans) for solving physics-informed neural networks (pinns) more accurately, robustly and faster

    Chi Chiu So and Siu Pang Yung. Higher-order-relu-kans (hrkans) for solving physics-informed neural networks (pinns) more accurately, robustly and faster. arXiv preprint arXiv:2409.14248 , 2024

  24. [24]

    Sidharth, A

    Sidharth SS. Chebyshev polynomial-based kolmogorov-arnold networks: An efficient architecture for nonlinear function approximation. arXiv preprint arXiv:2405.07200 , 2024

  25. [25]

    Bsrbf-kan: A combination of b-splines and radial basic functions in kolmogorov-arnold networks

    Hoang-Thang Ta. Bsrbf-kan: A combination of b-splines and radial basic functions in kolmogorov-arnold networks. arXiv preprint arXiv:2406.11173, 2024

  26. [26]

    Kolmogorov-arnold networks (kans) for time series analysis

    Cristian J Vaca-Rubio, Luis Blanco, Roberto Pereira, and M` arius Caus. Kolmogorov-arnold networks (kans) for time series analysis. arXiv preprint arXiv:2405.08790 , 2024

  27. [27]

    Reservoir optimization and machine learning methods

    Xavier Warin. Reservoir optimization and machine learning methods. EURO Journal on Computational Optimization , 11:100068, 2023

  28. [28]

    Fourierkan-gcf: Fourier kolmogorov-arnold network–an effective and efficient feature transformation for graph collaborative filtering.arXiv preprint arXiv:2406.01034, 2024

    Jinfeng Xu, Zheyu Chen, Jinze Li, Shuo Yang, Wei Wang, Xiping Hu, and Edith C-H Ngai. Fourierkan-gcf: Fourier kolmogorov-arnold network–an effective and efficient feature transformation for graph collaborative filtering.arXiv preprint arXiv:2406.01034, 2024

  29. [29]

    Kolmogorov-arnold transformer,

    Xingyi Yang and Xinchao Wang. Kolmogorov-arnold transformer. arXiv preprint arXiv:2409.10594 , 2024

  30. [30]

    Kan or mlp: A fairer comparison, 2024

    Runpeng Yu, Weihao Yu, and Xinchao Wang. Kan or mlp: A fairer comparison. arXiv preprint arXiv:2407.16674 , 2024. P1-KAN: AN EFFECTIVE KOLMOGOROV-ARNOLD NETWORK WITH APPLICATION TO HYDRAULIC VALLEY OPTIMIZATION 21 Xavier Warin, EDF Lab Paris-Saclay and FiMe, Laboratoire de Finance des March´es de l’Energie, 91120 Palaiseau, France Email address: xavier.wa...