First population risk bounds for KANs under mini-batch DP-SGD with correlated noise, using a new non-convex optimization analysis combined with stability-based generalization.
Sharper guarantees for learning neural network classifiers with gradient methods
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.LG 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
For two-layer KANs trained with gradient descent under logistic loss and NTK-separable assumption, polylogarithmic width suffices for 1/T optimization and 1/n generalization rates, while differential privacy requires the same width and yields √d/(nε) utility.
citing papers explorer
-
Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise
First population risk bounds for KANs under mini-batch DP-SGD with correlated noise, using a new non-convex optimization analysis combined with stability-based generalization.
-
Optimization, Generalization and Differential Privacy Bounds for Gradient Descent on Kolmogorov-Arnold Networks
For two-layer KANs trained with gradient descent under logistic loss and NTK-separable assumption, polylogarithmic width suffices for 1/T optimization and 1/n generalization rates, while differential privacy requires the same width and yields √d/(nε) utility.