Neural network approximation: Three hidden layers are enough. Neural Networks, 141:160–173
1 Pith paper cites this work. Polarity classification is still being indexed.
fields: cs.LG (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
roles: method (1)
polarities: use method (1)
representative citing papers
Geometric Layer-wise Approximation Rates for Deep Networks
A shared mixed-activation network of width 2dN+d+2 yields layer-wise L^p approximation rates bounded by the modulus of continuity at geometric scale N^{-ℓ}, reducing to (2d+1)N^{-ℓ} for 1-Lipschitz targets.
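
To make the quantities in this summary concrete, the following is a minimal sketch that only evaluates the two stated expressions, the width 2dN+d+2 and the 1-Lipschitz bound (2d+1)N^{-ℓ}. The function names `network_width` and `lipschitz_rate` are hypothetical, and the sketch assumes d is the input dimension, N the width parameter, and ℓ the layer index; it does not construct the network itself.

```python
def network_width(d: int, N: int) -> int:
    """Width 2dN + d + 2 quoted in the summary (d, N assumed to be positive integers)."""
    return 2 * d * N + d + 2


def lipschitz_rate(d: int, N: int, layer: int) -> float:
    """Bound (2d + 1) * N^(-layer) quoted for 1-Lipschitz targets."""
    return (2 * d + 1) * float(N) ** (-layer)


if __name__ == "__main__":
    d, N = 2, 4  # illustrative values, not taken from the paper
    print("width:", network_width(d, N))
    for layer in range(1, 5):
        # The bound shrinks by a factor of N with each additional layer.
        print("layer", layer, "bound", lipschitz_rate(d, N, layer))
```

Under these assumptions the bound decays geometrically in ℓ while the width stays fixed, which is the trade-off the summary describes.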