pith. sign in

arxiv: 1808.09372 · v2 · pith:DVBXKXWZnew · submitted 2018-08-28 · 🧮 math.PR · math.ST· stat.ML· stat.TH

Mean Field Analysis of Neural Networks: A Central Limit Theorem

classification 🧮 math.PR math.STstat.MLstat.TH
keywords limitcentralneuralstochastictheoremanalysisfluctuationshidden
0
0 comments X
read the original abstract

We rigorously prove a central limit theorem for neural network models with a single hidden layer. The central limit theorem is proven in the asymptotic regime of simultaneously (A) large numbers of hidden units and (B) large numbers of stochastic gradient descent training iterations. Our result describes the neural network's fluctuations around its mean-field limit. The fluctuations have a Gaussian distribution and satisfy a stochastic partial differential equation. The proof relies upon weak convergence methods from stochastic analysis. In particular, we prove relative compactness for the sequence of processes and uniqueness of the limiting process in a suitable Sobolev space.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Limit Theorems for Stochastic Gradient Descent in High-Dimensional Single-Layer Networks

    stat.ML 2025-11 unverdicted novelty 5.0

    At the critical step-size scaling for SGD in high-dimensional single-layer networks, effective dynamics gain a diffusive correction term that changes the phase diagram and reduces to an Ornstein-Uhlenbeck process near...