pith. sign in

arxiv: 1902.02880 · v1 · pith:AEFFHPXInew · submitted 2019-02-07 · 💻 cs.LG · cond-mat.dis-nn· cond-mat.stat-mech· stat.ML

Mean Field Limit of the Learning Dynamics of Multilayer Neural Networks

classification 💻 cs.LG cond-mat.dis-nncond-mat.stat-mechstat.ML
keywords networksbehaviorcomplexdynamicsmultilayerneuralneuronsnumber
0
0 comments X
read the original abstract

Can multilayer neural networks -- typically constructed as highly complex structures with many nonlinearly activated neurons across layers -- behave in a non-trivial way that yet simplifies away a major part of their complexities? In this work, we uncover a phenomenon in which the behavior of these complex networks -- under suitable scalings and stochastic gradient descent dynamics -- becomes independent of the number of neurons as this number grows sufficiently large. We develop a formalism in which this many-neurons limiting behavior is captured by a set of equations, thereby exposing a previously unknown operating regime of these networks. While the current pursuit is mathematically non-rigorous, it is complemented with several experiments that validate the existence of this behavior.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.