Non-Deterministic Learning Dynamics in Large Neural Networks due to Structural Data Bias

A.C.C. Coolen; H.C. Rae; J.A.F. Heimel

arxiv: cond-mat/0007232 · v1 · submitted 2000-07-13 · ❄️ cond-mat.dis-nn

Non-Deterministic Learning Dynamics in Large Neural Networks due to Structural Data Bias

H.C. Rae , J.A.F. Heimel , A.C.C. Coolen This is my paper

classification ❄️ cond-mat.dis-nn

keywords learninglargebiasdatadynamicslawsmacroscopicnetworks

0 comments

read the original abstract

We study the dynamics of on-line learning in large perceptrons, for the case of training sets with a structural bias of the input vectors, by deriving exact and closed macroscopic dynamical laws using non-equilibrium statistical mechanical tools. In sharp contrast to the more conventional theories developed for homogeneously distributed or only weakly biased data, these laws are found to describe a non-trivial and persistently non-deterministic macroscopic evolution, and a generalisation error which retains both stochastic and sample-to-sample fluctuations, even for infinitely large networks. Furthermore, for the standard error-correcting microscopic algorithms (such as the perceptron learning rule) one obtains learning curves with distinct bias-induced phases. Our theoretical predictions find excellent confirmation in numerical simulations.

This paper has not been read by Pith yet.

Non-Deterministic Learning Dynamics in Large Neural Networks due to Structural Data Bias

discussion (0)