On the loss landscape of a class of deep neural networks with no bad local valleys
classification
💻 cs.LG
cs.AIcs.CVstat.ML
keywords
locallossnetworksclasscross-entropydeepneuralactivation
read the original abstract
We identify a class of over-parameterized deep neural networks with standard activation functions and cross-entropy loss which provably have no bad local valley, in the sense that from any point in parameter space there exists a continuous path on which the cross-entropy loss is non-increasing and gets arbitrarily close to zero. This implies that these networks have no sub-optimal strict local minima.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.