Qualitatively characterizing neural network optimization problems

Jonathan Frankle · 2020 · arXiv 2012.06898

2 Pith papers cite this work.

representative citing papers

Scalar Representations of Neural Network Training Dynamics

cs.LG · 2026-06-29 · unverdicted · novelty 5.0

Scalar embeddings of neural network training trajectories treated as temporal networks preserve main dynamical features including Lyapunov exponents, enable definition of a characteristic decorrelation time, and show asymptotic state spacings compatible with a skew lognormal distribution.

The Platonic Representation Hypothesis

cs.LG · 2024-05-13 · unverdicted · novelty 5.0

Representations learned by large AI models are converging toward a shared statistical model of reality.

citing papers explorer

Showing 2 of 2 citing papers.

Scalar Representations of Neural Network Training Dynamics cs.LG · 2026-06-29 · unverdicted · none · ref 27
The Platonic Representation Hypothesis cs.LG · 2024-05-13 · unverdicted · none · ref 125
