Deep linear network training dynamics from random initialization: Data, width, depth, and hyperparameter transfer

Blake Bordelon, Cengiz Pehlevan · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

Explicit scaling prescriptions for hyperparameters in DenseAMs are derived from model dynamics and shown to match empirical results across scales.

Showing 1 of 1 citing paper.

Hyperparameter Transfer for Dense Associative Memories cs.LG · 2026-05-11 · unverdicted · none · ref 15
Explicit scaling prescriptions for hyperparameters in DenseAMs are derived from model dynamics and shown to match empirical results across scales.