Quantitative Gaussian approximation of randomly initialized deep neural networks. Machine Learning, 113(9):6373–6393, Sep 2024

Andrea Basteri, Dario Trevisan · 2024

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Stochastic Scaling Limits and Synchronization by Noise in Deep Transformer Models

math.PR · 2026-04-29 · unverdicted · novelty 7.0

Transformers converge pathwise to a stochastic particle system and SPDE in the scaling limit, exhibiting synchronization by noise and exponential energy dissipation when common noise is coercive relative to self-attention drift.

citing papers explorer

Stochastic Scaling Limits and Synchronization by Noise in Deep Transformer Models

math.PR · 2026-04-29 · unverdicted · none · ref 5