Title resolution pending

· 2025 · arXiv 2312.11737

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Stochastic Scaling Limits and Synchronization by Noise in Deep Transformer Models

math.PR · 2026-04-29 · unverdicted · novelty 7.0

Transformers converge pathwise to a stochastic particle system and SPDE in the scaling limit, exhibiting synchronization by noise and exponential energy dissipation when common noise is coercive relative to self-attention drift.

Posterior Bayesian Neural Networks with Dependent Weights

stat.ML · 2025-07-29 · unverdicted · novelty 7.0

In the wide-width limit under Gaussian likelihood, the posterior of the network output is identified when the random covariance matrix is positive definite, with mild conditions ensuring invertibility and order-independent sequential limits.

Universality in Deep Neural Networks: An approach via the Lindeberg exchange principle

math.PR · 2026-05-04 · unverdicted · novelty 6.0

Quantitative 2-Wasserstein bounds are established between finite-width deep neural networks and their infinite-width Gaussian limits using a Lindeberg principle for successive Gaussian replacement of weights.

citing papers explorer

Showing 3 of 3 citing papers.

Stochastic Scaling Limits and Synchronization by Noise in Deep Transformer Models math.PR · 2026-04-29 · unverdicted · none · ref 51
Transformers converge pathwise to a stochastic particle system and SPDE in the scaling limit, exhibiting synchronization by noise and exponential energy dissipation when common noise is coercive relative to self-attention drift.
Posterior Bayesian Neural Networks with Dependent Weights stat.ML · 2025-07-29 · unverdicted · none · ref 35
In the wide-width limit under Gaussian likelihood, the posterior of the network output is identified when the random covariance matrix is positive definite, with mild conditions ensuring invertibility and order-independent sequential limits.
Universality in Deep Neural Networks: An approach via the Lindeberg exchange principle math.PR · 2026-05-04 · unverdicted · none · ref 16
Quantitative 2-Wasserstein bounds are established between finite-width deep neural networks and their infinite-width Gaussian limits using a Lindeberg principle for successive Gaussian replacement of weights.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer