Two-Point Deterministic Equivalence for Stochastic Gradient Dynamics in Linear Models

· 2025 · cond-mat.dis-nn · arXiv 2502.05074

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We derive a novel deterministic equivalence for the two-point function of a random matrix resolvent. Using this result, we give a unified derivation of the performance of a wide variety of high-dimensional linear models trained with stochastic gradient descent. This includes high-dimensional linear regression, kernel regression, and linear random feature models. Our results include previously known asymptotics as well as novel ones.

representative citing papers

An Asymptotic Theory of Chain-of-Thought in In-Context Learning

stat.ML · 2026-06-02 · unverdicted · novelty 6.0

Exact RMT-derived formula for CoT generalization error in linear ICL reveals phase transition between exponential/polynomial improvement, saturation, and overthinking regimes depending on depth, pretraining, and context length.

citing papers explorer

Showing 1 of 1 citing paper.

An Asymptotic Theory of Chain-of-Thought in In-Context Learning stat.ML · 2026-06-02 · unverdicted · none · ref 37 · internal anchor
Exact RMT-derived formula for CoT generalization error in linear ICL reveals phase transition between exponential/polynomial improvement, saturation, and overthinking regimes depending on depth, pretraining, and context length.

Two-Point Deterministic Equivalence for Stochastic Gradient Dynamics in Linear Models

fields

years

verdicts

representative citing papers

citing papers explorer