pith. sign in

Training dynamics of in-context learning in linear attention

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

years

2026 6

clear filters

representative citing papers

An Asymptotic Theory of Chain-of-Thought in In-Context Learning

stat.ML · 2026-06-02 · unverdicted · novelty 6.0

Exact RMT-derived formula for CoT generalization error in linear ICL reveals phase transition between exponential/polynomial improvement, saturation, and overthinking regimes depending on depth, pretraining, and context length.

Learning to Adapt: In-Context Learning Beyond Stationarity

cs.LG · 2026-04-13 · unverdicted · novelty 6.0

Gated linear attention enables lower training and test errors in non-stationary in-context learning by adaptively modulating past inputs through a learnable recency bias under an autoregressive model of task evolution.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • An Asymptotic Theory of Chain-of-Thought in In-Context Learning stat.ML · 2026-06-02 · unverdicted · none · ref 27

    Exact RMT-derived formula for CoT generalization error in linear ICL reveals phase transition between exponential/polynomial improvement, saturation, and overthinking regimes depending on depth, pretraining, and context length.