How two-layer neural networks learn, one (giant) step at a time.arXiv preprint arXiv:2305.18270,

Yatin Dandi, Florent Krzakala, Bruno Loureiro, Luca Pesce, Ludovic Stephan · arXiv 2305.18270

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

How Do Transformers Learn to Associate Tokens: Gradient Leading Terms Bring Mechanistic Interpretability

cs.CL · 2026-01-27 · unverdicted · novelty 7.0

Transformer weights at early training stages are closed-form compositions of bigram, token-interchangeability, and context mappings that directly reflect text-corpus statistics and explain the emergence of semantic associations.

citing papers explorer

Showing 1 of 1 citing paper.

How Do Transformers Learn to Associate Tokens: Gradient Leading Terms Bring Mechanistic Interpretability cs.CL · 2026-01-27 · unverdicted · none · ref 5
Transformer weights at early training stages are closed-form compositions of bigram, token-interchangeability, and context mappings that directly reflect text-corpus statistics and explain the emergence of semantic associations.

How two-layer neural networks learn, one (giant) step at a time.arXiv preprint arXiv:2305.18270,

fields

years

verdicts

representative citing papers

citing papers explorer