Language model features form an early stable carrier scaffold of about 50 sparse features that is load-bearing, predictable from onset firing, and recruits most later features.
Predicting the formation of induc- tion heads.arXiv preprint arXiv:2511.16893, 2025
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
q-bio.NC 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Features have life history. And we should care
Language model features form an early stable carrier scaffold of about 50 sparse features that is load-bearing, predictable from onset firing, and recruits most later features.