Results derived using the same models used for Figure 7a

As in Figure 7a, each seed has been shifted along the horizontal axis by the value of $K_1^*$ determined for that seed by the $\phi^{(2)}_\beta$ threshold criteria.


representative citing papers

Distinct mechanisms underlying in-context learning in transformers

cs.LG · 2026-04-14 · unverdicted · novelty 6.0

Transformers develop four algorithmic phases of in-context learning on Markov chains via two distinct multi-layer subcircuit mechanisms, with phase boundaries set by data diversity K.


