arXiv: 2605.03953 · 2 revisions
Transformers with Selective Access to Early Representations