← back to paper
arxiv: 2605.00768 · 2 revisions
Characterizing the Expressivity of Local Attention in Transformers