arxiv: 2605.03110 · 2 revisions
Cascade Token Selection for Transformer Attention Acceleration