arxiv: 2605.03109 · 2 revisions
Gated Subspace Inference for Transformer Acceleration