arXiv preprint arXiv:2603.13430 , year=

Dynamic Sparse Attention: Access Patterns, Architecture , author= · arXiv 2603.13430

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Predict, Reuse, and Repair: Accelerating Dynamic Sparse Attention for Long-Context LLM Decoding

cs.LG · 2026-06-29 · conditional · novelty 5.0

PRR accelerates dynamic sparse attention decoding in long-context LLMs via EMA-based prediction, speculative attention, and FlashAttention repair, achieving up to 40% latency reduction.

citing papers explorer

Showing 1 of 1 citing paper.

Predict, Reuse, and Repair: Accelerating Dynamic Sparse Attention for Long-Context LLM Decoding cs.LG · 2026-06-29 · conditional · none · ref 41
PRR accelerates dynamic sparse attention decoding in long-context LLMs via EMA-based prediction, speculative attention, and FlashAttention repair, achieving up to 40% latency reduction.

arXiv preprint arXiv:2603.13430 , year=

fields

years

verdicts

representative citing papers

citing papers explorer