pith. sign in

Post-training sparse attention with double sparsity

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 2 method 1

citation-polarity summary

years

2026 5

verdicts

UNVERDICTED 5

representative citing papers

HieraSparse: Hierarchical Semi-Structured Sparse KV Attention

cs.DC · 2026-04-18 · unverdicted · novelty 5.0

HieraSparse delivers a hierarchical semi-structured sparse KV attention system that achieves 1.2x KV compression and 4.57x decode attention speedup versus prior unstructured sparsity methods at equivalent sparsity, plus up to 1.85x prefill speedup and 1.37x/1.77x speedups with magnitude pruning and

SOCKET: SOft Collision Kernel EsTimator for Sparse Attention

cs.LG · 2026-02-06 · unverdicted · novelty 5.0

SOCKET replaces hard LSH bucket matches with soft probabilistic collision aggregation to enable efficient, high-quality token selection for sparse attention, matching or exceeding prior methods with up to 1.5x throughput gains.

citing papers explorer

Showing 5 of 5 citing papers.