
Title resolution pending

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

Accelerating Sparse Transformer Inference on GPU

cs.LG · 2025-06-06 · unverdicted · novelty 5.0

The STOF framework optimizes sparse Transformer inference on GPUs, using analytical kernel mapping for multi-head attention (MHA) and a two-stage search for fusion. It reports speedups of up to 1.6x for MHA and 1.4x end-to-end over prior work.

citing papers explorer

Showing 1 of 1 citing paper.

  • Accelerating Sparse Transformer Inference on GPU cs.LG · 2025-06-06 · unverdicted · none · ref 41
