
NVIDIA A10 Tensor Core GPU

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.PL 1

years

2025 1

verdicts

CONDITIONAL 1

representative citing papers

Neptune: Advanced ML Operator Fusion for Locality and Parallelism on GPUs

cs.PL · 2025-10-09 · conditional · novelty 6.0

Neptune introduces dependency-breaking fusion with algebraic corrections for reduction sequences, generating FlashAttention-like kernels from plain attention code with 1.35x average speedup across ten benchmarks and four GPU architectures.

citing papers explorer

Showing 1 of 1 citing paper.

  • Neptune: Advanced ML Operator Fusion for Locality and Parallelism on GPUs cs.PL · 2025-10-09 · conditional · none · ref 10
