Oats: Outlier-aware pruning through sparse and low rank decomposition.arXiv preprint arXiv:2409.13652

Stephen Zhang, Vardan Papyan · 2024 · arXiv 2409.13652

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models

cs.LG · 2026-05-26 · unverdicted · novelty 6.0

RT-Lynx shifts DiT sparsity from weights to activations, reports up to 1.55x linear-layer speedup while preserving generation quality across multiple diffusion models.

ELAS: Efficient Pre-Training of Low-Rank Large Language Models via 2:4 Activation Sparsity

cs.LG · 2026-05-05 · unverdicted · novelty 5.0

ELAS pre-trains low-rank LLMs by applying 2:4 activation sparsity after squared ReLU to cut memory and accelerate training with minimal performance loss.

citing papers explorer

Showing 2 of 2 citing papers after filters.

RT-Lynx: Putting the GEMM Sparsity In a Right Way for Diffusion Models cs.LG · 2026-05-26 · unverdicted · none · ref 78
RT-Lynx shifts DiT sparsity from weights to activations, reports up to 1.55x linear-layer speedup while preserving generation quality across multiple diffusion models.
ELAS: Efficient Pre-Training of Low-Rank Large Language Models via 2:4 Activation Sparsity cs.LG · 2026-05-05 · unverdicted · none · ref 13
ELAS pre-trains low-rank LLMs by applying 2:4 activation sparsity after squared ReLU to cut memory and accelerate training with minimal performance loss.

Oats: Outlier-aware pruning through sparse and low rank decomposition.arXiv preprint arXiv:2409.13652

fields

years

verdicts

representative citing papers

citing papers explorer