HEAPr: Hessian-based efficient atomic expert pruning in output space. arXiv preprint arXiv:2509.22299

Li, K · arXiv 2509.22299

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Deterministic Differentiable Structured Pruning for Large Language Models

cs.LG · 2026-03-09 · unverdicted · novelty 6.0

DDP replaces stochastic hard-concrete masks with a deterministic soft surrogate for l0-constrained structured pruning, delivering 1% performance loss on Qwen3 models at 20% sparsity and faster convergence than prior methods.

citing papers explorer

Showing 1 of 1 citing paper.

Deterministic Differentiable Structured Pruning for Large Language Models

cs.LG · 2026-03-09 · unverdicted · none · ref 5 · internal anchor
