Pruning large language models with semi-structural adaptive sparse training. Proceedings of the AAAI Conference on Artificial Intelligence, 39(23):24167–24175, Apr

Huang, W · DOI 10.1609/aaai.v39i23.34592

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Deterministic Differentiable Structured Pruning for Large Language Models

cs.LG · 2026-03-09 · unverdicted · novelty 6.0

DDP replaces stochastic hard-concrete masks with a deterministic soft surrogate for l0-constrained structured pruning, delivering 1% performance loss on Qwen3 models at 20% sparsity and faster convergence than prior methods.

citing papers explorer

Showing 1 of 1 citing paper.

Deterministic Differentiable Structured Pruning for Large Language Models cs.LG · 2026-03-09 · unverdicted · none · ref 3

