Deep neural networks compression: A comparative survey and choice recommenda- tions

· 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression

cs.LG · 2026-04-05 · unverdicted · novelty 4.0

The prune-quantize-distill ordering produces a better accuracy-size-latency frontier on CIFAR-10/100 than any single technique or other orderings, with INT8 QAT providing the main runtime gain.

citing papers explorer

Showing 1 of 1 citing paper.

Prune-Quantize-Distill: An Ordered Pipeline for Efficient Neural Network Compression cs.LG · 2026-04-05 · unverdicted · none · ref 3
The prune-quantize-distill ordering produces a better accuracy-size-latency frontier on CIFAR-10/100 than any single technique or other orderings, with INT8 QAT providing the main runtime gain.

Deep neural networks compression: A comparative survey and choice recommenda- tions

fields

years

verdicts

representative citing papers

citing papers explorer