Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding

· 2016

1 Pith paper cites this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Split CNN Inference on Networked Microcontrollers

cs.DC · 2026-05-10 · unverdicted · novelty 6.0

A fine-grained split inference system enables CNN models infeasible on single MCUs to run across networked devices by partitioning at sub-layer granularity, reducing per-device peak RAM while keeping practical latency.

citing papers explorer

Showing 1 of 1 citing paper.

Split CNN Inference on Networked Microcontrollers

cs.DC · 2026-05-10 · unverdicted · none · ref 6
