pith. sign in

arxiv: 1706.05791 · v1 · pith:EFSSACCPnew · submitted 2017-06-19 · 💻 cs.CV

An Entropy-based Pruning Method for CNN Compression

classification 💻 cs.CV
keywords methodcompressionfilterentropy-basedpruningduringimportancemodel
0
0 comments X
read the original abstract

This paper aims to simultaneously accelerate and compress off-the-shelf CNN models via filter pruning strategy. The importance of each filter is evaluated by the proposed entropy-based method first. Then several unimportant filters are discarded to get a smaller CNN model. Finally, fine-tuning is adopted to recover its generalization ability which is damaged during filter pruning. Our method can reduce the size of intermediate activations, which would dominate most memory footprint during model training stage but is less concerned in previous compression methods. Experiments on the ILSVRC-12 benchmark demonstrate the effectiveness of our method. Compared with previous filter importance evaluation criteria, our entropy-based method obtains better performance. We achieve 3.3x speed-up and 16.64x compression on VGG-16, 1.54x acceleration and 1.47x compression on ResNet-50, both with about 1% top-5 accuracy decrease.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Temporal Aware Pruning for Efficient Diffusion-based Video Generation

    cs.CV 2026-05 unverdicted novelty 6.0

    TAPE introduces temporal-aware token pruning for diffusion-based video generation, using frame smoothing, layer reselection, and timestep budgets to achieve speedups while maintaining visual fidelity and coherence.

  2. Temporal Aware Pruning for Efficient Diffusion-based Video Generation

    cs.CV 2026-05 unverdicted novelty 5.0

    TAPE applies temporal-aware token pruning with smoothing, reselection, and timestep scheduling to speed up video diffusion models while preserving visual fidelity and coherence.