
arXiv: 1905.06435 · v1 · pith:KCHC3AKSnew · submitted 2019-05-15 · 💻 cs.LG · cs.CV · stat.ML

Dynamic Neural Network Channel Execution for Efficient Training

classification: 💻 cs.LG · cs.CV · stat.ML
keywords: computational, training, channels, dynamic, neural, channel, compact, efficient

Existing methods for reducing the computational burden of neural networks at run-time, such as parameter pruning or dynamic computational path selection, focus solely on improving computational efficiency during inference. In contrast, we propose a novel method that reduces the memory footprint and the number of computing operations required for both training and inference. Our framework integrates pruning into the training procedure by exploring and tracking the relative importance of convolutional channels. At each training step, we select only a subset of highly salient channels to execute according to the combinatorial upper confidence bound algorithm, and run the forward and backward passes only on these activated channels, thereby learning their parameters. This enables the efficient discovery of compact models. We validate our approach empirically on state-of-the-art CNNs (VGGNet, ResNet, and DenseNet) and on several image classification datasets. Results demonstrate that our framework for dynamic channel execution reduces computational cost by up to 4x and parameter count by up to 9x, thereby reducing the memory and computational demands of discovering and training compact neural network models.
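
The channel-selection step described in the abstract can be illustrated with a minimal sketch of a combinatorial upper confidence bound (CUCB) rule that picks the top-k channels at each step. This is not the authors' implementation: the class name `ChannelCUCB`, the `demo` function, the simulated `true_saliency` values, and the Gaussian reward signal are all assumptions made for illustration, and the abstract does not say how saliency is measured. The exploration bonus used here is the standard CUCB term, sqrt(3 ln t / (2 n_i)).

```python
import math
import random


class ChannelCUCB:
    """Top-k channel selection with a combinatorial UCB rule (illustrative only)."""

    def __init__(self, num_channels, k):
        if not 0 < k <= num_channels:
            raise ValueError("k must be in 1..num_channels")
        self.k = k
        self.counts = [0] * num_channels   # times each channel was executed
        self.means = [0.0] * num_channels  # running mean of observed saliency
        self.t = 0                         # training steps seen so far

    def select(self):
        """Return the sorted indices of the k channels with the highest UCB score."""
        self.t += 1
        scores = []
        for i, (n, mu) in enumerate(zip(self.counts, self.means)):
            if n == 0:
                # Channels that were never executed are explored first.
                scores.append((math.inf, i))
            else:
                bonus = math.sqrt(3.0 * math.log(self.t) / (2.0 * n))
                scores.append((mu + bonus, i))
        # Highest score first; ties are broken by the lower channel index.
        scores.sort(key=lambda s: (-s[0], s[1]))
        return sorted(i for _, i in scores[: self.k])

    def update(self, channels, rewards):
        """Fold one observed saliency value per executed channel into its mean."""
        for i, r in zip(channels, rewards):
            self.counts[i] += 1
            self.means[i] += (r - self.means[i]) / self.counts[i]


def demo(steps=2000, seed=0):
    """Simulate training steps with a hypothetical noisy saliency signal in [0, 1]."""
    rng = random.Random(seed)
    true_saliency = [0.9, 0.8, 0.2, 0.1, 0.15, 0.05]  # assumed, not from the paper
    bandit = ChannelCUCB(num_channels=len(true_saliency), k=2)
    for _ in range(steps):
        active = bandit.select()
        # In a real network, only these channels would run forward and backward.
        rewards = [min(1.0, max(0.0, rng.gauss(true_saliency[i], 0.05)))
                   for i in active]
        bandit.update(active, rewards)
    return bandit
```

In this simulation the two channels with the highest assumed saliency end up being executed far more often than the rest, while the confidence bonus keeps occasionally re-testing rarely used channels. In an actual training loop, the reward would presumably come from a saliency measure computed on the executed channels.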



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Balancing Uncertainty and Diversity of Samples: Leveraging Diversity of Least, High Confidence Samples for Effective Active Learning

    cs.CV 2026-05 unverdicted novelty 4.0

    Proposes LCD and three other hybrid uncertainty-diversity sampling methods for active learning that outperform prior approaches by selecting uncertain yet diverse samples.