pith. machine review for the scientific record. sign in

arxiv: 1711.09485 · v2 · submitted 2017-11-26 · 💻 cs.CV

Recognition: unknown

SkipNet: Learning Dynamic Routing in Convolutional Networks

Authors on Pith no claims yet
classification 💻 cs.CV
keywords learningconvolutionalnetworksdynamicskipnetaccuracygatinglayers
0
0 comments X
read the original abstract

While deeper convolutional networks are needed to achieve maximum accuracy in visual perception tasks, for many inputs shallower networks are sufficient. We exploit this observation by learning to skip convolutional layers on a per-input basis. We introduce SkipNet, a modified residual network, that uses a gating network to selectively skip convolutional blocks based on the activations of the previous layer. We formulate the dynamic skipping problem in the context of sequential decision making and propose a hybrid learning algorithm that combines supervised learning and reinforcement learning to address the challenges of non-differentiable skipping decisions. We show SkipNet reduces computation by 30-90% while preserving the accuracy of the original model on four benchmark datasets and outperforms the state-of-the-art dynamic networks and static compression methods. We also qualitatively evaluate the gating policy to reveal a relationship between image scale and saliency and the number of layers skipped.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    cs.LG 2024-04 conditional novelty 6.0

    Mixture-of-Depths enables transformers to dynamically allocate compute by routing only the top-k tokens through each layer's full computations, matching baseline performance with a fraction of the FLOPs per forward pa...