pith. machine review for the scientific record.

arxiv: 1705.07485 · v2 · submitted 2017-05-21 · 💻 cs.LG · cs.CV

Recognition: unknown

Shake-Shake regularization

classification 💻 cs.LG cs.CV
keywords: shake-shake · regularization · results · affine · aims · applications · applied · architectures

The method introduced in this paper aims to help deep learning practitioners facing an overfitting problem. The idea is to replace, in a multi-branch network, the standard summation of parallel branches with a stochastic affine combination. Applied to 3-branch residual networks, shake-shake regularization improves on the best single-shot published results on CIFAR-10 and CIFAR-100, reaching test errors of 2.86% and 15.85%, respectively. Experiments on architectures without skip connections or Batch Normalization show encouraging results and open the door to a large set of applications. Code is available at https://github.com/xgastaldi/shake-shake
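To make the central idea concrete, the following is a minimal sketch of a stochastic affine combination of two parallel branches, written in plain Python on lists of floats. It is not taken from the linked repository. The function names `shake_combine` and `residual_block` are hypothetical, and the sketch assumes that the mixing coefficient is drawn uniformly from [0, 1) during training and fixed at 0.5 at evaluation time. It covers only the forward pass and does not address how gradients are handled.

```python
import random


def shake_combine(branch_a, branch_b, training=True, rng=random):
    """Mix two branch outputs as alpha * a + (1 - alpha) * b.

    Assumption: alpha is uniform on [0, 1) while training and 0.5 otherwise.
    """
    alpha = rng.random() if training else 0.5
    return [alpha * a + (1.0 - alpha) * b for a, b in zip(branch_a, branch_b)]


def residual_block(x, f1, f2, training=True, rng=random):
    """Hypothetical 3-branch block: skip connection plus two mixed branches."""
    mixed = shake_combine(f1(x), f2(x), training, rng)
    return [xi + mi for xi, mi in zip(x, mixed)]


if __name__ == "__main__":
    x = [1.0, 2.0]
    double = lambda v: [2.0 * t for t in v]   # stand-in for a learned branch
    zero = lambda v: [0.0 for _ in v]         # stand-in for a second branch
    print(residual_block(x, double, zero, training=False))
    print(residual_block(x, double, zero, training=True, rng=random.Random(0)))
```

Because the two coefficients sum to one, the mixed output always lies between the two branch outputs element-wise, so the random weighting perturbs the balance between the branches without changing the overall scale of their combined contribution.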

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Improved Regularization of Convolutional Neural Networks with Cutout

    cs.CV · 2017-08 · accept · novelty 7.0

    Randomly masking square regions of input images during CNN training yields new state-of-the-art test errors of 2.56% on CIFAR-10, 15.20% on CIFAR-100, and 1.30% on SVHN.
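The masking step summarized above can be sketched as follows. This is an illustrative sketch, not the cited paper's implementation. The function name `cutout`, the `size` parameter, the zero fill value, and the choice to let the square be clipped at the image border are all assumptions made for the example, and the image is represented as a plain list of rows.

```python
import random


def cutout(image, size, rng=random):
    """Return a copy of a 2-D image with one square region set to zero.

    Assumptions: the square's center is sampled uniformly over the image,
    and any part of the square that falls outside the image is clipped.
    """
    height, width = len(image), len(image[0])
    cy, cx = rng.randrange(height), rng.randrange(width)
    top, bottom = max(0, cy - size // 2), min(height, cy + size // 2)
    left, right = max(0, cx - size // 2), min(width, cx + size // 2)
    out = [row[:] for row in image]  # copy so the input is left untouched
    for r in range(top, bottom):
        for c in range(left, right):
            out[r][c] = 0.0
    return out


if __name__ == "__main__":
    img = [[1.0] * 8 for _ in range(8)]
    masked = cutout(img, size=4, rng=random.Random(0))
    for row in masked:
        print(row)
```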