FreezeOut: Accelerate Training by Progressively Freezing Layers

Andrew Brock; J.M. Ritchie; Nick Weston; Theodore Lim

arxiv: 1706.04983 · v2 · pith:Z4D2MFQFnew · submitted 2017-06-15 · 📊 stat.ML · cs.LG


keywords: freezeout, layers, training, accuracy, freezing, loss, them, abstract


The early layers of a deep neural net have the fewest parameters, but take up the most computation. In this extended abstract, we propose to only train the hidden layers for a set portion of the training run, freezing them out one-by-one and excluding them from the backward pass. Through experiments on CIFAR, we empirically demonstrate that FreezeOut yields savings of up to 20% wall-clock time during training with 3% loss in accuracy for DenseNets, a 20% speedup without loss of accuracy for ResNets, and no improvement for VGG networks. Our code is publicly available at https://github.com/ajbrock/FreezeOut
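
To make the idea of freezing layers one-by-one concrete, here is a minimal sketch of a freezing schedule in plain Python. It is not the authors' implementation (see the linked repository for that). The abstract does not say how the freeze times are chosen, so the linear spacing below is an assumption, as are the names `freeze_points`, `first_trainable_layer`, and the parameter `t0` (the fraction of training after which the first layer is frozen).

```python
def freeze_points(num_layers, t0=0.5):
    """Return, for each layer, the fraction of training at which it is frozen.

    Assumption (not stated in the abstract): the first layer freezes at t0
    and later layers freeze at linearly spaced points up to 1.0, so the
    last layer trains for the whole run.
    """
    if num_layers < 1:
        raise ValueError("num_layers must be at least 1")
    if num_layers == 1:
        return [1.0]
    step = (1.0 - t0) / (num_layers - 1)
    return [t0 + i * step for i in range(num_layers)]


def first_trainable_layer(progress, points):
    """Return the index of the earliest layer that is still being trained.

    `progress` is the fraction of training completed, in [0, 1]. Layers
    freeze in order from the input side, so the backward pass only needs
    to reach this index; everything before it is excluded.
    Returns len(points) once every layer is frozen.
    """
    for i, freeze_at in enumerate(points):
        if progress < freeze_at:
            return i
    return len(points)


points = freeze_points(5, t0=0.5)
for progress in (0.0, 0.5, 0.7, 0.9):
    start = first_trainable_layer(progress, points)
    print(f"progress={progress:.1f}: backward pass covers layers {start}..{len(points) - 1}")
```

In a real training loop, the returned index would decide which parameters still receive gradients. Because the earliest layers are the first to leave the backward pass, the cost of each training step falls as the run proceeds.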
