Do deep nets really need weight decay and dropout?

2018 · cs.CV · arXiv 1802.07042

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

The impressive success of modern deep neural networks on computer vision tasks has been achieved through models of very large capacity compared to the number of available training examples. This overparameterization is often said to be controlled with the help of different regularization techniques, mainly weight decay and dropout. However, since these techniques reduce the effective capacity of the model, typically even deeper and wider architectures are required to compensate for the reduced capacity. Therefore, there seems to be a waste of capacity in this practice. In this paper we build upon recent research that suggests that explicit regularization may not be as important as widely believed and carry out an ablation study that concludes that weight decay and dropout may not be necessary for object recognition if enough data augmentation is introduced.

representative citing papers

Further advantages of data augmentation on convolutional neural networks

cs.CV · 2019-06-26 · unverdicted · novelty 4.0

Data augmentation enables CNNs to adapt to varying architectures and data amounts without hyperparameter fine-tuning, unlike weight decay and dropout.

