pith. sign in

arxiv: 1905.11320 · v2 · pith:TKQPU5M5new · submitted 2019-05-27 · 💻 cs.LG · stat.ML

On approximating dropout noise injection

classification 💻 cs.LG stat.ML
keywords approximationdropoutinjectionlogisticnoiseregressionapproachapproximating
0
0 comments X
read the original abstract

This paper examines the assumptions of the derived equivalence between dropout noise injection and $L_2$ regularisation for logistic regression with negative log loss. We show that the approximation method is based on a divergent Taylor expansion, making, subsequent work using this approximation to compare the dropout trained logistic regression model with standard regularisers unfortunately ill-founded to date. Moreover, the approximation approach is shown to be invalid using any robust constraints. We show how this finding extends to general neural network topologies that use a cross-entropy prediction layer.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.