pith. sign in

arxiv: 1810.05466 · v1 · pith:2KV2EAQEnew · submitted 2018-10-12 · 💻 cs.LG · stat.ML

Mode Normalization

classification 💻 cs.LG stat.ML
keywords normalizationlearningmethodsingleaccelerateapproacharguablybatch
0
0 comments X
read the original abstract

Normalization methods are a central building block in the deep learning toolbox. They accelerate and stabilize training, while decreasing the dependence on manually tuned learning rate schedules. When learning from multi-modal distributions, the effectiveness of batch normalization (BN), arguably the most prominent normalization method, is reduced. As a remedy, we propose a more flexible approach: by extending the normalization to more than a single mean and variance, we detect modes of data on-the-fly, jointly normalizing samples that share common features. We demonstrate that our method outperforms BN and other widely used normalization techniques in several experiments, including single and multi-task datasets.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.