The Quest for the Golden Activation Function

Mina Basirat; Peter M. Roth

arxiv: 1808.00783 · v1 · pith:MRA5HMA3new · submitted 2018-08-02 · 💻 cs.NE · cs.CV· cs.LG· stat.ML

The Quest for the Golden Activation Function

Mina Basirat , Peter M. Roth This is my paper

classification 💻 cs.NE cs.CVcs.LGstat.ML

keywords activationfunctionsdifferentfunctiondemonstratedesignmanualselection

0 comments

read the original abstract

Deep Neural Networks have been shown to be beneficial for a variety of tasks, in particular allowing for end-to-end learning and reducing the requirement for manual design decisions. However, still many parameters have to be chosen in advance, also raising the need to optimize them. One important, but often ignored system parameter is the selection of a proper activation function. Thus, in this paper we target to demonstrate the importance of activation functions in general and show that for different tasks different activation functions might be meaningful. To avoid the manual design or selection of activation functions, we build on the idea of genetic algorithms to learn the best activation function for a given task. In addition, we introduce two new activation functions, ELiSH and HardELiSH, which can easily be incorporated in our framework. In this way, we demonstrate for three different image classification benchmarks that different activation functions are learned, also showing improved results compared to typically used baselines.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Neural Network Architecture Search with Differentiable Cartesian Genetic Programming for Regression
cs.NE 2019-07 unverdicted novelty 7.0

dCGPANN encodes neural nets so evolutionary operators can rewire, prune, adapt activations and add skips while gradient descent tunes parameters, yielding smaller networks with lower regression error in fixed time.