pith. sign in

arxiv: 1712.05440 · v1 · pith:UF2NKNWMnew · submitted 2017-12-14 · 💻 cs.LG · cs.GT

Nonparametric Neural Networks

classification 💻 cs.LG cs.GT
keywords networknetworksneuralautomaticallyframeworknonparametricoptimizationpenalty
0
0 comments X
read the original abstract

Automatically determining the optimal size of a neural network for a given task without prior information currently requires an expensive global search and training many networks from scratch. In this paper, we address the problem of automatically finding a good network size during a single training cycle. We introduce *nonparametric neural networks*, a non-probabilistic framework for conducting optimization over all possible network sizes and prove its soundness when network growth is limited via an L_p penalty. We train networks under this framework by continuously adding new units while eliminating redundant units via an L_2 penalty. We employ a novel optimization algorithm, which we term *adaptive radial-angular gradient descent* or *AdaRad*, and obtain promising results.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.