pith. sign in

arxiv: 1904.04612 · v1 · pith:BO23KYYDnew · submitted 2019-04-09 · 💻 cs.LG · cs.CV

Automated Search for Configurations of Deep Neural Network Architectures

classification 💻 cs.LG cs.CV
keywords architecturesmodelautomatedconfigurationmethodsearchconfigurationsdeep
0
0 comments X p. Extension
pith:BO23KYYD Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{BO23KYYD}

Prints a linked pith:BO23KYYD badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Deep Neural Networks (DNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view DNNs as configurable systems and propose an end-to-end framework that allows the configuration, evaluation and automated search for DNN architectures. Therefore, our contribution is threefold. First, we model the variability of DNN architectures with a Feature Model (FM) that generalizes over existing architectures. Each valid configuration of the FM corresponds to a valid DNN model that can be built and trained. Second, we implement, on top of Tensorflow, an automated procedure to deploy, train and evaluate the performance of a configured model. Third, we propose a method to search for configurations and demonstrate that it leads to good DNN models. We evaluate our method by applying it on image classification tasks (MNIST, CIFAR-10) and show that, with limited amount of computation and training, our method can identify high-performing architectures (with high accuracy). We also demonstrate that we outperform existing state-of-the-art architectures handcrafted by ML researchers. Our FM and framework have been released %and are publicly available to support replication and future research.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.