Improving Neural Architecture Search Image Classifiers via Ensemble Learning

Charles Weill; Hanna Mazzawi; Javier Gonzalvo; Vladimir Macko

arxiv: 1903.06236 · v1 · pith:5472J6PMnew · submitted 2019-03-14 · 💻 cs.LG · stat.ML

Improving Neural Architecture Search Image Classifiers via Ensemble Learning

Vladimir Macko , Charles Weill , Hanna Mazzawi , Javier Gonzalvo This is my paper

classification 💻 cs.LG stat.ML

keywords neuralarchitectureensemblenetworknetworkssinglebestblock

0 comments

read the original abstract

Finding the best neural network architecture requires significant time, resources, and human expertise. These challenges are partially addressed by neural architecture search (NAS) which is able to find the best convolutional layer or cell that is then used as a building block for the network. However, once a good building block is found, manual design is still required to assemble the final architecture as a combination of multiple blocks under a predefined parameter budget constraint. A common solution is to stack these blocks into a single tower and adjust the width and depth to fill the parameter budget. However, these single tower architectures may not be optimal. Instead, in this paper we present the AdaNAS algorithm, that uses ensemble techniques to compose a neural network as an ensemble of smaller networks automatically. Additionally, we introduce a novel technique based on knowledge distillation to iteratively train the smaller networks using the previous ensemble as a teacher. Our experiments demonstrate that ensembles of networks improve accuracy upon a single neural network while keeping the same number of parameters. Our models achieve comparable results with the state-of-the-art on CIFAR-10 and sets a new state-of-the-art on CIFAR-100.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Semi-Supervised Learning for Cancer Detection of Lymph Node Metastases
cs.CV 2019-06 unverdicted novelty 3.0

A CNN model trained with pseudo-label semi-supervised learning reports higher AUC than a supervised baseline on the PCam histopathology dataset.