Designing Neural Network Architectures using Reinforcement Learning

· 2016 · cs.LG · arXiv 1611.02167

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

open full Pith review browse 6 citing papers arXiv PDF

abstract

At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using $Q$-learning with an $\epsilon$-greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

AGAN: Towards Automated Design of Generative Adversarial Networks

cs.LG · 2019-06-25 · unverdicted · novelty 8.0

AGAN is the first neural architecture search method for GANs that discovers architectures outperforming state-of-the-art on CIFAR-10 unsupervised image generation and competitive on supervised tasks.

Neural Network Architecture Search with Differentiable Cartesian Genetic Programming for Regression

cs.NE · 2019-07-03 · unverdicted · novelty 7.0

dCGPANN encodes neural nets so evolutionary operators can rewire, prune, adapt activations and add skips while gradient descent tunes parameters, yielding smaller networks with lower regression error in fixed time.

EPNAS: Efficient Progressive Neural Architecture Search

cs.LG · 2019-07-07 · unverdicted · novelty 5.0

EPNAS uses a progressive search policy with REINFORCE performance prediction to search neural architectures in parallel, supporting multiple resource constraints and outperforming ENAS and PNAS on CIFAR-10 and ImageNet in speed and accuracy.

Hyp-RL : Hyperparameter Optimization by Reinforcement Learning

cs.LG · 2019-06-27 · unverdicted · novelty 5.0

Reinforcement learning selects hyperparameters sequentially by learning from actual future validation loss reductions and outperforms SMBO methods on 50 datasets.

Self-Adaptive 2D-3D Ensemble of Fully Convolutional Networks for Medical Image Segmentation

eess.IV · 2019-07-26 · unverdicted · novelty 4.0

Self-adaptive 2D-3D FCN ensemble optimized by multiobjective evolution for prostate segmentation on PROMISE12 achieves top-10 ranking with smaller size than prior auto-designed models.

Spiking Neural Network Architecture Search: A Survey

cs.NE · 2025-10-16 · unverdicted · novelty 2.0

A survey of Spiking Neural Network architecture search techniques viewed through a hardware/software co-design lens.

citing papers explorer

Showing 6 of 6 citing papers.

AGAN: Towards Automated Design of Generative Adversarial Networks cs.LG · 2019-06-25 · unverdicted · none · ref 26 · internal anchor
AGAN is the first neural architecture search method for GANs that discovers architectures outperforming state-of-the-art on CIFAR-10 unsupervised image generation and competitive on supervised tasks.
Neural Network Architecture Search with Differentiable Cartesian Genetic Programming for Regression cs.NE · 2019-07-03 · unverdicted · none · ref 2 · internal anchor
dCGPANN encodes neural nets so evolutionary operators can rewire, prune, adapt activations and add skips while gradient descent tunes parameters, yielding smaller networks with lower regression error in fixed time.
EPNAS: Efficient Progressive Neural Architecture Search cs.LG · 2019-07-07 · unverdicted · none · ref 4 · internal anchor
EPNAS uses a progressive search policy with REINFORCE performance prediction to search neural architectures in parallel, supporting multiple resource constraints and outperforming ENAS and PNAS on CIFAR-10 and ImageNet in speed and accuracy.
Hyp-RL : Hyperparameter Optimization by Reinforcement Learning cs.LG · 2019-06-27 · unverdicted · none · ref 2 · internal anchor
Reinforcement learning selects hyperparameters sequentially by learning from actual future validation loss reductions and outperforms SMBO methods on 50 datasets.
Self-Adaptive 2D-3D Ensemble of Fully Convolutional Networks for Medical Image Segmentation eess.IV · 2019-07-26 · unverdicted · none · ref 13 · internal anchor
Self-adaptive 2D-3D FCN ensemble optimized by multiobjective evolution for prostate segmentation on PROMISE12 achieves top-10 ranking with smaller size than prior auto-designed models.
Spiking Neural Network Architecture Search: A Survey cs.NE · 2025-10-16 · unverdicted · none · ref 79 · internal anchor
A survey of Spiking Neural Network architecture search techniques viewed through a hardware/software co-design lens.

Designing Neural Network Architectures using Reinforcement Learning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer