pith. sign in

Designing Neural Network Architectures using Reinforcement Learning

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it
abstract

At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using $Q$-learning with an $\epsilon$-greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.

citation-role summary

background 1

citation-polarity summary

years

2025 1 2019 5

verdicts

UNVERDICTED 6

roles

background 1

polarities

background 1

representative citing papers

EPNAS: Efficient Progressive Neural Architecture Search

cs.LG · 2019-07-07 · unverdicted · novelty 5.0

EPNAS uses a progressive search policy with REINFORCE performance prediction to search neural architectures in parallel, supporting multiple resource constraints and outperforming ENAS and PNAS on CIFAR-10 and ImageNet in speed and accuracy.

citing papers explorer

Showing 6 of 6 citing papers.