Efficient Neural Architecture Search via Parameter Sharing

· 2018 · cs.LG · arXiv 1802.03268

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

open full Pith review browse 8 citing papers arXiv PDF

abstract

We propose Efficient Neural Architecture Search (ENAS), a fast and inexpensive approach for automatic model design. In ENAS, a controller learns to discover neural network architectures by searching for an optimal subgraph within a large computational graph. The controller is trained with policy gradient to select a subgraph that maximizes the expected reward on the validation set. Meanwhile the model corresponding to the selected subgraph is trained to minimize a canonical cross entropy loss. Thanks to parameter sharing between child models, ENAS is fast: it delivers strong empirical performances using much fewer GPU-hours than all existing automatic model design approaches, and notably, 1000x less expensive than standard Neural Architecture Search. On the Penn Treebank dataset, ENAS discovers a novel architecture that achieves a test perplexity of 55.8, establishing a new state-of-the-art among all methods without post-training processing. On the CIFAR-10 dataset, ENAS designs novel architectures that achieve a test error of 2.89%, which is on par with NASNet (Zoph et al., 2018), whose test error is 2.65%.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Characterizing Learning in Deep Neural Networks using Tractable Algorithmic Complexity Analysis

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

QuBD extends algorithmic complexity estimation to quantized DNN weights, revealing that complexity decreases during learning, increases with overfitting, follows grokking patterns, and correlates with generalization.

Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data

cs.LG · 2023-10-04 · unverdicted · novelty 7.0

Experimental comparison of 15 HPO and NAS algorithms for automated feature preprocessing on 45 tabular datasets finds evolution-based methods and random search as top performers.

Switchable Normalization for Learning-to-Normalize Deep Representation

cs.CV · 2019-07-22 · unverdicted · novelty 7.0

Switchable Normalization learns per-layer weights to combine channel, layer, and minibatch normalizers, claiming robustness to batch size and better results than fixed normalizers on ImageNet, COCO, CityScapes, ADE20K, MegaFace, and Kinetics.

Video Action Recognition Via Neural Architecture Searching

cs.CV · 2019-07-10 · unverdicted · novelty 6.0

Uses differentiable NAS with temporal segments and pseudo-3D operators to discover a video action recognition network that outperforms hand-designed models on UCF101 with ~1% of the parameters when trained from scratch.

EnforceNet: Monocular Camera Localization in Large Scale Indoor Sparse LiDAR Point Cloud

cs.CV · 2019-07-16 · unverdicted · novelty 5.0

EnforceNet achieves centimeter-level monocular camera localization in sparse LiDAR maps of indoor parking garages via a novel resistor module that improves generalization, accuracy, and training speed.

EPNAS: Efficient Progressive Neural Architecture Search

cs.LG · 2019-07-07 · unverdicted · novelty 5.0

EPNAS uses a progressive search policy with REINFORCE performance prediction to search neural architectures in parallel, supporting multiple resource constraints and outperforming ENAS and PNAS on CIFAR-10 and ImageNet in speed and accuracy.

Self-Adaptive 2D-3D Ensemble of Fully Convolutional Networks for Medical Image Segmentation

eess.IV · 2019-07-26 · unverdicted · novelty 4.0

Self-adaptive 2D-3D FCN ensemble optimized by multiobjective evolution for prostate segmentation on PROMISE12 achieves top-10 ranking with smaller size than prior auto-designed models.

Genetic Network Architecture Search

cs.NE · 2019-07-05 · unverdicted · novelty 3.0

Genetic algorithm searches convolution cell architectures with weight sharing via SGD, reporting 96% accuracy on CIFAR10 and 80.1% on CIFAR100.

citing papers explorer

Showing 8 of 8 citing papers.

Characterizing Learning in Deep Neural Networks using Tractable Algorithmic Complexity Analysis cs.LG · 2026-05-15 · unverdicted · none · ref 54 · internal anchor
QuBD extends algorithmic complexity estimation to quantized DNN weights, revealing that complexity decreases during learning, increases with overfitting, follows grokking patterns, and correlates with generalization.
Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data cs.LG · 2023-10-04 · unverdicted · none · ref 63 · internal anchor
Experimental comparison of 15 HPO and NAS algorithms for automated feature preprocessing on 45 tabular datasets finds evolution-based methods and random search as top performers.
Switchable Normalization for Learning-to-Normalize Deep Representation cs.CV · 2019-07-22 · unverdicted · none · ref 14 · internal anchor
Switchable Normalization learns per-layer weights to combine channel, layer, and minibatch normalizers, claiming robustness to batch size and better results than fixed normalizers on ImageNet, COCO, CityScapes, ADE20K, MegaFace, and Kinetics.
Video Action Recognition Via Neural Architecture Searching cs.CV · 2019-07-10 · unverdicted · none · ref 18 · internal anchor
Uses differentiable NAS with temporal segments and pseudo-3D operators to discover a video action recognition network that outperforms hand-designed models on UCF101 with ~1% of the parameters when trained from scratch.
EnforceNet: Monocular Camera Localization in Large Scale Indoor Sparse LiDAR Point Cloud cs.CV · 2019-07-16 · unverdicted · none · ref 35 · internal anchor
EnforceNet achieves centimeter-level monocular camera localization in sparse LiDAR maps of indoor parking garages via a novel resistor module that improves generalization, accuracy, and training speed.
EPNAS: Efficient Progressive Neural Architecture Search cs.LG · 2019-07-07 · unverdicted · none · ref 34 · internal anchor
EPNAS uses a progressive search policy with REINFORCE performance prediction to search neural architectures in parallel, supporting multiple resource constraints and outperforming ENAS and PNAS on CIFAR-10 and ImageNet in speed and accuracy.
Self-Adaptive 2D-3D Ensemble of Fully Convolutional Networks for Medical Image Segmentation eess.IV · 2019-07-26 · unverdicted · none · ref 12 · internal anchor
Self-adaptive 2D-3D FCN ensemble optimized by multiobjective evolution for prostate segmentation on PROMISE12 achieves top-10 ranking with smaller size than prior auto-designed models.
Genetic Network Architecture Search cs.NE · 2019-07-05 · unverdicted · none · ref 6 · internal anchor
Genetic algorithm searches convolution cell architectures with weight sharing via SGD, reporting 96% accuracy on CIFAR10 and 80.1% on CIFAR100.

Efficient Neural Architecture Search via Parameter Sharing

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer