Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation

· 2019 · cs.CV · arXiv 1901.02985

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Recently, Neural Architecture Search (NAS) has successfully identified neural network architectures that exceed human designed ones on large-scale image classification. In this paper, we study NAS for semantic image segmentation. Existing works often focus on searching the repeatable cell structure, while hand-designing the outer network structure that controls the spatial resolution changes. This choice simplifies the search space, but becomes increasingly problematic for dense image prediction which exhibits a lot more network level architectural variations. Therefore, we propose to search the network level structure in addition to the cell level structure, which forms a hierarchical architecture search space. We present a network level search space that includes many popular designs, and develop a formulation that allows efficient gradient-based architecture search (3 P100 GPU days on Cityscapes images). We demonstrate the effectiveness of the proposed method on the challenging Cityscapes, PASCAL VOC 2012, and ADE20K datasets. Auto-DeepLab, our architecture searched specifically for semantic image segmentation, attains state-of-the-art performance without any ImageNet pretraining.

representative citing papers

Gated-SCNN: Gated Shape CNNs for Semantic Segmentation

cs.CV · 2019-07-12 · unverdicted · novelty 6.0

Gated-SCNN adds a gated shape stream to standard CNNs for semantic segmentation, achieving improved boundary quality and SOTA results on Cityscapes.

Video Action Recognition Via Neural Architecture Searching

cs.CV · 2019-07-10 · unverdicted · novelty 6.0

Uses differentiable NAS with temporal segments and pseudo-3D operators to discover a video action recognition network that outperforms hand-designed models on UCF101 with ~1% of the parameters when trained from scratch.

citing papers explorer

Showing 2 of 2 citing papers.

Gated-SCNN: Gated Shape CNNs for Semantic Segmentation cs.CV · 2019-07-12 · unverdicted · none · ref 34 · internal anchor
Gated-SCNN adds a gated shape stream to standard CNNs for semantic segmentation, achieving improved boundary quality and SOTA results on Cityscapes.
Video Action Recognition Via Neural Architecture Searching cs.CV · 2019-07-10 · unverdicted · none · ref 17 · internal anchor
Uses differentiable NAS with temporal segments and pseudo-3D operators to discover a video action recognition network that outperforms hand-designed models on UCF101 with ~1% of the parameters when trained from scratch.

Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation

fields

years

verdicts

representative citing papers

citing papers explorer