AutoSlim: Towards One-Shot Architecture Search for Channel Numbers

Jiahui Yu; Thomas Huang

arxiv: 1903.11728 · v2 · pith:472BV26Znew · submitted 2019-03-27 · 💻 cs.CV · cs.AI

classification 💻 cs.CV cs.AI

keywords: channel, accuracy, flops, better, network, configurations, numbers, slimmable

We study how to set channel numbers in a neural network to achieve better accuracy under constrained resources (e.g., FLOPs, latency, memory footprint or model size). A simple and one-shot solution, named AutoSlim, is presented. Instead of training many network samples and searching with reinforcement learning, we train a single slimmable network to approximate the network accuracy of different channel configurations. We then iteratively evaluate the trained slimmable model and greedily slim the layer with minimal accuracy drop. By this single pass, we can obtain the optimized channel configurations under different resource constraints. We present experiments with MobileNet v1, MobileNet v2, ResNet-50 and RL-searched MNasNet on ImageNet classification. We show significant improvements over their default channel configurations. We also achieve better accuracy than recent channel pruning methods and neural architecture search methods. Notably, by setting optimized channel numbers, our AutoSlim-MobileNet-v2 at 305M FLOPs achieves 74.2% top-1 accuracy, 2.4% better than default MobileNet-v2 (301M FLOPs), and even 0.2% better than RL-searched MNasNet (317M FLOPs). Our AutoSlim-ResNet-50 at 570M FLOPs, without depthwise convolutions, achieves 1.3% better accuracy than MobileNet-v1 (569M FLOPs). Code and models will be available at: https://github.com/JiahuiYu/slimmable_networks
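The greedy slimming step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function `greedy_slim`, its parameters (`step`, `min_channels`), and the toy `toy_accuracy` and `toy_cost` functions are all hypothetical stand-ins. In the paper's setting, the accuracy estimate would come from evaluating the trained slimmable network, and the cost would be the chosen resource measure such as FLOPs.

```python
import math
from typing import Callable, List


def greedy_slim(
    channels: List[int],
    evaluate: Callable[[List[int]], float],
    cost: Callable[[List[int]], float],
    budget: float,
    step: int = 8,
    min_channels: int = 8,
) -> List[int]:
    """Repeatedly shrink the layer whose slimming hurts accuracy least.

    `evaluate` and `cost` are assumed callables: the first returns an
    accuracy estimate for a channel configuration, the second its
    resource cost. Both names are illustrative, not from the paper.
    """
    config = list(channels)
    while cost(config) > budget:
        best_acc, best_layer = None, None
        for i, c in enumerate(config):
            if c - step < min_channels:
                continue  # this layer cannot be slimmed further
            trial = config[:i] + [c - step] + config[i + 1:]
            acc = evaluate(trial)
            # Highest remaining accuracy means smallest accuracy drop.
            if best_acc is None or acc > best_acc:
                best_acc, best_layer = acc, i
        if best_layer is None:
            break  # no layer can be slimmed; budget is unreachable
        config[best_layer] -= step
    return config


def toy_cost(cfg: List[int]) -> float:
    # Toy proxy: cost of a chain of 1x1 convolutions (in * out per layer pair).
    return float(sum(a * b for a, b in zip(cfg, cfg[1:])))


def toy_accuracy(cfg: List[int]) -> float:
    # Toy proxy: each layer contributes with a different (made-up) importance.
    weights = [3.0, 1.0, 2.0]
    return sum(w * math.log(c) for w, c in zip(weights, cfg))


start = [64, 64, 64]
slim = greedy_slim(start, toy_accuracy, toy_cost, budget=0.5 * toy_cost(start))
print(slim, toy_cost(slim))
```

Because the loop records nothing but the current configuration, a single pass from the full network down to the smallest budget could also log the configuration at each intermediate budget, which matches the abstract's point that one pass yields configurations for several resource constraints.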

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Slimmable ConvNeXt: Width-Adaptive Inference for Efficient Multi-Device Deployment
cs.CV 2026-05 unverdicted novelty 7.0

Slimmable ConvNeXt adapts ConvNeXt for width-adaptive inference using LayerNorm and inverted bottlenecks, reaching 80.8% top-1 at 4.5 GMACs and outperforming HydraViT, MatFormer, and SortedNet on ImageNet-1k.

Response-Conditioned Parallel-to-Sequential Orchestration for Multi-Agent Systems
cs.CL 2026-05 unverdicted novelty 6.0

Nexa learns a response-conditioned policy that starts with parallel agent execution and adds at most one round of sequential message passing via a predicted sparse DAG, strictly subsuming pure parallel mode.