Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

Alex Alemi; Christian Szegedy; Sergey Ioffe; Vincent Vanhoucke

arxiv: 1602.07261 · v2 · pith:IRBEJYXBnew · submitted 2016-02-23 · 💻 cs.CV

Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

Christian Szegedy , Sergey Ioffe , Vincent Vanhoucke , Alex Alemi This is my paper

classification 💻 cs.CV

keywords residualinceptionnetworksconnectionsperformancearchitecturetrainingvery

0 comments

read the original abstract

Very deep convolutional networks have been central to the largest advances in image recognition performance in recent years. One example is the Inception architecture that has been shown to achieve very good performance at relatively low computational cost. Recently, the introduction of residual connections in conjunction with a more traditional architecture has yielded state-of-the-art performance in the 2015 ILSVRC challenge; its performance was similar to the latest generation Inception-v3 network. This raises the question of whether there are any benefit in combining the Inception architecture with residual connections. Here we give clear empirical evidence that training with residual connections accelerates the training of Inception networks significantly. There is also some evidence of residual Inception networks outperforming similarly expensive Inception networks without residual connections by a thin margin. We also present several new streamlined architectures for both residual and non-residual Inception networks. These variations improve the single-frame recognition performance on the ILSVRC 2012 classification task significantly. We further demonstrate how proper activation scaling stabilizes the training of very wide residual Inception networks. With an ensemble of three residual and one Inception-v4, we achieve 3.08 percent top-5 error on the test set of the ImageNet classification (CLS) challenge

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 9 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Deep Variational Information Bottleneck
cs.LG 2016-12 unverdicted novelty 8.0

Deep VIB is a neural-network parameterization of the information bottleneck objective trained via variational inference and the reparameterization trick, yielding improved generalization and adversarial robustness.
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
cs.CV 2017-04 accept novelty 7.0

MobileNets introduce depthwise separable convolutions plus width and resolution multipliers to produce efficient CNNs that trade off latency and accuracy for mobile and embedded vision applications.
Wide Residual Networks
cs.CV 2016-05 accept novelty 7.0

Wide residual networks achieve higher accuracy and faster training than very deep thin residual networks by increasing width and decreasing depth, setting new state-of-the-art results on CIFAR, SVHN, and ImageNet.
Personalized Face Privacy Protection From a Single Image
cs.CV 2026-05 unverdicted novelty 5.0

FaceCloak learns a lightweight identity-specific cloaking mask from a single image via synthetic face generation and iterative embedding perturbation to evade multiple recognition models.
Abnormal Colon Polyp Image Synthesis Using Conditional Adversarial Networks for Improved Detection Performance
eess.IV 2019-06 unverdicted novelty 5.0

A cGAN method with edge-filtered combined inputs generates synthetic polyp images from normal colonoscopy frames to augment training data and improve detection performance.
New pointwise convolution in Deep Neural Networks through Extremely Fast and Non Parametric Transforms
cs.CV 2019-06 unverdicted novelty 5.0

Replacing pointwise convolutions with DWHT yields a model with 79.1% fewer parameters, 48.4% fewer FLOPs, and 1.49% higher accuracy than MobileNet-V1 on CIFAR-100.
Lung Nodules Detection and Segmentation Using 3D Mask-RCNN
eess.IV 2019-07 unverdicted novelty 4.0

Adapted Mask-RCNN to 3D and applied it to lung nodule detection and segmentation on CT scans, reporting competitive detection results on the LUNA16 dataset.
Automatic Colon Polyp Detection using Region based Deep CNN and Post Learning Approaches
cs.CV 2019-06 unverdicted novelty 4.0

Region-based deep CNN with transfer learning and post-learning methods achieves better polyp detection performance than prior systems on large colonoscopy image and video databases.
Measuring the Transferability of Adversarial Examples
cs.LG 2019-07 unverdicted novelty 3.0

Empirical measurement of adversarial example transferability between VGG and Inception model classes with methodological refinements to attack strength selection, perturbation clipping, and evaluation via SSIM.