Deep Residual Networks and Weight Initialization

Masato Taki

arxiv: 1709.02956 · v1 · pith:KLMJMNXInew · submitted 2017-09-09 · 💻 cs.LG · stat.ML

Deep Residual Networks and Weight Initialization

Masato Taki This is my paper

classification 💻 cs.LG stat.ML

keywords deepnetworkresnetsinitialinitializationneuralresidualresnet

0 comments

read the original abstract

Residual Network (ResNet) is the state-of-the-art architecture that realizes successful training of really deep neural network. It is also known that good weight initialization of neural network avoids problem of vanishing/exploding gradients. In this paper, simplified models of ResNets are analyzed. We argue that goodness of ResNet is correlated with the fact that ResNets are relatively insensitive to choice of initial weights. We also demonstrate how batch normalization improves backpropagation of deep ResNets without tuning initial values of weights.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

SpikeDet: Better Firing Patterns for Accurate and Energy-Efficient Object Detection with Spiking Neural Networks
cs.CV 2025-01 unverdicted novelty 6.0

SpikeDet reaches 52.2% AP on COCO 2017 with spiking networks by optimizing firing patterns via MDSNet and SMFM, using half the energy of prior SNN detectors.