Accelerating the Super-Resolution Convolutional Neural Network

Chao Dong; Chen Change Loy; Xiaoou Tang

arxiv: 1608.00367 · v1 · pith:WV3GY24Fnew · submitted 2016-08-01 · 💻 cs.CV

classification 💻 cs.CV

keywords mapping · performance · network · srcnn · super-resolution · accelerating · convolutional · image

As a successful deep model applied in image super-resolution (SR), the Super-Resolution Convolutional Neural Network (SRCNN) has demonstrated superior performance to the previous hand-crafted models in both speed and restoration quality. However, its high computational cost still hinders practical use that demands real-time performance (24 fps). In this paper, we aim to accelerate the current SRCNN and propose a compact hourglass-shaped CNN structure for faster and better SR. We redesign the SRCNN structure in three main aspects. First, we introduce a deconvolution layer at the end of the network, so that the mapping is learned directly from the original low-resolution image (without interpolation) to the high-resolution one. Second, we reformulate the mapping layer by shrinking the input feature dimension before mapping and expanding it back afterwards. Third, we adopt smaller filter sizes but more mapping layers. The proposed model achieves a speedup of more than 40 times with even superior restoration quality. Further, we present parameter settings that achieve real-time performance on a generic CPU while still maintaining good performance. A corresponding transfer strategy is also proposed for fast training and testing across different upscaling factors.
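The saving from the second and third redesign points (shrink, map with small filters, expand) can be illustrated with a rough weight count. The sketch below is only an illustration under stated assumptions: the abstract does not give layer sizes, so the feature dimension `d = 56`, the shrunk dimension `s = 12`, the `m = 4` mapping layers with 3x3 filters, and the 5x5 single-layer baseline are all assumed values, and the function names are hypothetical. Biases are ignored.

```python
def conv_weights(kernel, c_in, c_out):
    """Number of weights in a kernel x kernel convolution (biases ignored)."""
    return kernel * kernel * c_in * c_out


def hourglass_weights(d, s, m, k_map=3):
    """Weights in a shrink -> m mapping layers -> expand stack.

    d: feature dimension entering and leaving the stack (assumed value)
    s: shrunk feature dimension used inside the mapping (assumed value)
    m: number of mapping layers (assumed value)
    """
    shrink = conv_weights(1, d, s)           # 1x1 conv reduces d -> s channels
    mapping = m * conv_weights(k_map, s, s)  # m small-filter layers at width s
    expand = conv_weights(1, s, d)           # 1x1 conv restores s -> d channels
    return shrink + mapping + expand


def direct_weights(d, kernel=5):
    """Weights in one wide mapping layer applied at full width d."""
    return conv_weights(kernel, d, d)


# Assumed settings, chosen only for illustration.
d, s, m = 56, 12, 4
hg = hourglass_weights(d, s, m)
direct = direct_weights(d)
print(f"hourglass: {hg} weights, direct: {direct} weights, "
      f"ratio: {direct / hg:.1f}x")
```

Under these assumptions the hourglass stack uses far fewer weights than a single wide mapping layer, even though it is deeper. The overall speedup reported in the abstract also depends on operating at low resolution before the final deconvolution, which this count does not model.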

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Deep Residual Learning for Image Compression
eess.IV 2019-06 unverdicted novelty 3.0

A learned image compression system using deep residual learning and sub-pixel convolution reaches 0.972 MS-SSIM at 0.15 bits per pixel in the CLIC validation phase.