Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

Damien Vincent; David Minnen; George Toderici; Joel Shor; Michele Covell; Nick Johnston; Saurabh Singh; Sung Jin Hwang; Troy Chinen

arxiv: 1703.10114 · v1 · pith:CTUY5BLQnew · submitted 2017-03-29 · 💻 cs.CV

Nick Johnston, Damien Vincent, David Minnen, Michele Covell, Saurabh Singh, Troy Chinen, Sung Jin Hwang, Joel Shor, George Toderici

classification 💻 cs.CV

keywords image · networks · recurrent · adaptive · compression · lossy · method · network

Abstract

We propose a method for lossy image compression based on recurrent, convolutional neural networks that outperforms BPG (4:2:0), WebP, JPEG2000, and JPEG as measured by MS-SSIM. We introduce three improvements over previous research that lead to this state-of-the-art result. First, we show that training with a pixel-wise loss weighted by SSIM increases reconstruction quality according to several metrics. Second, we modify the recurrent architecture to improve spatial diffusion, which allows the network to more effectively capture and propagate image information through the network's hidden state. Finally, in addition to lossless entropy coding, we use a spatially adaptive bit allocation algorithm to more efficiently use the limited number of bits to encode visually complex image regions. We evaluate our method on the Kodak and Tecnick image sets and compare against standard codecs as well as recently published methods based on deep neural networks.
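To make the first improvement concrete, here is a minimal sketch of a pixel-wise L1 loss weighted by local SSIM. The abstract does not give the formula, so everything specific here is an assumption made for illustration: the non-overlapping 8x8 patches, the `(1 - SSIM) / 2` dissimilarity, the normalization of weights to mean 1, and the function names `patch_ssim` and `ssim_weighted_l1`. It operates on grayscale NumPy arrays in [0, 1] and is not the authors' training code.

```python
import numpy as np


def patch_ssim(x, y, patch=8, c1=0.01 ** 2, c2=0.03 ** 2):
    """SSIM per non-overlapping patch for grayscale images in [0, 1].

    Assumption: plain (unwindowed) patch statistics; the standard SSIM
    uses a sliding Gaussian window instead.
    """
    hp, wp = x.shape[0] // patch, x.shape[1] // patch

    def blocks(a):
        # Crop to whole patches, then reshape to (hp, wp, patch * patch).
        a = a[: hp * patch, : wp * patch].reshape(hp, patch, wp, patch)
        return a.transpose(0, 2, 1, 3).reshape(hp, wp, -1)

    bx, by = blocks(x), blocks(y)
    mx, my = bx.mean(-1), by.mean(-1)
    vx, vy = bx.var(-1), by.var(-1)
    cov = ((bx - mx[..., None]) * (by - my[..., None])).mean(-1)
    num = (2 * mx * my + c1) * (2 * cov + c2)
    den = (mx ** 2 + my ** 2 + c1) * (vx + vy + c2)
    return num / den


def ssim_weighted_l1(x, y, patch=8):
    """Mean absolute error where each patch is weighted by its dissimilarity.

    Assumption: weights are (1 - SSIM) / 2, normalized to mean 1, so patches
    that are structurally worse contribute more to the loss.
    """
    s = patch_ssim(x, y, patch)
    dssim = np.clip((1.0 - s) / 2.0, 0.0, 1.0)
    mean_d = dssim.mean()
    # Fall back to uniform weights when the images are (nearly) identical.
    weights = dssim / mean_d if mean_d > 1e-12 else np.ones_like(dssim)
    hp, wp = s.shape
    err = np.abs(x - y)[: hp * patch, : wp * patch]
    # Expand each per-patch weight to cover its patch of pixels.
    weight_map = np.kron(weights, np.ones((patch, patch)))
    return float((weight_map * err).mean())
```

Calling `ssim_weighted_l1(original, reconstruction)` returns a scalar that could stand in for a plain L1 loss. Compared with unweighted L1, errors in patches whose structure is poorly reconstructed are amplified, while patches that already match well are down-weighted.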

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Image and Video Compression through Spatial-Temporal Energy Compaction
eess.IV 2019-06 unverdicted novelty 5.0

Learned image and video compression via autoencoders with spatial-temporal energy compaction penalties outperforms standards on MS-SSIM and visual quality.

Deep Residual Learning for Image Compression
eess.IV 2019-06 unverdicted novelty 3.0

A learned image compression system using deep residual learning and sub-pixel convolution reaches 0.972 MS-SSIM at 0.15 bits per pixel in the CLIC validation phase.