pith. sign in

arxiv: 1806.01496 · v1 · pith:ACDOOP2Anew · submitted 2018-06-05 · 📡 eess.IV · cs.CV

Deep Image Compression via End-to-End Learning

classification 📡 eess.IV cs.CV
keywords lossratecnnscompressiondeepimagelearningmethod
0
0 comments X
read the original abstract

We present a lossy image compression method based on deep convolutional neural networks (CNNs), which outperforms the existing BPG, WebP, JPEG2000 and JPEG as measured via multi-scale structural similarity (MS-SSIM), at the same bit rate. Currently, most of the CNNs based approaches train the network using a L2 loss between the reconstructions and the ground-truths in the pixel domain, which leads to over-smoothing results and visual quality degradation especially at a very low bit rate. Therefore, we improve the subjective quality with the combination of a perception loss and an adversarial loss additionally. To achieve better rate-distortion optimization (RDO), we also introduce an easy-to-hard transfer learning when adding quantization error and rate constraint. Finally, we evaluate our method on public Kodak and the Test Dataset P/M released by the Computer Vision Lab of ETH Zurich, resulting in averaged 7.81% and 19.1% BD-rate reduction over BPG, respectively.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Deep Image Compression Framework for Face Recognition

    cs.CV 2019-07 unverdicted novelty 6.0

    A deep convolutional autoencoder compression framework jointly optimized with face recognition achieves higher verification accuracy on LFW images than JPEG2000 or JPEG.