Deep Residual Learning for Image Compression

Heming Sun; Jiro Katto; Masaru Takeuchi; Zhengxue Cheng

arxiv: 1906.09731 · v1 · pith:HOU55SWKnew · submitted 2019-06-24 · 📡 eess.IV

Deep Residual Learning for Image Compression

Zhengxue Cheng , Heming Sun , Masaru Takeuchi , Jiro Katto This is my paper

Pith reviewed 2026-05-25 17:27 UTC · model grok-4.3

classification 📡 eess.IV

keywords learned image compressiondeep residual learningsub-pixel convolutionMS-SSIMrate constraintimage reconstructionneural compressionencoder-decoder

0 comments

The pith

Deep residual learning with sub-pixel convolution reaches 0.972 MS-SSIM at 0.15 bpp in learned image compression.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents an approach for the learned image compression challenge consisting of deep residual networks in the compression model and sub-pixel convolution for upsampling. It reports that three variants of this approach satisfy a fixed rate constraint while scoring 0.972 on the MS-SSIM metric during validation. The work focuses on showing that these architectural choices produce competitive reconstruction quality under the bitrate limit with moderate computational cost. A sympathetic reader would see the result as evidence that residual blocks can stabilize training of deep compression networks at low rates.

Core claim

The central claim is that deep residual learning for image compression combined with sub-pixel convolution as up-sampling operations produces models that achieve an MS-SSIM of 0.972 at the 0.15 bpp rate constraint with moderate complexity on the challenge validation set.

What carries the argument

Deep residual learning blocks inside the encoder-decoder pipeline together with sub-pixel convolution layers that restore spatial resolution.

If this is right

Residual connections permit deeper compression networks to be trained without loss of gradient flow at low bitrates.
Sub-pixel convolution supplies an efficient learned upsampling step that contributes to the observed reconstruction quality.
The three named variants demonstrate that the same residual-plus-subpixel backbone can be tuned for different emphasis while still meeting the rate target.
Moderate complexity at the reported performance level indicates the architecture is usable within the computational limits of the challenge.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same residual structure could be tested on video sequences by adding temporal prediction layers to check whether the compression gain carries over.
If the validation result holds, replacing standard upsampling blocks with sub-pixel convolution might become a default choice in other learned compression pipelines.
Performance at one fixed bitrate leaves open whether the same network family can be rate-controlled across a wider range without retraining.

Load-bearing premise

The MS-SSIM score measured on the challenge validation set at the fixed 0.15 bpp constraint will translate to meaningful perceptual gains or competitive performance on other datasets and real-world usage conditions.

What would settle it

Running the trained models on a held-out test set drawn from a different image distribution and checking whether MS-SSIM stays near 0.972 or falls sharply at the same bitrate.

Figures

Figures reproduced from arXiv: 1906.09731 by Heming Sun, Jiro Katto, Masaru Takeuchi, Zhengxue Cheng.

**Figure 3.** Figure 3: The network structure of one residual unit. [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

**Figure 2.** Figure 2: Network structure of proposed deep residual [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

read the original abstract

In this paper, we provide a detailed description on our approach designed for CVPR 2019 Workshop and Challenge on Learned Image Compression (CLIC). Our approach mainly consists of two proposals, i.e. deep residual learning for image compression and sub-pixel convolution as up-sampling operations. Experimental results have indicated that our approaches, Kattolab, Kattolabv2 and KattolabSSIM, achieve 0.972 in MS-SSIM at the rate constraint of 0.15bpp with moderate complexity during the validation phase.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a narrow CLIC workshop report that applies known residual blocks and sub-pixel convolution to hit 0.972 MS-SSIM at 0.15 bpp on the validation set.

read the letter

The paper is a workshop challenge submission. It describes two components already published elsewhere—residual learning blocks and sub-pixel convolution as upsampling—and applies them to the CLIC task. The main reported outcome is the 0.972 MS-SSIM score at the fixed 0.15 bpp constraint with moderate complexity for their three variants. That is the entire central claim. No new primitive or derivation is presented. The work is useful mainly as a record of what produced a competitive entry in that specific challenge. The description of how the blocks were wired together is straightforward and could save someone time if they are building their own CLIC submission. The moderate-complexity note is also practical information for people who care about deployment constraints. Beyond that, the paper supplies no training schedule, loss details, ablation results, or side-by-side numbers against other entries, so the reader cannot judge why the score was reached or whether the same choices would transfer. The assumption that the validation-set number will hold up elsewhere is left untested in the text we have. This kind of short challenge report is mainly for people already tracking the CLIC leaderboard or looking for quick implementation pointers. It does not contain enough new material or supporting evidence to justify sending it to a journal referee. A workshop proceedings is the appropriate home.

Referee Report

1 major / 1 minor

Summary. The paper describes an approach submitted to the CVPR 2019 CLIC challenge on learned image compression. The method combines deep residual learning for compression with sub-pixel convolution for up-sampling. The central claim is that the three variants (Kattolab, Kattolabv2, KattolabSSIM) achieve an MS-SSIM of 0.972 at the fixed rate constraint of 0.15 bpp on the challenge validation set with moderate complexity.

Significance. The reported MS-SSIM value on the CLIC validation set at 0.15 bpp constitutes a narrow empirical fact about a competition entry. If the underlying implementation details were supplied and shown to be reproducible, the work could serve as a concrete data point on the utility of residual blocks and sub-pixel upsampling in rate-constrained learned codecs; however, the current manuscript supplies none of the supporting evidence required to evaluate that utility.

major comments (1)

[Abstract] Abstract: the sole quantitative claim (MS-SSIM = 0.972 at 0.15 bpp) is presented without any description of the training loss, optimizer, dataset, rate-control mechanism, or baseline comparisons. Because this numeric result is the only load-bearing assertion in the manuscript, the absence of these details prevents verification of the claim.

minor comments (1)

The manuscript is written as a short workshop report; any journal version would need an explicit methods section, ablation tables, and rate-distortion curves to meet archival standards.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We appreciate the referee's feedback on the abstract. We address the major comment below.

read point-by-point responses

Referee: [Abstract] Abstract: the sole quantitative claim (MS-SSIM = 0.972 at 0.15 bpp) is presented without any description of the training loss, optimizer, dataset, rate-control mechanism, or baseline comparisons. Because this numeric result is the only load-bearing assertion in the manuscript, the absence of these details prevents verification of the claim.

Authors: The manuscript consists of a concise abstract for a CLIC challenge submission. We agree that it does not supply descriptions of the training loss, optimizer, dataset, rate-control mechanism, or baseline comparisons. To enable verification of the reported MS-SSIM result, we will revise the manuscript to include these details. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper is an empirical competition submission that describes a neural network architecture (deep residual learning plus sub-pixel convolution) and reports a measured MS-SSIM value of 0.972 on the CLIC validation set at the fixed 0.15 bpp constraint. No derivation chain, equations, fitted parameters, or uniqueness theorems are present; the central claim is a direct factual report of an experimental outcome with no reduction to its own inputs by construction. Self-citation is absent and the result is externally verifiable on the challenge benchmark.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The claim rests on a trained neural network whose weights are free parameters fitted to image data, plus standard assumptions of supervised deep learning; no new entities are postulated.

free parameters (1)

neural network weights
All parameters of the residual encoder-decoder network are learned from data during training.

axioms (1)

domain assumption Standard supervised training via back-propagation and gradient descent converges to a useful compression model
Implicit in any deep-learning compression paper; required for the reported performance to be attainable.

pith-pipeline@v0.9.0 · 5611 in / 1221 out tokens · 34992 ms · 2026-05-25T17:27:37.670925+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Our approach mainly consists of two proposals, i.e. deep residual learning for image compression and sub-pixel convolution as up-sampling operations... achieve 0.972 in MS-SSIM at the rate constraint of 0.15bpp
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

J = λ d(x, ˆx) + R(ˆy)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

27 extracted references · 27 canonical work pages · 8 internal anchors

[1]

The JPEG still picture compression stan- dard

G. K Wallace, “The JPEG still picture compression stan- dard”, IEEE Trans. on Consumer Electronics, vol. 38, no. 1, pp. 43-59, Feb. 1991

work page 1991
[2]

An overview of the JPEG2000 still image compression standard

Majid Rabbani, Rajan Joshi, “An overview of the JPEG2000 still image compression standard” , ELSEVIER Signal Pro- cessing: Image Communication, vol. 17, no, 1, pp. 3-48, Jan. 2002

work page 2002
[3]

Overview of the High Efﬁciency Video Coding (HEVC) Standard

G. J. Sullivan, J. Ohm, W. Han and T. Wiegand, “Overview of the High Efﬁciency Video Coding (HEVC) Standard” , IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1649-1668, Dec. 2012

work page 2012
[4]

Real Time Adaptive Image Com- pression

Ripple Oren, L. Bourdev, “Real Time Adaptive Image Com- pression”, Proc. of Machine Learning Research, V ol. 70, pp. 2922-2930, 2017

work page 2017
[5]

Generative Compres- sion

S. Santurkar, D. Budden, N. Shavit, “Generative Compres- sion”, Picture Coding Symposium, June 24-27, 2018

work page 2018
[6]

Generative Adversarial Networks for Extreme Learned Image Compression

E. Agustsson, M. Tschannen, F. Mentzer, R. Timofte, and L. V . Gool, “Generative Adversarial Networks for Extreme Learned Image Compression”, arXiv:1804.02958

work page arXiv
[7]

Variable Rate Image Compression with Recurrent Neural Networks

G. Toderici, S. M.O’Malley, S. J. Hwang, et al.,“Variable rate image compression with recurrent neural networks” , arXiv: 1511.06085, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[8]

Full Resolution Image Compression with Recurrent Neural Networks

G, Toderici, D. Vincent, N. Johnson, et al., “Full Resolution Image Compression with Recurrent Neural Networks”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 1-9, July 21-26, 2017

work page 2017
[9]

Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

Nick Johnson, Damien Vincent, David Minnen, et al., “Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks” , arXiv:1703.10114, pp. 1-9, March 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[10]

Lossy Image Compression with Compressive Au- toencoders

Lucas Theis, Wenzhe Shi, Andrew Cunninghan and Ferenc Huszar, “Lossy Image Compression with Compressive Au- toencoders”, Intl. Conf. on Learning Representations (ICLR), pp. 1-19, April 24-26, 2017

work page 2017
[11]

End-to-End Optimized Image Compression

J. Balle, Valero Laparra, Eero P. Simoncelli, “End-to-End Optimized Image Compression”, Intl. Conf. on Learning Rep- resentations (ICLR), pp. 1-27, April 24-26, 2017

work page 2017
[12]

Variational Image Compression with a Hyperprior

J. Balle, D. Minnen, S. Singh, S. J. Hwang, N. Johnston, “Variational Image Compression with a Hyperprior” , Intl. Conf. on Learning Representations (ICLR), pp. 1-23, 2018

work page 2018
[13]

Efﬁcient Nonlinear Transforms for Lossy Image Compression

J. Balle, “Efﬁcient Nonlinear Transforms for Lossy Image Compression”, Picture Coding Symposium, 2018

work page 2018
[14]

Joint Autoregressive and Hierarchical Priors for Learned Image Compression

D. Minnen, J. Balle, G. Toderici, “Joint Autoregressive and Hierarchical Priors for Learned Image Compression” , arXiv.1809.02736

work page internal anchor Pith review Pith/arXiv arXiv
[15]

Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations

E. Agustsson, F. Mentzer, M. Tschannen, L. Cavigelli, R. Timofte, L. Benini, L. V . Gool, “Soft-to-Hard Vector Quan- tization for End-to-End Learning Compressible Representa- tions”, Neural Information Processing Systems (NIPS) 2017, arXiv:1704.00648v2

work page internal anchor Pith review Pith/arXiv arXiv 2017
[16]

Conditional Probability Models for Deep Image Compression

F. Mentzer, E. Agustsson, M. Tschannen, R. Timofte, L. V . Gool, “Conditional Probability Models for Deep Image Compression”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 17-22, 2018

work page 2018
[17]

Deep Convolu- tional AutoEncoder-based Lossy Image Compression

Z. Cheng, H. Sun, M. Takeuchi, J. Katto, “Deep Convolu- tional AutoEncoder-based Lossy Image Compression” , Pic- ture Coding Symposium, pp. 1-5, June 24-27, 2018

work page 2018
[18]

Performance Comparison of Convolutional AutoEncoders, Generative Ad- versarial Networks and Super-Resolution for Image Compres- sion

Z. Cheng, H. Sun, M. Takeuchi, J. Katto, “Performance Comparison of Convolutional AutoEncoders, Generative Ad- versarial Networks and Super-Resolution for Image Compres- sion”, CVPR Workshop and Challenge on Learned Image Compression (CLIC), pp. 1-4, June 17-22, 2018

work page 2018
[19]

Learning Con- volutional Networks for Content-weighted Image Compres- sion

M. Li, W. Zuo, S. Gu, D. Zhao, D. Zhang, “Learning Con- volutional Networks for Content-weighted Image Compres- sion”, IEEE Conf. on Computer Vision and Pattern Recog- nition (CVPR), June 17-22, 2018

work page 2018
[20]

Deep Residual Learning for Image Recognition

K. He, X. Zhang, S. Ren and J. Sun, “Deep Residual Learn- ing for Image Recognition”, arXiv.1512.03385, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[21]

Image Super-Resolution Using Deep Convolutional Networks

C. Dong, C. C. Loy, K. He, X. tang,“Image Super-resolution using Deep Convolutional Networks”, arXiv.1501.00092

work page internal anchor Pith review Pith/arXiv arXiv
[22]

Accelerating the Super-Resolution Convolutional Neural Network

C. Dong, C. C. Loy, X. tang, “Accelerating the Super-Resolution Convolutional Neural Network” . arXiv.1608.00367

work page internal anchor Pith review Pith/arXiv arXiv
[23]

Real-time single image and video super-resolution using an efﬁcient sub-pixel convo- lutional neural network

W. Shi, J. Caballero, F. Huszar, et al.“Real-time single image and video super-resolution using an efﬁcient sub-pixel convo- lutional neural network”, Intl. IEEE Conf. on Computer Vi- sion and Pattern Recognition, June 26-July 1, 2016

work page 2016
[24]

ImageNet: A Large-Scale Hierarchical Image Database

J. Deng, W. Dong, R. Socher, L. Li, K. Li and L. Fei-Fei, “ImageNet: A Large-Scale Hierarchical Image Database” , IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1-8, June 20-25, 2009

work page 2009
[25]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization”, arXiv:1412.6980, pp.1-15, Dec. 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[26]

Kodak Lossless True Color Image Suite, Download from http://r0k.us/graphics/kodak/

work page
[27]

Workshop and Challenge on Learned Image Compression, CVPR2019, http://www.compression.cc/

work page

[1] [1]

The JPEG still picture compression stan- dard

G. K Wallace, “The JPEG still picture compression stan- dard”, IEEE Trans. on Consumer Electronics, vol. 38, no. 1, pp. 43-59, Feb. 1991

work page 1991

[2] [2]

An overview of the JPEG2000 still image compression standard

Majid Rabbani, Rajan Joshi, “An overview of the JPEG2000 still image compression standard” , ELSEVIER Signal Pro- cessing: Image Communication, vol. 17, no, 1, pp. 3-48, Jan. 2002

work page 2002

[3] [3]

Overview of the High Efﬁciency Video Coding (HEVC) Standard

G. J. Sullivan, J. Ohm, W. Han and T. Wiegand, “Overview of the High Efﬁciency Video Coding (HEVC) Standard” , IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, no. 12, pp. 1649-1668, Dec. 2012

work page 2012

[4] [4]

Real Time Adaptive Image Com- pression

Ripple Oren, L. Bourdev, “Real Time Adaptive Image Com- pression”, Proc. of Machine Learning Research, V ol. 70, pp. 2922-2930, 2017

work page 2017

[5] [5]

Generative Compres- sion

S. Santurkar, D. Budden, N. Shavit, “Generative Compres- sion”, Picture Coding Symposium, June 24-27, 2018

work page 2018

[6] [6]

Generative Adversarial Networks for Extreme Learned Image Compression

E. Agustsson, M. Tschannen, F. Mentzer, R. Timofte, and L. V . Gool, “Generative Adversarial Networks for Extreme Learned Image Compression”, arXiv:1804.02958

work page arXiv

[7] [7]

Variable Rate Image Compression with Recurrent Neural Networks

G. Toderici, S. M.O’Malley, S. J. Hwang, et al.,“Variable rate image compression with recurrent neural networks” , arXiv: 1511.06085, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[8] [8]

Full Resolution Image Compression with Recurrent Neural Networks

G, Toderici, D. Vincent, N. Johnson, et al., “Full Resolution Image Compression with Recurrent Neural Networks”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 1-9, July 21-26, 2017

work page 2017

[9] [9]

Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

Nick Johnson, Damien Vincent, David Minnen, et al., “Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks” , arXiv:1703.10114, pp. 1-9, March 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[10] [10]

Lossy Image Compression with Compressive Au- toencoders

Lucas Theis, Wenzhe Shi, Andrew Cunninghan and Ferenc Huszar, “Lossy Image Compression with Compressive Au- toencoders”, Intl. Conf. on Learning Representations (ICLR), pp. 1-19, April 24-26, 2017

work page 2017

[11] [11]

End-to-End Optimized Image Compression

J. Balle, Valero Laparra, Eero P. Simoncelli, “End-to-End Optimized Image Compression”, Intl. Conf. on Learning Rep- resentations (ICLR), pp. 1-27, April 24-26, 2017

work page 2017

[12] [12]

Variational Image Compression with a Hyperprior

J. Balle, D. Minnen, S. Singh, S. J. Hwang, N. Johnston, “Variational Image Compression with a Hyperprior” , Intl. Conf. on Learning Representations (ICLR), pp. 1-23, 2018

work page 2018

[13] [13]

Efﬁcient Nonlinear Transforms for Lossy Image Compression

J. Balle, “Efﬁcient Nonlinear Transforms for Lossy Image Compression”, Picture Coding Symposium, 2018

work page 2018

[14] [14]

Joint Autoregressive and Hierarchical Priors for Learned Image Compression

D. Minnen, J. Balle, G. Toderici, “Joint Autoregressive and Hierarchical Priors for Learned Image Compression” , arXiv.1809.02736

work page internal anchor Pith review Pith/arXiv arXiv

[15] [15]

Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations

E. Agustsson, F. Mentzer, M. Tschannen, L. Cavigelli, R. Timofte, L. Benini, L. V . Gool, “Soft-to-Hard Vector Quan- tization for End-to-End Learning Compressible Representa- tions”, Neural Information Processing Systems (NIPS) 2017, arXiv:1704.00648v2

work page internal anchor Pith review Pith/arXiv arXiv 2017

[16] [16]

Conditional Probability Models for Deep Image Compression

F. Mentzer, E. Agustsson, M. Tschannen, R. Timofte, L. V . Gool, “Conditional Probability Models for Deep Image Compression”, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 17-22, 2018

work page 2018

[17] [17]

Deep Convolu- tional AutoEncoder-based Lossy Image Compression

Z. Cheng, H. Sun, M. Takeuchi, J. Katto, “Deep Convolu- tional AutoEncoder-based Lossy Image Compression” , Pic- ture Coding Symposium, pp. 1-5, June 24-27, 2018

work page 2018

[18] [18]

Performance Comparison of Convolutional AutoEncoders, Generative Ad- versarial Networks and Super-Resolution for Image Compres- sion

Z. Cheng, H. Sun, M. Takeuchi, J. Katto, “Performance Comparison of Convolutional AutoEncoders, Generative Ad- versarial Networks and Super-Resolution for Image Compres- sion”, CVPR Workshop and Challenge on Learned Image Compression (CLIC), pp. 1-4, June 17-22, 2018

work page 2018

[19] [19]

Learning Con- volutional Networks for Content-weighted Image Compres- sion

M. Li, W. Zuo, S. Gu, D. Zhao, D. Zhang, “Learning Con- volutional Networks for Content-weighted Image Compres- sion”, IEEE Conf. on Computer Vision and Pattern Recog- nition (CVPR), June 17-22, 2018

work page 2018

[20] [20]

Deep Residual Learning for Image Recognition

K. He, X. Zhang, S. Ren and J. Sun, “Deep Residual Learn- ing for Image Recognition”, arXiv.1512.03385, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[21] [21]

Image Super-Resolution Using Deep Convolutional Networks

C. Dong, C. C. Loy, K. He, X. tang,“Image Super-resolution using Deep Convolutional Networks”, arXiv.1501.00092

work page internal anchor Pith review Pith/arXiv arXiv

[22] [22]

Accelerating the Super-Resolution Convolutional Neural Network

C. Dong, C. C. Loy, X. tang, “Accelerating the Super-Resolution Convolutional Neural Network” . arXiv.1608.00367

work page internal anchor Pith review Pith/arXiv arXiv

[23] [23]

Real-time single image and video super-resolution using an efﬁcient sub-pixel convo- lutional neural network

W. Shi, J. Caballero, F. Huszar, et al.“Real-time single image and video super-resolution using an efﬁcient sub-pixel convo- lutional neural network”, Intl. IEEE Conf. on Computer Vi- sion and Pattern Recognition, June 26-July 1, 2016

work page 2016

[24] [24]

ImageNet: A Large-Scale Hierarchical Image Database

J. Deng, W. Dong, R. Socher, L. Li, K. Li and L. Fei-Fei, “ImageNet: A Large-Scale Hierarchical Image Database” , IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1-8, June 20-25, 2009

work page 2009

[25] [25]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization”, arXiv:1412.6980, pp.1-15, Dec. 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[26] [26]

Kodak Lossless True Color Image Suite, Download from http://r0k.us/graphics/kodak/

work page

[27] [27]

Workshop and Challenge on Learned Image Compression, CVPR2019, http://www.compression.cc/

work page