Soft-to-hard vector quantization for end-to-end learned compression of images and neural networks

Eirikur Agustsson, Fabian Mentzer, Michael Tschannen, Lukas Cavigelli, Radu Timofte, Luca Benini, Luc Van Gool · 2017 · cs.LG · arXiv 1704.00648

3 Pith papers cite this work. Polarity classification is still indexing.

abstract

We present a new approach to learn compressible representations in deep architectures with an end-to-end training strategy. Our method is based on a soft (continuous) relaxation of quantization and entropy, which we anneal to their discrete counterparts throughout training. We showcase this method for two challenging applications: Image compression and neural network compression. While these tasks have typically been approached with different methods, our soft-to-hard quantization approach gives results competitive with the state-of-the-art for both.
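
The annealing idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation, and it rests on assumptions the abstract does not state: that the soft relaxation is a softmax over negative squared distances to a set of centers, and that entropy is estimated from the average soft assignment. The names soft_quantize, hard_quantize, soft_entropy, centers, and sigma are hypothetical and chosen only for this illustration.

```python
import numpy as np

def soft_quantize(z, centers, sigma):
    """Soft-assign each row of z to the centers (assumed softmax relaxation).

    Returns the assignment weights (n, L) and the soft-quantized points (n, d).
    """
    # Squared Euclidean distance from every point to every center: shape (n, L).
    d2 = ((z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
    logits = -sigma * d2
    # Subtract the row maximum so the exponentials cannot overflow.
    logits = logits - logits.max(axis=1, keepdims=True)
    w = np.exp(logits)
    w = w / w.sum(axis=1, keepdims=True)
    return w, w @ centers

def hard_quantize(z, centers):
    """Discrete counterpart: snap each row of z to its nearest center."""
    d2 = ((z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
    idx = d2.argmin(axis=1)
    return idx, centers[idx]

def soft_entropy(w):
    """Entropy in bits of the mean soft assignment (an assumed rate proxy)."""
    p = w.mean(axis=0)
    return float(-(p * np.log2(p + 1e-12)).sum())
```

Under these assumptions, sigma = 0 spreads every point evenly over all centers, while a large sigma makes the soft output approach the nearest-center (hard) output. Raising sigma gradually during training would play the role of the annealing described above.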

citation-role summary

background 1

citation-polarity summary

unclear 1

representative citing papers

An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion

cs.CV · 2022-08-02 · unverdicted · novelty 8.0

Textual Inversion learns a single embedding vector from a few images to represent personal concepts inside the text embedding space of a frozen text-to-image model, enabling their composition in natural language prompts.

Learning Image and Video Compression through Spatial-Temporal Energy Compaction

eess.IV · 2019-06-24 · unverdicted · novelty 5.0

Learned image and video compression via autoencoders with spatial-temporal energy compaction penalties outperforms standards on MS-SSIM and visual quality.

Deep Residual Learning for Image Compression

eess.IV · 2019-06-24 · unverdicted · novelty 3.0

A learned image compression system using deep residual learning and sub-pixel convolution reaches 0.972 MS-SSIM at 0.15 bits per pixel in the CLIC validation phase.

citing papers explorer

Showing 3 of 3 citing papers.

An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion

cs.CV · 2022-08-02 · unverdicted · none · ref 2

Learning Image and Video Compression through Spatial-Temporal Energy Compaction

eess.IV · 2019-06-24 · unverdicted · none · ref 14 · internal anchor

Deep Residual Learning for Image Compression

eess.IV · 2019-06-24 · unverdicted · none · ref 15 · internal anchor
