Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
read the original abstract
We present a new approach to learn compressible representations in deep architectures with an end-to-end training strategy. Our method is based on a soft (continuous) relaxation of quantization and entropy, which we anneal to their discrete counterparts throughout training. We showcase this method for two challenging applications: Image compression and neural network compression. While these tasks have typically been approached with different methods, our soft-to-hard quantization approach gives results competitive with the state-of-the-art for both.
This paper has not been read by Pith yet.
Forward citations
Cited by 3 Pith papers
-
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Textual Inversion learns a single embedding vector from a few images to represent personal concepts inside the text embedding space of a frozen text-to-image model, enabling their composition in natural language prompts.
-
Learning Image and Video Compression through Spatial-Temporal Energy Compaction
Learned image and video compression via autoencoders with spatial-temporal energy compaction penalties outperforms standards on MS-SSIM and visual quality.
-
Deep Residual Learning for Image Compression
A learned image compression system using deep residual learning and sub-pixel convolution reaches 0.972 MS-SSIM at 0.15 bits per pixel in the CLIC validation phase.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.