
arxiv: 1807.10029 · v1 · pith:HWTV5KO7new · submitted 2018-07-26 · 💻 cs.CV · cs.AI

LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks

classification 💻 cs.CV cs.AI
keywords quantization network quantizers accuracy deep lq-nets method model

Although weight and activation quantization is an effective approach for Deep Neural Network (DNN) compression and has great potential to increase inference speed by leveraging bit operations, there is still a noticeable gap in prediction accuracy between the quantized model and the full-precision model. To address this gap, we propose to jointly train a quantized, bit-operation-compatible DNN and its associated quantizers, as opposed to using fixed, handcrafted quantization schemes such as uniform or logarithmic quantization. Our method for learning the quantizers applies to both network weights and activations with arbitrary-bit precision, and our quantizers are easy to train. Comprehensive experiments on the CIFAR-10 and ImageNet datasets show that our method works consistently well for various network structures such as AlexNet, VGG-Net, GoogLeNet, ResNet, and DenseNet, surpassing previous quantization methods in accuracy by an appreciable margin. Code is available at https://github.com/Microsoft/LQ-Nets
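The contrast the abstract draws between a fixed uniform quantizer and a quantizer fitted to the data can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the repository linked above for that). It assumes, for illustration only, that each quantization level is a sum of K basis values with signs in {-1, +1}, which is one way to keep a quantizer expressible with bit codes. The function names `levels_from_basis`, `quantize`, and `fit_basis` are hypothetical, and the alternating fit shown here is a generic least-squares procedure rather than the training scheme from the paper.

```python
import itertools
import numpy as np

def levels_from_basis(basis):
    """Return all 2^K sign codes and the levels sum_i basis[i] * code[i]."""
    codes = np.array(list(itertools.product([-1.0, 1.0], repeat=len(basis))))
    return codes, codes @ np.asarray(basis, dtype=float)

def quantize(x, basis):
    """Map each entry of 1-D array x to its nearest level; return values and codes."""
    codes, levels = levels_from_basis(basis)
    idx = np.abs(x[:, None] - levels[None, :]).argmin(axis=1)
    return levels[idx], codes[idx]

def fit_basis(x, basis, steps=10):
    """Alternate nearest-level assignment with a least-squares basis update.

    Assumption: a generic alternating minimization of the squared
    quantization error, used here only to illustrate a data-fitted quantizer.
    """
    basis = np.asarray(basis, dtype=float)
    for _ in range(steps):
        _, codes = quantize(x, basis)                      # fix basis, pick codes
        basis, *_ = np.linalg.lstsq(codes, x, rcond=None)  # fix codes, refit basis
    return basis

rng = np.random.default_rng(0)
x = rng.normal(size=4096)          # stand-in for a layer's weights (synthetic data)

uniform = np.array([0.5, 1.0])     # 2-bit uniform grid: -1.5, -0.5, 0.5, 1.5
learned = fit_basis(x, uniform)    # start from the uniform grid and adapt it

err_uniform = np.mean((x - quantize(x, uniform)[0]) ** 2)
err_learned = np.mean((x - quantize(x, learned)[0]) ** 2)
print(f"uniform MSE: {err_uniform:.4f}  fitted MSE: {err_learned:.4f}")
```

Because each step of the alternation minimizes the squared error with the other quantity held fixed, the fitted basis should not do worse than the uniform starting point on the data it was fitted to. How much it helps depends on how far the data distribution is from what the fixed grid suits.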



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Evolutionary fine tuning of quantized convolution-based deep learning models

    cs.LG 2026-04 unverdicted novelty 5.0

    Evolutionary fine-tuning of select weights in pre-quantized convolutional networks improves accuracy over standard rounding for VGG, ResNet, and autoencoder models.