A Survey on Methods and Theories of Quantized Neural Networks

arxiv: 1808.04752 · v2 · pith:S7X3J4PL · submitted 2018-08-13 · cs.LG · cs.NE · stat.ML


Yunhui Guo


keywords: neural, networks, quantized, deep, devices, memory, methods, models


Abstract

Deep neural networks are the state-of-the-art methods for many real-world tasks, such as computer vision, natural language processing, and speech recognition. For all their popularity, deep neural networks are also criticized for consuming large amounts of memory and draining the battery life of devices during training and inference. This makes it hard to deploy these models on mobile or embedded devices, which have tight resource constraints. Quantization is recognized as one of the most effective approaches to meeting the extreme memory requirements of deep neural network models. Instead of using the 32-bit floating-point format to represent weights, quantized representations store weights in more compact formats such as integers or even binary numbers. Despite a possible degradation in predictive performance, quantization offers a potential way to greatly reduce model size and energy consumption. In this survey, we give a thorough review of different aspects of quantized neural networks. Current challenges and trends of quantized neural networks are also discussed.
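
To make the idea of storing weights as integers or binary values concrete, here is a minimal sketch in plain Python. It assumes symmetric, per-tensor uniform quantization to 8-bit integers and a sign-based binarization with a mean-absolute-value scaling factor. These are illustrative choices, not necessarily the schemes the survey covers, and the function names (`quantize_int8`, `dequantize`, `binarize`) are hypothetical.

```python
def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using one shared scale (assumed scheme)."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the integers and the scale."""
    return [v * scale for v in q]


def binarize(weights):
    """1-bit weights: keep only the sign, plus one float scaling factor (assumed scheme)."""
    alpha = sum(abs(w) for w in weights) / len(weights)
    signs = [1 if w >= 0 else -1 for w in weights]
    return signs, alpha


weights = [0.4, -1.0, 0.1, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
signs, alpha = binarize(weights)
```

Under these assumptions each weight takes 8 bits (or 1 bit when binarized) instead of 32, at the cost of a rounding error of at most half the scale per weight in the 8-bit case.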


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Vanishing Contributions: A Unified Framework for Smooth and Iterative Model Compression
cs.LG 2025-10 unverdicted novelty 5.0

VCON is a unified framework for smooth iterative DNN compression that uses parallel execution and an affine combination to progressively replace the original model with its compressed form during fine-tuning.