Learning Accurate Low-Bit Deep Neural Networks with Stochastic Quantization

· 2017 · cs.CV · arXiv 1708.01001

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Low-bit deep neural networks (DNNs) become critical for embedded applications due to their low storage requirement and computing efficiency. However, they suffer much from the non-negligible accuracy drop. This paper proposes the stochastic quantization (SQ) algorithm for learning accurate low-bit DNNs. The motivation is due to the following observation. Existing training algorithms approximate the real-valued elements/filters with low-bit representation all together in each iteration. The quantization errors may be small for some elements/filters, while are remarkable for others, which lead to inappropriate gradient direction during training, and thus bring notable accuracy drop. Instead, SQ quantizes a portion of elements/filters to low-bit with a stochastic probability inversely proportional to the quantization error, while keeping the other portion unchanged with full-precision. The quantized and full-precision portions are updated with corresponding gradients separately in each iteration. The SQ ratio is gradually increased until the whole network is quantized. This procedure can greatly compensate the quantization error and thus yield better accuracy for low-bit DNNs. Experiments show that SQ can consistently and significantly improve the accuracy for different low-bit DNNs on various datasets and various network structures.

representative citing papers

Layerwise Progressive Freezing: A Training Scaffold for Depth-Scalable Binary Networks

cs.LG · 2026-06-26 · unverdicted · novelty 7.0

StoMPP progressively binarizes BNN layers layerwise from input to output via stochastic masks, delivering depth-scalable accuracy gains in a fully STE-free regime by controlling activation-induced gradient blockades.

citing papers explorer

Showing 1 of 1 citing paper.

Layerwise Progressive Freezing: A Training Scaffold for Depth-Scalable Binary Networks cs.LG · 2026-06-26 · unverdicted · none · ref 32 · internal anchor
StoMPP progressively binarizes BNN layers layerwise from input to output via stochastic masks, delivering depth-scalable accuracy gains in a fully STE-free regime by controlling activation-induced gradient blockades.

Learning Accurate Low-Bit Deep Neural Networks with Stochastic Quantization

fields

years

verdicts

representative citing papers

citing papers explorer