
arxiv: 1812.02375 · v1 · pith:KECLEECInew · submitted 2018-12-06 · 💻 cs.LG · cs.CV

DNQ: Dynamic Network Quantization

keywords quantization · bit-width · network · controller · dynamic · networks · neural · quantizer

Network quantization is an effective method for deploying neural networks on memory- and energy-constrained mobile devices. In this paper, we propose a Dynamic Network Quantization (DNQ) framework composed of two modules: a bit-width controller and a quantizer. Unlike most existing quantization methods, which use a single bit-width for the whole network, the bit-width controller uses policy gradient to train an agent that learns the bit-width of each layer. The controller can trade off accuracy against compression ratio. Given the resulting bit-width sequence, the quantizer uses the quantization distance as the criterion for weight importance during quantization. We extensively validate the proposed approach on various mainstream neural networks and obtain impressive results.
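To make the terms in the abstract concrete, the following is a minimal sketch of per-layer quantization under an already chosen bit-width sequence. It is not the paper's implementation: the symmetric uniform quantizer, the reading of "quantization distance" as the absolute gap between a weight and its quantized value, the 32-bit full-precision baseline, and all function names are assumptions made for illustration. The policy-gradient controller that would choose the bit-widths is not shown.

```python
def quantize_uniform(weights, bits):
    """Symmetric uniform quantization of a list of floats (assumed scheme)."""
    if bits < 2:
        raise ValueError("this sketch needs at least 2 bits")
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights)
    levels = 2 ** (bits - 1) - 1          # e.g. 127 for 8 bits
    scale = max_abs / levels              # step between adjacent grid points
    return [round(w / scale) * scale for w in weights]


def quantization_distance(weights, bits):
    """Per-weight distance |w - q(w)| (assumed definition)."""
    quantized = quantize_uniform(weights, bits)
    return [abs(w - q) for w, q in zip(weights, quantized)]


def compression_ratio(layer_sizes, bit_widths, full_precision_bits=32):
    """Ratio of full-precision storage to quantized storage, in bits."""
    original = sum(n * full_precision_bits for n in layer_sizes)
    compressed = sum(n * b for n, b in zip(layer_sizes, bit_widths))
    return original / compressed


# Hypothetical two-layer network with a different bit-width per layer.
layers = [[0.9, -0.45, 0.1, 0.0], [0.3, -0.2, 0.05, 0.6]]
bit_widths = [4, 2]

for weights, bits in zip(layers, bit_widths):
    distances = quantization_distance(weights, bits)
    # Rank weights by distance; how the ranking is used is method-specific.
    order = sorted(range(len(weights)), key=lambda i: distances[i])
    print(bits, order)

print(compression_ratio([len(w) for w in layers], bit_widths))
```

A controller in the spirit of the abstract would propose `bit_widths`, and a reward combining task accuracy with `compression_ratio` would drive its updates; the exact form of that reward is not given in the abstract.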
