pith. sign in

arxiv: 1802.08635 · v2 · pith:J4IPI4KVnew · submitted 2018-02-23 · 💻 cs.LG

Loss-aware Weight Quantization of Deep Networks

classification 💻 cs.LG
keywords quantizationweightnetworksaccuratedeeploss-awarenetworkproposed
0
0 comments X
read the original abstract

The huge size of deep networks hinders their use in small computing devices. In this paper, we consider compressing the network by weight quantization. We extend a recently proposed loss-aware weight binarization scheme to ternarization, with possibly different scaling parameters for the positive and negative weights, and m-bit (where m > 2) quantization. Experiments on feedforward and recurrent neural networks show that the proposed scheme outperforms state-of-the-art weight quantization algorithms, and is as accurate (or even more accurate) than the full-precision network.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.