Quantized Guided Pruning for Efficient Hardware Implementations of Convolutional Neural Networks

arxiv: 1812.11337 · v1 · pith:234ZUZQEnew · submitted 2018-12-29 · 💻 cs.LG · cs.CV· cs.NE

Quantized Guided Pruning for Efficient Hardware Implementations of Convolutional Neural Networks

Ghouthi Boukli Hacene (ELEC) , Vincent Gripon , Matthieu Arzel (ELEC) , Nicolas Farrugia (ELEC) , Yoshua Bengio (DIRO) This is my paper

classification 💻 cs.LG cs.CVcs.NE

keywords convolutionalhardwarearchitecturecnnscomplexitycomputationaldevicesefficient

0 comments p. Extension

pith:234ZUZQE Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{234ZUZQE}

Prints a linked pith:234ZUZQE badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Convolutional Neural Networks (CNNs) are state-of-the-art in numerous computer vision tasks such as object classification and detection. However, the large amount of parameters they contain leads to a high computational complexity and strongly limits their usability in budget-constrained devices such as embedded devices. In this paper, we propose a combination of a new pruning technique and a quantization scheme that effectively reduce the complexity and memory usage of convolutional layers of CNNs, and replace the complex convolutional operation by a low-cost multiplexer. We perform experiments on the CIFAR10, CIFAR100 and SVHN and show that the proposed method achieves almost state-of-the-art accuracy, while drastically reducing the computational and memory footprints. We also propose an efficient hardware architecture to accelerate CNN operations. The proposed hardware architecture is a pipeline and accommodates multiple layers working at the same time to speed up the inference process.

This paper has not been read by Pith yet.

Quantized Guided Pruning for Efficient Hardware Implementations of Convolutional Neural Networks

discussion (0)