Sparse Winograd Convolutional neural networks on small-scale systolic arrays

Benjamin Kuschner; Feng Shi; Haochen Li; Song-Chun Zhu; Yuhe Gao

REVIEW

Sparse Winograd Convolutional neural networks on small-scale systolic arrays

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1810.01973 v1 pith:77FB2DPW submitted 2018-10-03 cs.DC cs.AIcs.LG

Feng Shi , Haochen Li , Yuhe Gao , Benjamin Kuschner , Song-Chun Zhu This is my paper

classification cs.DCcs.AIcs.LG

keywords winogradarraysconvolutiondesignhighmemorysmall-scalesparse

verification ladder T0 review T1 audit T2 compute T3 formal

0 comments

read the original abstract

The reconfigurability, energy-efficiency, and massive parallelism on FPGAs make them one of the best choices for implementing efficient deep learning accelerators. However, state-of-art implementations seldom consider the balance between high throughput of computation power and the ability of the memory subsystem to support it. In this paper, we implement an accelerator on FPGA by combining the sparse Winograd convolution, clusters of small-scale systolic arrays, and a tailored memory layout design. We also provide an analytical model analysis for the general Winograd convolution algorithm as a design reference. Experimental results on VGG16 show that it achieves very high computational resource utilization, 20x ~ 30x energy efficiency, and more than 5x speedup compared with the dense implementation.

Discussion (0). Sign in to comment.

Pith tools