Structured Transforms for Small-Footprint Deep Learning

Sanjiv Kumar; Tara N. Sainath; Vikas Sindhwani

arxiv: 1510.01722 · v1 · pith:2VG6KNN4new · submitted 2015-10-06 · 📊 stat.ML · cs.CV · cs.LG

Vikas Sindhwani, Tara N. Sainath, Sanjiv Kumar

classification 📊 stat.ML cs.CV cs.LG

keywords structured transforms deep learning mobile parameter accelerate accuracy-compactness-speed

We consider the task of building compact deep learning pipelines suitable for deployment on storage- and power-constrained mobile devices. We propose a unified framework to learn a broad family of structured parameter matrices that are characterized by the notion of low displacement rank. Our structured transforms admit fast function and gradient evaluation, and span a rich range of parameter sharing configurations whose statistical modeling capacity can be explicitly tuned along a continuum from structured to unstructured. Experimental results show that these transforms can significantly accelerate inference and forward/backward passes during training, and offer superior accuracy-compactness-speed tradeoffs in comparison to a number of existing techniques. In keyword spotting applications in mobile speech recognition, our methods are much more effective than standard linear low-rank bottleneck layers and nearly retain the performance of state-of-the-art models, while providing more than 3.5-fold compression.
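
As a rough illustration of why a structured parameter matrix can be both compact and fast to apply, the sketch below uses a circulant matrix, a classical structured matrix defined by n parameters instead of n^2. The choice of a circulant matrix is an assumption made for illustration and is not claimed to be the paper's specific construction; the function names (`fft`, `circulant_matvec_fft`, `circulant_matvec_dense`) are hypothetical. The FFT-based product runs in O(n log n) and is checked against the explicit O(n^2) dense product.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 FFT. len(x) must be a power of two; no 1/n scaling."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def circulant_matvec_fft(c, x):
    """Compute C @ x where C[i][j] = c[(i - j) % n], using only the n values in c."""
    n = len(c)
    fc, fx = fft(c), fft(x)
    # Circular convolution becomes an elementwise product in the Fourier domain.
    y = fft([a * b for a, b in zip(fc, fx)], inverse=True)
    return [v.real / n for v in y]


def circulant_matvec_dense(c, x):
    """Reference product that materializes all n * n entries of the matrix."""
    n = len(c)
    dense = [[c[(i - j) % n] for j in range(n)] for i in range(n)]
    return [sum(dense[i][j] * x[j] for j in range(n)) for i in range(n)]


# Illustrative values only: 4 stored parameters stand in for a 4 x 4 matrix.
c = [1.0, 2.0, 0.0, -1.0]
x = [3.0, 0.5, -2.0, 1.0]
fast = circulant_matvec_fft(c, x)
slow = circulant_matvec_dense(c, x)
```

Here `fast` and `slow` agree up to floating-point error, while the structured version stores n parameters rather than n^2. Richer structured families trade some of that compactness for more modeling capacity, which is the continuum the abstract describes.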

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

EPNAS: Efficient Progressive Neural Architecture Search
cs.LG 2019-07 unverdicted novelty 5.0

EPNAS uses a progressive search policy with REINFORCE performance prediction to search neural architectures in parallel, supporting multiple resource constraints and outperforming ENAS and PNAS on CIFAR-10 and ImageNe...