Bayesian Tensorized Neural Networks with Automatic Rank Selection

Cole Hawkins; Zheng Zhang

arxiv: 1905.10478 · v1 · pith:ISAW6Q7Bnew · submitted 2019-05-24 · 💻 cs.LG · stat.ML

Bayesian Tensorized Neural Networks with Automatic Rank Selection

Cole Hawkins , Zheng Zhang This is my paper

classification 💻 cs.LG stat.ML

keywords neuralnetworktensorbayesiannetworksranktensorizedtraining

0 comments

read the original abstract

Tensor decomposition is an effective approach to compress over-parameterized neural networks and to enable their deployment on resource-constrained hardware platforms. However, directly applying tensor compression in the training process is a challenging task due to the difficulty of choosing a proper tensor rank. In order to achieve this goal, this paper proposes a Bayesian tensorized neural network. Our Bayesian method performs automatic model compression via an adaptive tensor rank determination. We also present approaches for posterior density calculation and maximum a posteriori (MAP) estimation for the end-to-end training of our tensorized neural network. We provide experimental validation on a fully connected neural network, a CNN and a residual neural network where our work produces $7.4\times$ to $137\times$ more compact neural networks directly from the training.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Tucker Tensor Decomposition on FPGA
eess.SP 2019-06 unverdicted novelty 5.0

FPGA accelerator for Tucker decomposition reports 2.16-30.2x speedup versus CPU/GPU toolboxes on cardiac MRI data via fixed-point design and warm-start SVD.