Unifying and Merging Well-trained Deep Neural Networks for Inference Stage

Chih-Yi Chiu; Chu-Song Chen; Jia-Hong Lee; Yi-Min Chou; Yi-Ming Chan

arxiv: 1805.04980 · v1 · pith:RXZMPOOYnew · submitted 2018-05-14 · 💻 cs.CV

Unifying and Merging Well-trained Deep Neural Networks for Inference Stage

Yi-Min Chou , Yi-Ming Chan , Jia-Hong Lee , Chih-Yi Chiu , Chu-Song Chen This is my paper

classification 💻 cs.CV

keywords methodnetworksmodelweightswell-trainedarchitecturesdifferentinference

0 comments

read the original abstract

We propose a novel method to merge convolutional neural-nets for the inference stage. Given two well-trained networks that may have different architectures that handle different tasks, our method aligns the layers of the original networks and merges them into a unified model by sharing the representative codes of weights. The shared weights are further re-trained to fine-tune the performance of the merged model. The proposed method effectively produces a compact model that may run original tasks simultaneously on resource-limited devices. As it preserves the general architectures and leverages the co-used weights of well-trained networks, a substantial training overhead can be reduced to shorten the system development time. Experimental results demonstrate a satisfactory performance and validate the effectiveness of the method.

This paper has not been read by Pith yet.

Unifying and Merging Well-trained Deep Neural Networks for Inference Stage

discussion (0)