Like What You Like: Knowledge Distill via Neuron Selectivity Transfer

Naiyan Wang; Zehao Huang

arxiv: 1707.01219 · v2 · pith:LCDYQU6Dnew · submitted 2017-07-05 · 💻 cs.CV · cs.LG· cs.NE

Like What You Like: Knowledge Distill via Neuron Selectivity Transfer

Zehao Huang , Naiyan Wang This is my paper

classification 💻 cs.CV cs.LGcs.NE

keywords knowledgenetworksmethodstudenttransferdistributionsfunctionlike

0 comments

read the original abstract

Despite deep neural networks have demonstrated extraordinary power in various applications, their superior performances are at expense of high storage and computational costs. Consequently, the acceleration and compression of neural networks have attracted much attention recently. Knowledge Transfer (KT), which aims at training a smaller student network by transferring knowledge from a larger teacher model, is one of the popular solutions. In this paper, we propose a novel knowledge transfer method by treating it as a distribution matching problem. Particularly, we match the distributions of neuron selectivity patterns between teacher and student networks. To achieve this goal, we devise a new KT loss function by minimizing the Maximum Mean Discrepancy (MMD) metric between these distributions. Combined with the original loss function, our method can significantly improve the performance of student networks. We validate the effectiveness of our method across several datasets, and further combine it with other KT methods to explore the best possible results. Last but not least, we fine-tune the model to other tasks such as object detection. The results are also encouraging, which confirm the transferability of the learned features.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Canine EEG Helps Human: Cross-Species and Cross-Modality Epileptic Seizure Detection via Multi-Space Alignment
eess.SP 2024-12 unverdicted novelty 6.0

A multi-space alignment framework using domain adaptation and knowledge distillation improves cross-species and cross-modality epileptic seizure detection from EEG, achieving over 90% AUC with limited target data.
Multi-Modality Distillation via Learning the teacher's modality-level Gram Matrix
cs.AI 2021-12 unverdicted novelty 4.0

Proposes a modality relation distillation method that transfers teacher modality relationships via the modality-level Gram Matrix.