Learning What and Where to Transfer

Hankook Lee; Jinwoo Shin; Sung Ju Hwang; Yunhun Jang

arxiv: 1905.05901 · v1 · pith:HDFMS6H2new · submitted 2019-05-15 · 💻 cs.LG · stat.ML

Learning What and Where to Transfer

Yunhun Jang , Hankook Lee , Sung Ju Hwang , Jinwoo Shin This is my paper

classification 💻 cs.LG stat.ML

keywords transferlearningknowledgenetworksourcetargetwhatapproach

0 comments

read the original abstract

As the application of deep learning has expanded to real-world problems with insufficient volume of training data, transfer learning recently has gained much attention as means of improving the performance in such small-data regime. However, when existing methods are applied between heterogeneous architectures and tasks, it becomes more important to manage their detailed configurations and often requires exhaustive tuning on them for the desired performance. To address the issue, we propose a novel transfer learning approach based on meta-learning that can automatically learn what knowledge to transfer from the source network to where in the target network. Given source and target networks, we propose an efficient training scheme to learn meta-networks that decide (a) which pairs of layers between the source and target networks should be matched for knowledge transfer and (b) which features and how much knowledge from each feature should be transferred. We validate our meta-transfer approach against recent transfer learning methods on various datasets and network architectures, on which our automated scheme significantly outperforms the prior baselines that find "what and where to transfer" in a hand-crafted manner.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Temporal Transfer Learning for Traffic Optimization with Coarse-grained Advisory Autonomy
cs.RO 2023-11 unverdicted novelty 7.0

Temporal Transfer Learning selects source tasks for zero-shot transfer of RL policies to solve a range of coarse-grained advisory autonomy hold durations in traffic optimization more reliably than baselines.