Incremental Learning Through Deep Adaptation

Amir Rosenfeld; John K. Tsotsos

arxiv: 1705.04228 · v2 · pith:H7AR7XOTnew · submitted 2017-05-11 · 💻 cs.CV · cs.LG

Incremental Learning Through Deep Adaptation

Amir Rosenfeld , John K. Tsotsos This is my paper

classification 💻 cs.CV cs.LG

keywords networklearnedexistingoriginalperformanceadaptationarchitecturedeep

0 comments

read the original abstract

Given an existing trained neural network, it is often desirable to learn new capabilities without hindering performance of those already learned. Existing approaches either learn sub-optimal solutions, require joint training, or incur a substantial increment in the number of parameters for each added domain, typically as many as the original network. We propose a method called \emph{Deep Adaptation Networks} (DAN) that constrains newly learned filters to be linear combinations of existing ones. DANs precisely preserve performance on the original domain, require a fraction (typically 13\%, dependent on network architecture) of the number of parameters compared to standard fine-tuning procedures and converge in less cycles of training to a comparable or better level of performance. When coupled with standard network quantization techniques, we further reduce the parameter cost to around 3\% of the original with negligible or no loss in accuracy. The learned architecture can be controlled to switch between various learned representations, enabling a single network to solve a task from multiple different domains. We conduct extensive experiments showing the effectiveness of our method on a range of image classification tasks and explore different aspects of its behavior.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Efficient Multi-Domain Network Learning by Covariance Normalization
cs.CV 2019-06 unverdicted novelty 5.0

CovNorm reduces parameters in domain-adaptive layers via two PCAs and a mini-adaptation layer, enabling efficient multi-domain learning with performance close to full fine-tuning.