pith. sign in

arxiv: 1802.02950 · v4 · pith:JPOOXK2Ynew · submitted 2018-02-08 · 💻 cs.CV

Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting

classification 💻 cs.CV
keywords consolidationforgettinglearningresultsweightbettercatastrophicelastic
0
0 comments X
read the original abstract

In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the network parameters. This reparameterization takes the form of a factorized rotation of parameter space which, when used in conjunction with Elastic Weight Consolidation (which assumes a diagonal Fisher Information Matrix), leads to significantly better performance on lifelong learning of sequential tasks. Experimental results on the MNIST, CIFAR-100, CUB-200 and Stanford-40 datasets demonstrate that we significantly improve the results of standard elastic weight consolidation, and that we obtain competitive results when compared to other state-of-the-art in lifelong learning without forgetting.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.