Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting

  • 2018-02-08 16:25:29
  • Xialei Liu, Marc Masana, Luis Herranz, Joost Van de Weijer, Antonio M. Lopez, Andrew D. Bagdanov


In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the network parameters. This reparameterization takes the form of a factorized rotation of parameter space which, when used in conjunction with Elastic Weight Consolidation (which assumes a diagonal Fisher Information Matrix), leads to significantly better performance on lifelong learning of sequential tasks. Experimental results on the MNIST, CIFAR-100, CUB-200 and Stanford-40 datasets demonstrate that we significantly improve the results of standard Elastic Weight Consolidation, and that we obtain competitive results when compared to other state-of-the-art methods for lifelong learning without forgetting.
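To illustrate why rotating parameter space can help a penalty that assumes a diagonal Fisher Information Matrix, here is a minimal toy sketch in NumPy. It is not the authors' factorized, per-layer rotation: it uses a full eigendecomposition of a hand-picked 2x2 matrix, and the function names, the matrix `fisher`, the displacement `delta`, and the weight `lam` are all hypothetical choices made for illustration.

```python
import numpy as np


def full_quadratic_penalty(fisher, delta, lam=1.0):
    """Quadratic penalty using the full (non-diagonal) Fisher matrix."""
    return 0.5 * lam * float(delta @ fisher @ delta)


def diagonal_penalty(fisher, delta, lam=1.0):
    """EWC-style penalty that keeps only the diagonal of the Fisher matrix."""
    return 0.5 * lam * float(np.sum(np.diag(fisher) * delta ** 2))


# Hypothetical Fisher matrix with strong off-diagonal terms (assumption).
fisher = np.array([[2.0, 1.5],
                   [1.5, 2.0]])
# Hypothetical displacement from the previous task's optimum (assumption).
delta = np.array([1.0, -1.0])

full = full_quadratic_penalty(fisher, delta)
diag_plain = diagonal_penalty(fisher, delta)

# Rotate into the eigenbasis of the Fisher matrix. In this basis the
# matrix is diagonal, so the diagonal assumption loses nothing.
eigvals, rotation = np.linalg.eigh(fisher)
fisher_rot = rotation.T @ fisher @ rotation
delta_rot = rotation.T @ delta
diag_rot = diagonal_penalty(fisher_rot, delta_rot)

print(f"full quadratic penalty:        {full:.3f}")
print(f"diagonal penalty, original:    {diag_plain:.3f}")
print(f"diagonal penalty, after rotation: {diag_rot:.3f}")
```

In this toy case the diagonal penalty in the original coordinates overestimates the cost of moving along a direction the full matrix considers cheap, while the diagonal penalty computed after rotation matches the full quadratic form. For a real network the exact eigendecomposition of the full Fisher matrix is impractical, which is presumably why the abstract describes an approximate, factorized rotation instead.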

