Temporal-Difference Variational Continual Learning

Abstract

Machine Learning models in real-world applications must continuously learn new tasks to adapt to shifts in the data-generating distribution. Yet, for Continual Learning (CL), models often struggle to balance learning new tasks (plasticity) with retaining previous knowledge (memory stability). Consequently, they are susceptible to Catastrophic Forgetting, which degrades performance and undermines the reliability of deployed systems. In the Bayesian CL literature, variational methods tackle this challenge by employing a learning objective that recursively updates the posterior distribution while constraining it to stay close to its previous estimate. Nonetheless, we argue that these methods may be ineffective due to compounding approximation errors over successive recursions. To mitigate this, we propose new learning objectives that integrate the regularization effects of multiple previous posterior estimations, preventing individual errors from dominating future posterior updates and compounding over time. We reveal insightful connections between these objectives and Temporal-Difference methods, a popular learning mechanism in Reinforcement Learning and Neuroscience. Experiments on challenging CL benchmarks show that our approach effectively mitigates Catastrophic Forgetting, outperforming strong Variational CL methods.
