Variational Continual Learning

Cuong V Nguyen, Yingzhen Li, Thang D Bui, Richard E Turner · 2017 · stat.ML · arXiv 1710.10628

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

This paper develops variational continual learning (VCL), a simple but general framework for continual learning that fuses online variational inference (VI) and recent advances in Monte Carlo VI for neural networks. The framework can successfully train both deep discriminative models and deep generative models in complex continual learning settings where existing tasks evolve over time and entirely new tasks emerge. Experimental results show that VCL outperforms state-of-the-art continual learning methods on a variety of tasks, avoiding catastrophic forgetting in a fully automatic way.

representative citing papers

Preconditioned DeltaNet: Curvature-aware Sequence Modeling for Linear Recurrences

cs.LG · 2026-04-22 · unverdicted · novelty 7.0

Preconditioned delta-rule models with a diagonal curvature approximation improve upon standard DeltaNet, GDN, and KDA by better approximating the test-time regression objective.

Online Bayesian Calibration under Gradual and Abrupt System Changes

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

BRPC is an online Bayesian calibration framework that decouples parameter tracking from discrepancy modeling for gradual nonstationarity and adds restart mechanisms to handle abrupt regime shifts.

Pushing the Limits of Distillation-Based Continual Learning via Classifier-Proximal Lightweight Plugins

cs.LG · 2025-12-03 · unverdicted · novelty 5.0

DLC inserts lightweight classifier-proximal plugins into distillation-based continual learning to achieve 8% accuracy gains on large benchmarks with only 4% extra backbone parameters.

citing papers explorer

Showing 3 of 3 citing papers.

Preconditioned DeltaNet: Curvature-aware Sequence Modeling for Linear Recurrences cs.LG · 2026-04-22 · unverdicted · none · ref 36
Preconditioned delta-rule models with a diagonal curvature approximation improve upon standard DeltaNet, GDN, and KDA by better approximating the test-time regression objective.
Online Bayesian Calibration under Gradual and Abrupt System Changes cs.LG · 2026-05-07 · unverdicted · none · ref 23
BRPC is an online Bayesian calibration framework that decouples parameter tracking from discrepancy modeling for gradual nonstationarity and adds restart mechanisms to handle abrupt regime shifts.
Pushing the Limits of Distillation-Based Continual Learning via Classifier-Proximal Lightweight Plugins cs.LG · 2025-12-03 · unverdicted · none · ref 21 · internal anchor
DLC inserts lightweight classifier-proximal plugins into distillation-based continual learning to achieve 8% accuracy gains on large benchmarks with only 4% extra backbone parameters.

Variational Continual Learning

fields

years

verdicts

representative citing papers

citing papers explorer