pith. machine review for the scientific record.

arxiv: 1810.12488 · v4 · submitted 2018-10-30 · 💻 cs.LG · cs.AI · cs.CV

Recognition: unknown

Re-evaluating Continual Learning Scenarios: A Categorization and Case for Strong Baselines

classification 💻 cs.LG · cs.AI · cs.CV
keywords scenarios, baselines, categorization, continual, learning, methods, several, strong

Continual learning has received a great deal of attention recently, with several approaches being proposed. However, evaluations involve a diverse set of scenarios, making meaningful comparison difficult. This work provides a systematic categorization of the scenarios and evaluates them within a consistent framework that includes strong baselines and state-of-the-art methods. The results give an understanding of the relative difficulty of the scenarios and show that simple baselines (Adagrad, L2 regularization, and naive rehearsal strategies) can, surprisingly, achieve performance similar to current mainstream methods. We conclude with several suggestions for creating harder evaluation scenarios and for future research directions. The code is available at https://github.com/GT-RIPL/Continual-Learning-Benchmark

This paper has not been read by Pith yet.

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Stop Marginalizing My Dreams: Model Inversion via Laplace Kernel for Continual Learning

    cs.LG 2026-05 unverdicted novelty 7.0

    REMIX uses Laplace kernel parameterization to enable scalable full-covariance modeling in model inversion, improving synthetic sample quality and performance in data-free continual learning.

  2. Temporal Taskification in Streaming Continual Learning: A Source of Evaluation Instability

    cs.LG 2026-04 conditional novelty 6.0

    Different valid temporal partitions of the same streaming dataset can produce materially different rankings and performance numbers for continual learning methods.

  3. Fine-Tuning Regimes Define Distinct Continual Learning Problems

    cs.LG 2026-04 unverdicted novelty 6.0

    The relative rankings of continual learning methods are not preserved across different fine-tuning regimes defined by trainable parameter depth.

  4. Continual Distillation of Teachers from Different Domains

    cs.LG 2026-04 conditional novelty 6.0

    SE2D stabilizes continual distillation across heterogeneous teachers by preserving logits on external unlabeled data to mitigate unseen knowledge forgetting.

  5. Tracking Adaptation Time: Metrics for Temporal Distribution Shift

    cs.LG 2026-04 unverdicted novelty 6.0

    Three complementary metrics are introduced to distinguish model adaptation from intrinsic data difficulty under temporal distribution shift.

  6. Neural Network Optimization Reimagined: Decoupled Techniques for Scratch and Fine-Tuning

    cs.CV 2026-04 unverdicted novelty 3.0

    DualOpt decouples optimization by using real-time layer-wise weight decay for scratch training and weight rollback for fine-tuning, improving convergence and generalization and reducing knowledge forgetting.