A continual learning survey: Defying forgetting in classification tasks,

· 2021 · arXiv 2021.305744

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

read on arXiv browse 12 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

Reasoning Portability: Guiding Continual Learning for MLLMs in the RLVR Era

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

Formalizes Reasoning Portability (RP) and proposes RDB-CL to modulate per-sample KL regularization in RLVR for MLLM continual learning, achieving +12.0% Last accuracy over vanilla RLVR baseline by preserving reusable reasoning on high-RP samples.

Streaming Adversarial Robustness in Fuzzy ARTMAP: Mechanism-Aligned Evaluation, Progressive Training, and Interpretable Diagnostics

cs.LG · 2026-05-07 · conditional · novelty 7.0

Fuzzy ARTMAP models are highly vulnerable to a new white-box attack aligned with their category competition, but progressive selective training yields stronger replay-free robustness than offline adversarial training under adaptive evaluation.

CapTrack: Multifaceted Evaluation of Forgetting in LLM Post-Training

cs.LG · 2026-02-19 · unverdicted · novelty 7.0

CapTrack shows post-training causes drift beyond facts, with instruction fine-tuning producing stronger behavioral changes than preference optimization across model families.

DeCoFlow: Structural Decomposition of Normalizing Flows for Continual Anomaly Detection

cs.CV · 2026-06-25 · unverdicted · novelty 6.0

DeCoFlow decomposes normalizing flow subnets into frozen bases and low-rank adapters with alignment, auxiliary layers, and tail-aware loss to achieve continual anomaly detection with zero forgetting and few added parameters.

GoTTA be Diverse: Rethinking Memory Policies for Test-Time Adaptation

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

Diversity-aware memory policies improve test-time adaptation performance most under constrained memory budgets and challenging non-i.i.d. streams.

Fine-Tuning Regimes Define Distinct Continual Learning Problems

cs.LG · 2026-04-23 · unverdicted · novelty 6.0

The relative rankings of continual learning methods are not preserved across different fine-tuning regimes defined by trainable parameter depth.

Self-Evolving Cognitive Framework via Causal World Modeling for Embodied Scientific Intelligence

cs.AI · 2026-06-21 · unverdicted · novelty 5.0

Proposes a self-evolving cognitive framework integrating causal world modeling, intervention-driven reasoning, and continual refinement for embodied scientific intelligence.

Anytime Training with Schedule-Free Spectral Optimization

cs.LG · 2026-05-21 · unverdicted · novelty 5.0

SF-NorMuon is a new schedule-free spectral optimizer that closes the gap with tuned AdamW on 125M-772M parameter models across 1-8x Chinchilla horizons while providing stationarity guarantees.

Attention to task structure for cognitive flexibility

cs.NE · 2026-04-14 · unverdicted · novelty 5.0

Task connectivity in graph-structured multi-task environments enhances generalization and stability, with stronger benefits for attention models than MLPs.

Out-of-Distribution Generalization in Time Series: A Survey

cs.LG · 2025-03-18 · unverdicted · novelty 5.0

This is the first comprehensive survey of OOD generalization methodologies for time series, organized across data distribution, representation learning, and OOD evaluation.

Learning Entropy and Spatial Adaptation Dynamics of Multilayer Perceptrons for Structural Point Extraction

cs.LG · 2026-06-08 · unverdicted · novelty 4.0

Spatial Learning Entropy Maps derived from MLP weight adaptations during spatial pixel prediction tasks highlight image points with high learning impact.

The Dynamic Gist-Based Memory Model (DGMM): A Memory-Centric Architecture for Artificial Intelligence

cs.AI · 2026-05-04 · unverdicted · novelty 3.0

DGMM is proposed as an explicit graph-structured memory architecture for AI that enables persistent episodic memory, cue-based recall, and context-dependent interpretation without retraining.

citing papers explorer

Showing 11 of 11 citing papers after filters.

Reasoning Portability: Guiding Continual Learning for MLLMs in the RLVR Era cs.LG · 2026-05-17 · unverdicted · none · ref 8
Formalizes Reasoning Portability (RP) and proposes RDB-CL to modulate per-sample KL regularization in RLVR for MLLM continual learning, achieving +12.0% Last accuracy over vanilla RLVR baseline by preserving reusable reasoning on high-RP samples.
CapTrack: Multifaceted Evaluation of Forgetting in LLM Post-Training cs.LG · 2026-02-19 · unverdicted · none · ref 9
CapTrack shows post-training causes drift beyond facts, with instruction fine-tuning producing stronger behavioral changes than preference optimization across model families.
DeCoFlow: Structural Decomposition of Normalizing Flows for Continual Anomaly Detection cs.CV · 2026-06-25 · unverdicted · none · ref 4
DeCoFlow decomposes normalizing flow subnets into frozen bases and low-rank adapters with alignment, auxiliary layers, and tail-aware loss to achieve continual anomaly detection with zero forgetting and few added parameters.
GoTTA be Diverse: Rethinking Memory Policies for Test-Time Adaptation cs.CV · 2026-05-19 · unverdicted · none · ref 18
Diversity-aware memory policies improve test-time adaptation performance most under constrained memory budgets and challenging non-i.i.d. streams.
Fine-Tuning Regimes Define Distinct Continual Learning Problems cs.LG · 2026-04-23 · unverdicted · none · ref 4
The relative rankings of continual learning methods are not preserved across different fine-tuning regimes defined by trainable parameter depth.
Self-Evolving Cognitive Framework via Causal World Modeling for Embodied Scientific Intelligence cs.AI · 2026-06-21 · unverdicted · none · ref 16
Proposes a self-evolving cognitive framework integrating causal world modeling, intervention-driven reasoning, and continual refinement for embodied scientific intelligence.
Anytime Training with Schedule-Free Spectral Optimization cs.LG · 2026-05-21 · unverdicted · none · ref 15
SF-NorMuon is a new schedule-free spectral optimizer that closes the gap with tuned AdamW on 125M-772M parameter models across 1-8x Chinchilla horizons while providing stationarity guarantees.
Attention to task structure for cognitive flexibility cs.NE · 2026-04-14 · unverdicted · none · ref 4
Task connectivity in graph-structured multi-task environments enhances generalization and stability, with stronger benefits for attention models than MLPs.
Out-of-Distribution Generalization in Time Series: A Survey cs.LG · 2025-03-18 · unverdicted · none · ref 17
This is the first comprehensive survey of OOD generalization methodologies for time series, organized across data distribution, representation learning, and OOD evaluation.
Learning Entropy and Spatial Adaptation Dynamics of Multilayer Perceptrons for Structural Point Extraction cs.LG · 2026-06-08 · unverdicted · none · ref 13
Spatial Learning Entropy Maps derived from MLP weight adaptations during spatial pixel prediction tasks highlight image points with high learning impact.
The Dynamic Gist-Based Memory Model (DGMM): A Memory-Centric Architecture for Artificial Intelligence cs.AI · 2026-05-04 · unverdicted · none · ref 16
DGMM is proposed as an explicit graph-structured memory architecture for AI that enables persistent episodic memory, cue-based recall, and context-dependent interpretation without retraining.

A continual learning survey: Defying forgetting in classification tasks,

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer