Lifelong learning with dynamically expandable networks

Jaehong Yoon, Eunho Yang, Jeongtae Lee, Sung Ju Hwang · 2017 · cs.LG · arXiv 1708.01547

18 Pith papers cite this work. Polarity classification is still indexing.

abstract

We propose a novel deep network architecture for lifelong learning, which we refer to as the Dynamically Expandable Network (DEN), that can dynamically decide its network capacity as it trains on a sequence of tasks, to learn a compact, overlapping knowledge-sharing structure among tasks. DEN is efficiently trained in an online manner by performing selective retraining, dynamically expands network capacity upon arrival of each task with only the necessary number of units, and effectively prevents semantic drift by splitting/duplicating units and timestamping them. We validate DEN on multiple public datasets under lifelong learning scenarios, on which it not only significantly outperforms existing lifelong learning methods for deep networks, but also achieves the same level of performance as the batch counterparts with substantially fewer parameters. Further, the obtained network, fine-tuned on all tasks, achieved significantly better performance than the batch models, which shows that it can be used to estimate the optimal network structure even when all tasks are available in the first place.
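
The idea of expanding capacity only when a new task needs it can be illustrated with a toy sketch. This is not the authors' algorithm: the abstract does not give the expansion criterion or the initialization, so the loss threshold, the zero-initialized outgoing weights, and all names here (`forward`, `expand_hidden`, `maybe_expand`) are assumptions made for illustration. The sketch only shows that a hidden layer can be widened without changing what the network already computes.

```python
import random


def forward(x, w1, w2):
    """Two-layer ReLU network: input -> hidden layer -> scalar output."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w1]
    return sum(v * h for v, h in zip(w2, hidden))


def expand_hidden(w1, w2, k, rng):
    """Return copies of (w1, w2) with k extra hidden units.

    Assumption of this sketch: new incoming weights are small random
    values and new outgoing weights are zero, so the widened network
    computes the same function until it is trained further.
    """
    n_in = len(w1[0])
    new_rows = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(k)]
    return [row[:] for row in w1] + new_rows, w2[:] + [0.0] * k


def maybe_expand(loss, threshold, w1, w2, k, rng):
    """Widen the layer only when the loss on the new task is too high.

    The threshold rule is a hypothetical stand-in for an expansion test.
    """
    if loss > threshold:
        return expand_hidden(w1, w2, k, rng)
    return w1, w2


rng = random.Random(0)
w1 = [[0.5, -0.2], [0.1, 0.3]]  # 2 hidden units, 2 inputs
w2 = [1.0, -1.0]
x = [1.0, 2.0]

before = forward(x, w1, w2)
w1_big, w2_big = maybe_expand(0.9, 0.5, w1, w2, 3, rng)
after = forward(x, w1_big, w2_big)
```

Here `before` and `after` agree because the three added units contribute nothing to the output yet. A real method would then train the new units on the incoming task and, as the abstract says, keep only as many as are necessary.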

citation-role summary

background 3

citation-polarity summary

unclear 2 · background 1

representative citing papers

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

cs.AI · 2023-06-05 · conditional · novelty 8.0

LIBERO is a new benchmark for lifelong robot learning that evaluates transfer of declarative, procedural, and mixed knowledge across 130 manipulation tasks with provided demonstration data.

Continual Learning of Domain-Invariant Representations

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Introduces replay-based continual learning with sequential invariance alignment to learn domain-invariant representations, outperforming baselines on generalization to unseen domains across six datasets in vision, medicine, manufacturing, and ecology.

Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

SPA unlocks patch-level features in CLIP for class-incremental learning via semantic-guided selection and optimal transport alignment with class descriptions, plus projectors and pseudo-feature replay to reduce forgetting.

NetTailor: Tuning the Architecture, Not Just the Weights

cs.CV · 2019-06-29 · unverdicted · novelty 7.0

NetTailor adapts CNN architecture for new tasks by assembling pre-trained universal blocks with task-specific layers, trained via activation mimicry and complexity penalties to match accuracy while reducing size for simpler tasks.

Beyond Single-Model Optimization: Preserving Plasticity in Continual Reinforcement Learning

cs.LG · 2026-04-16 · unverdicted · novelty 7.0

TeLAPA maintains archives of behaviorally diverse yet competent policies aligned in a shared latent space to preserve plasticity and enable faster recovery after interference in continual reinforcement learning.

Continual Few-shot Adaptation for Synthetic Fingerprint Detection

cs.CV · 2026-03-15 · unverdicted · novelty 6.0

A continual few-shot adaptation method combining binary cross-entropy and supervised contrastive losses with replay achieves a good trade-off between fast adaptation to unseen synthetic fingerprint styles and retention of known styles.

Scalable and Efficient Continual Learning from Demonstration via a Hypernetwork-generated Stable Dynamics Model

cs.RO · 2023-11-06 · unverdicted · novelty 6.0

A hypernetwork generates clock-augmented stable neural ODEs (sNODEs) for scalable continual learning from demonstration, achieving O(N) training time via stochastic regularization while outperforming baselines on LfD tasks up to 26 skills and 32 dimensions.

Lifelong Learning Starting From Zero

cs.LG · 2019-06-24 · unverdicted · novelty 6.0

A blank-slate neural network grows via expansion, generalization, forgetting, and backpropagation for lifelong learning with claimed gains in accuracy, efficiency, and versatility.

NORACL: Neurogenesis for Oracle-free Resource-Adaptive Continual Learning

cs.LG · 2026-04-29 · unverdicted · novelty 6.0

NORACL dynamically grows network capacity via neurogenesis-inspired signals to achieve oracle-level continual learning performance without pre-specifying architecture size.

Leveraging Complementary Embeddings for Replay Selection in Continual Learning with Small Buffers

cs.LG · 2026-04-09 · unverdicted · novelty 6.0

MERS improves replay buffer selection in continual learning by integrating supervised and self-supervised embeddings via a graph-based approach, outperforming single-embedding baselines on CIFAR-100 and TinyImageNet in low-memory regimes.

STAR-IOD: Scale-decoupled Topology Alignment with Pseudo-label Refinement for Remote Sensing Incremental Object Detection

cs.CV · 2026-05-20 · unverdicted · novelty 5.0

STAR-IOD applies scale-decoupled topology alignment and K-Means-based pseudo-label refinement to reduce catastrophic forgetting in remote sensing incremental object detection, reporting 1.7% and 2.1% mAP gains on new DIOR-IOD and DOTA-IOD datasets.

On the Stability of Growth in Structural Plasticity

cs.LG · 2026-05-14 · unverdicted · novelty 5.0

Newborn units in growing neural networks are forward-active but backward-starved, receiving weaker gradients than existing units and creating integration challenges that make growth less reliable than pruning in complex tasks.

BRAIN: Bias-Mitigation Continual Learning Approach to Vision-Brain Understanding

cs.CV · 2025-08-25 · unverdicted · novelty 5.0

BRAIN uses bias-mitigation continual learning with a new de-bias contrastive loss and angular forgetting mitigation to achieve SOTA performance on vision-brain understanding benchmarks despite brain signal inconsistencies across sessions.

Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents

cs.NE · 2025-04-18 · unverdicted · novelty 5.0

SwitchMT uses adaptive task-switching in deep spiking Q-networks with active dendrites to reduce task interference in multi-task RL, achieving competitive Atari scores without added network complexity.

HEDP: A Hybrid Energy-Distance Prompt-based Framework for Domain Incremental Learning

cs.AI · 2026-05-07 · unverdicted · novelty 5.0

HEDP uses energy regularization inspired by Helmholtz free energy plus hybrid energy-distance weighting in prompts to improve domain selection and achieve a 2.57% accuracy gain on benchmarks like CORe50 while mitigating catastrophic forgetting.

A Domain Incremental Continual Learning Benchmark for ICU Time Series Model Transportability

cs.LG · 2026-05-05 · unverdicted · novelty 5.0

Proposes a domain incremental continual learning benchmark for ICU time series model transportability across US regions and evaluates data replay and EWC methods.

Adaptive Reorganization of Neural Pathways for Continual Learning with Spiking Neural Networks

cs.NE · 2023-09-18 · unverdicted · novelty 4.0

SOR-SNN employs Self-Organizing Regulation networks to reorganize a single SNN into sparse pathways, achieving better performance, energy efficiency, memory use, backward transfer, and self-repair on continual learning tasks including CIFAR100 and ImageNet.

Incremental Concept Learning via Online Generative Memory Recall

cs.LG · 2019-07-05 · unverdicted · novelty 4.0

Pseudo-rehearsal method with cGAN-generated old-concept samples, balanced online recall, and concept contrastive loss for class-incremental learning on MNIST, Fashion-MNIST and SVHN.

citing papers explorer

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning · cs.AI · 2023-06-05 · conditional · none · ref 72
Continual Learning of Domain-Invariant Representations · cs.LG · 2026-05-15 · unverdicted · none · ref 26 · internal anchor
Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning · cs.CV · 2026-05-13 · unverdicted · none · ref 59 · internal anchor
NetTailor: Tuning the Architecture, Not Just the Weights · cs.CV · 2019-06-29 · unverdicted · none · ref 69 · internal anchor
Beyond Single-Model Optimization: Preserving Plasticity in Continual Reinforcement Learning · cs.LG · 2026-04-16 · unverdicted · none · ref 29
Continual Few-shot Adaptation for Synthetic Fingerprint Detection · cs.CV · 2026-03-15 · unverdicted · none · ref 19 · internal anchor
Scalable and Efficient Continual Learning from Demonstration via a Hypernetwork-generated Stable Dynamics Model · cs.RO · 2023-11-06 · unverdicted · none · ref 54 · internal anchor
Lifelong Learning Starting From Zero · cs.LG · 2019-06-24 · unverdicted · none · ref 18 · internal anchor
NORACL: Neurogenesis for Oracle-free Resource-Adaptive Continual Learning · cs.LG · 2026-04-29 · unverdicted · none · ref 24
Leveraging Complementary Embeddings for Replay Selection in Continual Learning with Small Buffers · cs.LG · 2026-04-09 · unverdicted · none · ref 5
STAR-IOD: Scale-decoupled Topology Alignment with Pseudo-label Refinement for Remote Sensing Incremental Object Detection · cs.CV · 2026-05-20 · unverdicted · none · ref 177 · internal anchor
On the Stability of Growth in Structural Plasticity · cs.LG · 2026-05-14 · unverdicted · none · ref 44 · internal anchor
BRAIN: Bias-Mitigation Continual Learning Approach to Vision-Brain Understanding · cs.CV · 2025-08-25 · unverdicted · none · ref 107 · internal anchor
Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents · cs.NE · 2025-04-18 · unverdicted · none · ref 11 · internal anchor
HEDP: A Hybrid Energy-Distance Prompt-based Framework for Domain Incremental Learning · cs.AI · 2026-05-07 · unverdicted · none · ref 70
A Domain Incremental Continual Learning Benchmark for ICU Time Series Model Transportability · cs.LG · 2026-05-05 · unverdicted · none · ref 14
Adaptive Reorganization of Neural Pathways for Continual Learning with Spiking Neural Networks · cs.NE · 2023-09-18 · unverdicted · none · ref 23 · internal anchor
Incremental Concept Learning via Online Generative Memory Recall · cs.LG · 2019-07-05 · unverdicted · none · ref 30 · internal anchor
