pith. sign in

hub Canonical reference

Progressive Neural Networks

Canonical reference. 77% of citing Pith papers cite this work as background.

86 Pith papers citing it
Background 77% of classified citations
abstract

Learning to solve complex sequences of tasks--while both leveraging transfer and avoiding catastrophic forgetting--remains a key obstacle to achieving human-level intelligence. The progressive networks approach represents a step forward in this direction: they are immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features. We evaluate this architecture extensively on a wide variety of reinforcement learning tasks (Atari and 3D maze games), and show that it outperforms common baselines based on pretraining and finetuning. Using a novel sensitivity measure, we demonstrate that transfer occurs at both low-level sensory and high-level control layers of the learned policy.

hub tools

citation-role summary

background 12 baseline 1

citation-polarity summary

claims ledger

  • abstract Learning to solve complex sequences of tasks--while both leveraging transfer and avoiding catastrophic forgetting--remains a key obstacle to achieving human-level intelligence. The progressive networks approach represents a step forward in this direction: they are immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features. We evaluate this architecture extensively on a wide variety of reinforcement learning tasks (Atari and 3D maze games), and show that it outperforms common baselines based on pretraining and finetuning. Using a novel sensitivi

co-cited works

clear filters

representative citing papers

ReConText3D: Replay-based Continual Text-to-3D Generation

cs.CV · 2026-04-15 · conditional · novelty 8.0

ReConText3D is the first replay-memory framework for continual text-to-3D generation that prevents catastrophic forgetting on new textual categories while preserving quality on previously seen classes.

Continuous-time Optimal Stopping through Deep Reinforcement Learning

cs.LG · 2026-06-16 · unverdicted · novelty 7.0

CARLOS employs an aggregate deep neural network trained on progressively finer time grids with adaptive sampling to learn continuous-time exercise boundaries for optimal stopping, delivering higher values than discrete Bermudan methods.

Continual Learning of Domain-Invariant Representations

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Introduces replay-based continual learning with sequential invariance alignment to learn domain-invariant representations, outperforming baselines on generalization to unseen domains across six datasets in vision, medicine, manufacturing, and ecology.

A Generalist Agent

cs.AI · 2022-05-12 · accept · novelty 7.0

Gato is a multi-modal, multi-task, multi-embodiment generalist policy using one transformer network to handle text, vision, games, and robotics tasks.

NetTailor: Tuning the Architecture, Not Just the Weights

cs.CV · 2019-06-29 · unverdicted · novelty 7.0

NetTailor adapts CNN architecture for new tasks by assembling pre-trained universal blocks with task-specific layers, trained via activation mimicry and complexity penalties to match accuracy while reducing size for simpler tasks.

The Long-Term Effects of Data Selection in LLM Fine-Tuning

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

Short-term data selectors in multi-stage LLM fine-tuning can slow future learning and increase forgetting, formalized as myopic selection with a proposed LHAS objective to address it.

citing papers explorer

Showing 9 of 9 citing papers after filters.

  • Dota 2 with Large Scale Deep Reinforcement Learning cs.LG · 2019-12-13 · accept · none · ref 55 · internal anchor

    OpenAI Five achieved superhuman performance in Dota 2 by defeating the world champions using scaled self-play reinforcement learning.

  • NetTailor: Tuning the Architecture, Not Just the Weights cs.CV · 2019-06-29 · unverdicted · none · ref 55 · internal anchor

    NetTailor adapts CNN architecture for new tasks by assembling pre-trained universal blocks with task-specific layers, trained via activation mimicry and complexity penalties to match accuracy while reducing size for simpler tasks.

  • Attentive Multi-Task Deep Reinforcement Learning cs.LG · 2019-07-05 · unverdicted · none · ref 26 · internal anchor

    Attention mechanism dynamically groups task knowledge at state granularity in multi-task DRL to enable positive transfer and avoid negative transfer, matching or exceeding prior methods with fewer parameters.

  • Lifelong Learning Starting From Zero cs.LG · 2019-06-24 · unverdicted · none · ref 27 · internal anchor

    A blank-slate neural network grows via expansion, generalization, forgetting, and backpropagation for lifelong learning with claimed gains in accuracy, efficiency, and versatility.

  • Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction cs.LG · 2019-06-21 · unverdicted · none · ref 20 · internal anchor

    CDAN framework uses diversity exploration and adversarial self-correction for continual RL in continuous control, evaluated on new CAM environment with NSD metric showing 18.35% NSD improvement over baseline.

  • Growing a Brain: Fine-Tuning by Increasing Model Capacity cs.CV · 2019-07-18 · unverdicted · none · ref 34 · internal anchor

    Growing CNN capacity by widening or deepening layers with normalized new units outperforms standard fine-tuning on vision benchmarks.

  • Efficient Multi-Domain Network Learning by Covariance Normalization cs.CV · 2019-06-24 · unverdicted · none · ref 40 · internal anchor

    CovNorm reduces parameters in domain-adaptive layers via two PCAs and a mini-adaptation layer, enabling efficient multi-domain learning with performance close to full fine-tuning.

  • Beneficial perturbation network for continual learning cs.LG · 2019-06-22 · unverdicted · none · ref 6 · internal anchor

    BPN adds task-specific beneficial perturbations as biases to neural networks to overcome catastrophic forgetting without storing prior data or expanding the network substantially.

  • Incremental Concept Learning via Online Generative Memory Recall cs.LG · 2019-07-05 · unverdicted · none · ref 29 · internal anchor

    Pseudo-rehearsal method with cGAN-generated old-concept samples, balanced online recall, and concept contrastive loss for class-incremental learning on MNIST, Fashion-MNIST and SVHN.