PathNet: Evolution Channels Gradient Descent in Super Neural Networks

Alexander Pritzel; Andrei A. Rusu; Charles Blundell; Chrisantha Fernando; Daan Wierstra; David Ha; Dylan Banarse; Yori Zwols

PathNet: Evolution Channels Gradient Descent in Super Neural Networks

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1701.08734 v1 pith:DS7IVUPT submitted 2017-01-30 cs.NE cs.LG

PathNet: Evolution Channels Gradient Descent in Super Neural Networks

Chrisantha Fernando , Dylan Banarse , Charles Blundell , Yori Zwols , David Ha , Andrei A. Rusu , Alexander Pritzel , Daan Wierstra This is my paper

classification cs.NE cs.LG

keywords networkneuraltasklearningalgorithmlearnedpathnettasks

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

For artificial general intelligence (AGI) it would be efficient if multiple users trained the same giant neural network, permitting parameter reuse, without catastrophic forgetting. PathNet is a first step in this direction. It is a neural network algorithm that uses agents embedded in the neural network whose task is to discover which parts of the network to re-use for new tasks. Agents are pathways (views) through the network which determine the subset of parameters that are used and updated by the forwards and backwards passes of the backpropogation algorithm. During learning, a tournament selection genetic algorithm is used to select pathways through the neural network for replication and mutation. Pathway fitness is the performance of that pathway measured according to a cost function. We demonstrate successful transfer learning; fixing the parameters along a path learned on task A and re-evolving a new population of paths for task B, allows task B to be learned faster than it could be learned from scratch or after fine-tuning. Paths evolved on task B re-use parts of the optimal path evolved on task A. Positive transfer was demonstrated for binary MNIST, CIFAR, and SVHN supervised learning classification tasks, and a set of Atari and Labyrinth reinforcement learning tasks, suggesting PathNets have general applicability for neural network training. Finally, PathNet also significantly improves the robustness to hyperparameter choices of a parallel asynchronous reinforcement learning algorithm (A3C).

discussion (0)

Forward citations

Cited by 17 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MoRe: Modular Representations for Principled Continual Representation Learning on Sequential Data
cs.LG 2026-05 unverdicted novelty 6.0

MoRe identifies modular structure in representations themselves to enable principled reuse, alignment, and expansion of modules during continual adaptation on sequential data.
MILE: Mixture of Incremental LoRA Experts for Continual Semantic Segmentation across Domains and Modalities
cs.CV 2026-05 unverdicted novelty 6.0

MILE combines incremental LoRA experts with prototype-guided gating to support continual semantic segmentation across domains and modalities while adding only a small number of parameters per task.
Learning Without Losing Identity: Capability Evolution for Embodied Agents
cs.RO 2026-04 unverdicted novelty 6.0

Embodied agents maintain persistent identity while evolving modular capabilities through a closed-loop process, raising simulated task success from 32.4% to 91.3% with zero policy drift.
Learning Without Losing Identity: Capability Evolution for Embodied Agents
cs.RO 2026-04 unverdicted novelty 6.0

Embodied agents maintain a persistent identity while evolving capabilities via modular ECMs, raising simulated task success from 32.4% to 91.3% over 20 iterations with zero policy drift or safety violations.
Evidence of an Emergent "Self" in Continual Robot Learning
cs.RO 2026-03 unverdicted novelty 6.0

Continual learning robots form a significantly more stable invariant subnetwork than constant-task controls, and preserving it improves adaptation while damaging it hurts performance.
TACO: Temporal Consensus Optimization for Continual Neural Mapping
cs.RO 2026-02 unverdicted novelty 6.0

TACO reformulates neural implicit mapping as temporal consensus optimization to enable continual adaptation to scene changes without data replay or storage.
A Survey of Continual Reinforcement Learning
cs.LG 2025-06 accept novelty 6.0

The paper surveys CRL literature, proposes a taxonomy of methods into four categories based on knowledge storage and transfer, reviews metrics and benchmarks, and outlines challenges and future research directions.
Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction
cs.LG 2019-06 unverdicted novelty 6.0

CDAN framework uses diversity exploration and adversarial self-correction for continual RL in continuous control, evaluated on new CAM environment with NSD metric showing 18.35% NSD improvement over baseline.
ECA: Efficient Continual Alignment for Open-Ended Image-to-Text Generation
cs.CV 2026-06 unverdicted novelty 5.0

ECA introduces continual alignment with MoQ, FeDEx, and DR for exemplar-free incremental learning in open-ended image-to-text generation, evaluated on four new benchmarks showing reduced forgetting.
On the Stability of Growth in Structural Plasticity
cs.LG 2026-05 unverdicted novelty 5.0

Newborn units in growing neural networks are forward-active but backward-starved, receiving weaker gradients than existing units and creating integration challenges that make growth less reliable than pruning in compl...
On the Stability of Growth in Structural Plasticity
cs.LG 2026-05 unverdicted novelty 5.0

Growth during training inserts new units into a specialized trajectory, making them forward-active but backward-starved with weaker gradients than existing units.
MoRe: Modular Representations for Principled Continual Representation Learning on Sequential Data
cs.LG 2026-05 unverdicted novelty 5.0

MoRe identifies modular representations in sequential data for continual learning with identifiability guarantees, enabling principled adaptation without disrupting old modules.
MoRe: Modular Representations for Principled Continual Representation Learning on Sequential Data
cs.LG 2026-05 unverdicted novelty 5.0

MoRe decomposes representations into identifiable hierarchical modules to enable principled continual adaptation on sequential data.
Incremental learning for audio classification with Hebbian Deep Neural Networks
eess.AS 2026-04 unverdicted novelty 5.0

A kernel plasticity approach in Hebbian DNNs for incremental sound classification achieves 76.3% accuracy over five steps on ESC-50, outperforming the 68.7% baseline without plasticity.
ARROW: Augmented Replay for RObust World models
cs.LG 2026-03 conditional novelty 5.0

ARROW, a DreamerV3-based continual RL method with dual distribution-matching replay buffers, reduces forgetting on sequential Atari while matching forward transfer on Procgen CoinRun variants.
ARROW: Augmented Replay for RObust World models
cs.LG 2026-03 unverdicted novelty 5.0

ARROW adds a distribution-matching long-term replay buffer to DreamerV3 and shows reduced forgetting versus same-size baselines on Atari and Procgen continual RL benchmarks.
Adaptive Reorganization of Neural Pathways for Continual Learning with Spiking Neural Networks
cs.NE 2023-09 unverdicted novelty 4.0

SOR-SNN employs Self-Organizing Regulation networks to reorganize a single SNN into sparse pathways, achieving better performance, energy efficiency, memory use, backward transfer, and self-repair on continual learnin...