Reinitializing weights vs units for maintaining plasticity in neural networks

J Fernando Hernandez-Garcia, Shibhansh Dohare, Jun Luo, Rich S Sutton · 2025 · arXiv 2508.00212

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Learning to Forget: Continual Learning with Adaptive Weight Decay

cs.LG · 2026-04-29 · unverdicted · novelty 6.0

FADE adapts per-parameter weight decay rates online via approximate meta-gradient descent to improve controlled forgetting over fixed decay in online tracking and streaming classification.

Attribution-Based Neuron Utility for Plasticity Restoration in Deep Networks

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

GXD estimates the first-order functional cost of replacing a neuron via gradient attribution to make adaptive resets more reliable for preserving plasticity in continual learning.

citing papers explorer

Showing 2 of 2 citing papers.

Learning to Forget: Continual Learning with Adaptive Weight Decay cs.LG · 2026-04-29 · unverdicted · none · ref 15
FADE adapts per-parameter weight decay rates online via approximate meta-gradient descent to improve controlled forgetting over fixed decay in online tracking and streaming classification.
Attribution-Based Neuron Utility for Plasticity Restoration in Deep Networks cs.LG · 2026-05-07 · unverdicted · none · ref 8
GXD estimates the first-order functional cost of replacing a neuron via gradient attribution to make adaptive resets more reliable for preserving plasticity in continual learning.

Reinitializing weights vs units for maintaining plasticity in neural networks

fields

years

verdicts

representative citing papers

citing papers explorer