pith. sign in

arxiv: 1802.07239 · v2 · pith:VENYYUQCnew · submitted 2018-02-20 · 💻 cs.AI · cs.LG· cs.NE

Continual Reinforcement Learning with Complex Synapses

classification 💻 cs.AI cs.LGcs.NE
keywords learningcontinualforgettingcatastrophiccomplexknownnetworkneural
0
0 comments X
read the original abstract

Unlike humans, who are capable of continual learning over their lifetimes, artificial neural networks have long been known to suffer from a phenomenon known as catastrophic forgetting, whereby new learning can lead to abrupt erasure of previously acquired knowledge. Whereas in a neural network the parameters are typically modelled as scalar values, an individual synapse in the brain comprises a complex network of interacting biochemical components that evolve at different timescales. In this paper, we show that by equipping tabular and deep reinforcement learning agents with a synaptic model that incorporates this biological complexity (Benna & Fusi, 2016), catastrophic forgetting can be mitigated at multiple timescales. In particular, we find that as well as enabling continual learning across sequential training of two simple tasks, it can also be used to overcome within-task forgetting by reducing the need for an experience replay database.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Continual Reinforcement Learning with Diversity Exploration and Adversarial Self-Correction

    cs.LG 2019-06 unverdicted novelty 6.0

    CDAN framework uses diversity exploration and adversarial self-correction for continual RL in continuous control, evaluated on new CAM environment with NSD metric showing 18.35% NSD improvement over baseline.