pith. sign in

arxiv: 1704.04912 · v1 · pith:UZ6YZRDSnew · submitted 2017-04-17 · 💻 cs.AI

Pseudorehearsal in actor-critic agents

classification 💻 cs.AI
keywords pseudorehearsalactor-criticlearningagentagentsapproachesapproximationassists
0
0 comments X
read the original abstract

Catastrophic forgetting has a serious impact in reinforcement learning, as the data distribution is generally sparse and non-stationary over time. The purpose of this study is to investigate whether pseudorehearsal can increase performance of an actor-critic agent with neural-network based policy selection and function approximation in a pole balancing task and compare different pseudorehearsal approaches. We expect that pseudorehearsal assists learning even in such very simple problems, given proper initialization of the rehearsal parameters.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.