Pseudorehearsal in value function approximation

Leonard Johard; Manuel Mazzara; Vladimir Marochko

arxiv: 1703.07075 · v1 · pith:SESVE3OZnew · submitted 2017-03-21 · 💻 cs.AI

Pseudorehearsal in value function approximation

Vladimir Marochko , Leonard Johard , Manuel Mazzara This is my paper

classification 💻 cs.AI

keywords pseudorehearsalapproximationfunctionlearningapproachesassistbalancingcatastrophic

0 comments

read the original abstract

Catastrophic forgetting is of special importance in reinforcement learning, as the data distribution is generally non-stationary over time. We study and compare several pseudorehearsal approaches for Q-learning with function approximation in a pole balancing task. We have found that pseudorehearsal seems to assist learning even in such very simple problems, given proper initialization of the rehearsal parameters.

This paper has not been read by Pith yet.

Pseudorehearsal in value function approximation

discussion (0)