Time manipulation technique for speeding up reinforcement learning in simulations

Fangyan Dong; Kaoru Hirota; Kohei Nomoto; Petar Kormushev

arxiv: 0903.4930 · v1 · submitted 2009-03-28 · 💻 cs.AI · cs.LG· cs.RO

Time manipulation technique for speeding up reinforcement learning in simulations

Petar Kormushev , Kohei Nomoto , Fangyan Dong , Kaoru Hirota This is my paper

classification 💻 cs.AI cs.LGcs.RO

keywords learningtimealgorithmsmanipulationreinforcementsimulationspeedingtechnique

0 comments

read the original abstract

A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.

This paper has not been read by Pith yet.

Time manipulation technique for speeding up reinforcement learning in simulations

discussion (0)