pith. sign in

arxiv: 0903.4930 · v1 · submitted 2009-03-28 · 💻 cs.AI · cs.LG· cs.RO

Time manipulation technique for speeding up reinforcement learning in simulations

classification 💻 cs.AI cs.LGcs.RO
keywords learningtimealgorithmsmanipulationreinforcementsimulationspeedingtechnique
0
0 comments X
read the original abstract

A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.