Learning with Value-Ramp

Jan Van den Bussche; Tom J. Ameloot

arxiv: 1608.03647 · v2 · pith:SH7OJNZ6new · submitted 2016-08-12 · 💻 cs.LG

classification 💻 cs.LG

keywords agent · learning · natural · value-ramp · algorithm · configure · easy · follow

We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.
