pith. sign in

arxiv: 1608.03647 · v2 · pith:SH7OJNZ6new · submitted 2016-08-12 · 💻 cs.LG

Learning with Value-Ramp

classification 💻 cs.LG
keywords agentlearningnaturalvalue-rampalgorithmconfigureeasyfollow
0
0 comments X
read the original abstract

We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.