Learning with Value-Ramp
classification
💻 cs.LG
keywords
agent, learning, natural, value-ramp, algorithm, configure, easy, follow
abstract
We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.
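The abstract does not state the update rule, so the following is only a minimal sketch of the ramp intuition under assumed details, not the paper's Value-Ramp algorithm. The chain environment, the `peak` constant, the "one less than the successor" update, and the function name `value_ramp_sketch` are all hypothetical choices made for illustration. Values are kept as natural numbers, and the agent greedily moves toward the neighbor with the higher value.

```python
import random


def value_ramp_sketch(n_states=6, peak=5, episodes=200, seed=0):
    """Toy illustration only: integer values forming a ramp toward a reward.

    Assumptions (not from the source): states lie on a line 0..n_states-1,
    the only reward sits at the right end, and a state's value is raised to
    one less than the value of the state the agent moved to.
    """
    rng = random.Random(seed)
    goal = n_states - 1              # hypothetical rewarding state
    values = [0] * n_states          # natural-number value per state

    for _ in range(episodes):
        s = 0
        for _ in range(4 * n_states):            # cap on episode length
            neighbors = [n for n in (s - 1, s + 1) if 0 <= n < n_states]
            best = max(values[n] for n in neighbors)
            # Follow increasing values; break ties at random (exploration).
            nxt = rng.choice([n for n in neighbors if values[n] == best])
            if nxt == goal:
                values[nxt] = peak               # reward seen: mark the peak
            # Ramp update: one step below the successor, never below zero.
            values[s] = max(values[s], values[nxt] - 1, 0)
            s = nxt
            if s == goal:
                break
    return values


if __name__ == "__main__":
    print(value_ramp_sketch())
```

With these assumed settings the values never exceed the state's index, so once the reward has been found a few times they settle into an increasing sequence that the greedy step can follow from the start state up to the peak.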