pith. sign in

arxiv: 1705.07460 · v1 · pith:SSY2KU53new · submitted 2017-05-21 · 💻 cs.AI

Experience enrichment based task independent reward model

classification 💻 cs.AI
keywords rewardlearningagentsdefinedimplicitindependentmanuallymodel
0
0 comments X
read the original abstract

For most reinforcement learning approaches, the learning is performed by maximizing an accumulative reward that is expectedly and manually defined for specific tasks. However, in real world, rewards are emergent phenomena from the complex interactions between agents and environments. In this paper, we propose an implicit generic reward model for reinforcement learning. Unlike those rewards that are manually defined for specific tasks, such implicit reward is task independent. It only comes from the deviation from the agents' previous experiences.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.