A Definition of Happiness for Reinforcement Learning Agents

Jan Leike; Mayank Daswani

arxiv: 1505.04497 · v1 · pith:6WFXEEBEnew · submitted 2015-05-18 · 💻 cs.AI

classification: cs.AI

keywords: definition, happiness, agents, desiderata, difference, learning, reinforcement, value

What is happiness for reinforcement learning agents? We seek a formal definition satisfying a list of desiderata. Our proposed definition of happiness is the temporal difference error, i.e. the difference between the value of the obtained reward and observation and the agent's expectation of this value. This definition satisfies most of our desiderata and is compatible with empirical research on humans. We state several implications and discuss examples.
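To make the abstract's definition concrete, here is a minimal sketch of a one-step temporal difference error in Python. It assumes the standard textbook form, reward plus discounted value of the next state minus the value the agent expected beforehand. The paper's own formalization may differ in detail. The function name `td_error`, the discount of 0.9, and the value estimates are all hypothetical and chosen only for illustration.

```python
def td_error(reward, value_current, value_next, discount=0.9):
    """One-step temporal difference error: (r + discount * V(s')) - V(s).

    Assumption: this is the standard textbook form, not necessarily the
    exact definition used in the paper.
    """
    return reward + discount * value_next - value_current


# Hypothetical value estimates: the agent expects a total value of 1.0 from
# "start", and nothing further once it reaches "goal".
values = {"start": 1.0, "goal": 0.0}

# The same transition with three different obtained rewards.
better_than_expected = td_error(2.0, values["start"], values["goal"])
as_expected = td_error(1.0, values["start"], values["goal"])
worse_than_expected = td_error(0.0, values["start"], values["goal"])

print(better_than_expected, as_expected, worse_than_expected)
```

Under this reading, the sign of the result carries the interpretation: a positive error means the outcome beat the agent's expectation, zero means it matched, and a negative error means it fell short.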
