
arxiv: 1610.07089 · v1 · pith:232ZBDHMnew · submitted 2016-10-22 · 💻 cs.AI · cs.HC · cs.RO

Reinforcement Learning in Conflicting Environments for Autonomous Vehicles

classification 💻 cs.AI cs.HC cs.RO
keywords learning, reinforcement, autonomous, dilemmas, known, well, adequate, agents

In this work, we investigate the application of Reinforcement Learning to two well-known decision dilemmas, namely Newcomb's Problem and the Prisoner's Dilemma. These problems exemplify the dilemmas that autonomous agents face when interacting with humans. Furthermore, we argue that a Newcomb-like formulation is more adequate in the human-machine interaction case and demonstrate empirically that unmodified Reinforcement Learning algorithms end up with the well-known maximum expected utility solution.
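
The abstract does not describe the experimental setup, so the following is only an illustrative sketch of how a plain value-learning agent behaves on a Newcomb-like problem. Everything in it is an assumption made for illustration: the payoffs (1,000,000 in the opaque box, 1,000 in the transparent box), a predictor that guesses the agent's actual choice correctly with probability `accuracy`, and the hypothetical helper names `newcomb_payoff` and `learn_action_values`. It uses epsilon-greedy action selection with sample-average value updates, which may differ from the algorithms used in the paper.

```python
import random


def newcomb_payoff(action, accuracy, rng):
    """Return the payoff of one round. action: 0 = one-box, 1 = two-box."""
    # Assumed predictor: guesses the agent's actual action with probability `accuracy`.
    predicted = action if rng.random() < accuracy else 1 - action
    # The opaque box is filled only if the predictor expected one-boxing.
    opaque = 1_000_000 if predicted == 0 else 0
    transparent = 1_000
    return opaque if action == 0 else opaque + transparent


def learn_action_values(episodes=20_000, accuracy=0.9, epsilon=0.1, seed=0):
    """Epsilon-greedy learner with sample-average estimates of each action's value."""
    rng = random.Random(seed)
    q = [0.0, 0.0]   # value estimates for one-box and two-box
    counts = [0, 0]  # number of times each action was tried
    for _ in range(episodes):
        if rng.random() < epsilon:
            action = rng.randrange(2)         # explore
        else:
            action = 0 if q[0] >= q[1] else 1  # exploit the current estimate
        reward = newcomb_payoff(action, accuracy, rng)
        counts[action] += 1
        # Incremental mean: the estimate converges to the average observed payoff.
        q[action] += (reward - q[action]) / counts[action]
    return q


q_one_box, q_two_box = learn_action_values()
print(f"one-box estimate: {q_one_box:,.0f}")
print(f"two-box estimate: {q_two_box:,.0f}")
```

Under these assumptions the estimates approach the conditional expected payoffs, `accuracy * 1_000_000` for one-boxing and `1_000 + (1 - accuracy) * 1_000_000` for two-boxing, so the greedy choice follows whichever expected payoff is larger.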
