Robby is Not a Robber (anymore): On the Use of Institutions for Learning Normative Behavior

Alessandro Saffiotti; Federico Pecora; Stevan Tomic

arxiv: 1908.02138 · v1 · pith:U4WNOX2Inew · submitted 2019-08-01 · 💻 cs.LG · cs.AI· cs.CY· cs.MA· stat.ML

Robby is Not a Robber (anymore): On the Use of Institutions for Learning Normative Behavior

Stevan Tomic , Federico Pecora , Alessandro Saffiotti This is my paper

classification 💻 cs.LG cs.AIcs.CYcs.MAstat.ML

keywords normslearningsocialhumanknowledgenormativeachievebehavior

0 comments

read the original abstract

Future robots should follow human social norms in order to be useful and accepted in human society. In this paper, we leverage already existing social knowledge in human societies by capturing it in our framework through the notion of social norms. We show how norms can be used to guide a reinforcement learning agent towards achieving normative behavior and apply the same set of norms over different domains. Thus, we are able to: (1) provide a way to intuitively encode social knowledge (through norms); (2) guide learning towards normative behaviors (through an automatic norm reward system); and (3) achieve a transfer of learning by abstracting policies; Finally, (4) the method is not dependent on a particular RL algorithm. We show how our approach can be seen as a means to achieve abstract representation and learn procedural knowledge based on the declarative semantics of norms and discuss possible implications of this in some areas of cognitive science.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline
cs.AI 2026-03 unverdicted novelty 5.0

The thesis presents Pino, an end-to-end pipeline that supervises reinforcement learning agents with argumentation-based normative advisors, introduces an algorithm for automatic argument extraction, and defines a miti...