pith. sign in

arxiv: 1103.5818 · v1 · pith:C35LZ4J2new · submitted 2011-03-30 · 🧮 math.PR

Reinforcement learning in signaling game

classification 🧮 math.PR
keywords actscorrespondencegamegraphlimitsignalingsignalsskyrms
0
0 comments X
read the original abstract

We consider a signaling game originally introduced by Skyrms, which models how two interacting players learn to signal each other and thus create a common language. The first rigorous analysis was done by Argiento, Pemantle, Skyrms and Volkov (2009) with 2 states, 2 signals and 2 acts. We study the case of M_1 states, M_2 signals and M_1 acts for general M_1, M_2. We prove that the expected payoff increases in average and thus converges a.s., and that a limit bipartite graph emerges, such that no signal-state correspondence is associated to both a synonym and an informational bottleneck. Finally, we show that any graph correspondence with the above property is a limit configuration with positive probability.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.