Maximum a posteriori learning in demand competition games

Mohsen Rakhshan

arxiv: 1611.10270 · v1 · pith:HFGEU2MOnew · submitted 2016-11-28 · 💻 cs.GT · math.OC

Maximum a posteriori learning in demand competition games

Mohsen Rakhshan This is my paper

classification 💻 cs.GT math.OC

keywords nashplayerscompetitiongamelearnmaximumopponentpolicy

0 comments

read the original abstract

We consider an inventory competition game between two firms. The question we address is this: If players do not know the opponent's action and opponent's utility function can they learn to play the Nash policy in a repeated game by observing their own sales? In this work it is proven that by means of Maximum A Posteriori (MAP) estimation, players can learn the Nash policy. It is proven that players' actions and beliefs do converge to the Nash equilibrium.

This paper has not been read by Pith yet.

Maximum a posteriori learning in demand competition games

discussion (0)