A penalized bandit algorithm

Damien Lamberton (LAMA); Gilles Pag\`es (PMA)

arxiv: math/0510384 · v1 · pith:ZJSPQ635new · submitted 2005-10-18 · 🧮 math.PR

A penalized bandit algorithm

Damien Lamberton (LAMA) , Gilles Pag\`es (PMA) This is my paper

classification 🧮 math.PR

keywords algorithmconvergencedistributionlimitarmed-banditbanditcentralcharacterized

0 comments

read the original abstract

We study a two armed-bandit algorithm with penalty. We show the convergence of the algorithm and establish the rate of convergence. For some choices of the parameters, we obtain a central limit theorem in which the limit distribution is characterized as the unique stationary distribution of a discontinuous Markov process.

This paper has not been read by Pith yet.

A penalized bandit algorithm

discussion (0)