A Note on Information-Directed Sampling and Thompson Sampling

Li Zhou

arxiv: 1503.06902 · v1 · pith:YQQOAST3new · submitted 2015-03-24 · 💻 cs.LG · cs.AI

A Note on Information-Directed Sampling and Thompson Sampling

Li Zhou This is my paper

classification 💻 cs.LG cs.AI

keywords samplingthompsonalgorithmsinformation-directednotethreebanditbayesian

0 comments

read the original abstract

This note introduce three Bayesian style Multi-armed bandit algorithms: Information-directed sampling, Thompson Sampling and Generalized Thompson Sampling. The goal is to give an intuitive explanation for these three algorithms and their regret bounds, and provide some derivations that are omitted in the original papers.

This paper has not been read by Pith yet.

A Note on Information-Directed Sampling and Thompson Sampling

discussion (0)