Best Arm Identification in Generalized Linear Bandits

Abbas Kazerouni; Lawrence M. Wein

arxiv: 1905.08224 · v1 · pith:VBONOBTSnew · submitted 2019-05-20 · 💻 cs.LG · stat.ML

Best Arm Identification in Generalized Linear Bandits

Abbas Kazerouni , Lawrence M. Wein This is my paper

classification 💻 cs.LG stat.ML

keywords linearbanditsgeneralizedidentificationbest-armproblemsufficientlyvector

0 comments

read the original abstract

Motivated by drug design, we consider the best-arm identification problem in generalized linear bandits. More specifically, we assume each arm has a vector of covariates, there is an unknown vector of parameters that is common across the arms, and a generalized linear model captures the dependence of rewards on the covariate and parameter vectors. The problem is to minimize the number of arm pulls required to identify an arm that is sufficiently close to optimal with a sufficiently high probability. Building on recent progress in best-arm identification for linear bandits (Xu et al. 2018), we propose the first algorithm for best-arm identification for generalized linear bandits, provide theoretical guarantees on its accuracy and sampling efficiency, and evaluate its performance in various scenarios via simulation.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Sequential Experimental Design for Transductive Linear Bandits
stat.ML 2019-06 unverdicted novelty 7.0

Introduces transductive linear bandits, gives instance-dependent lower bounds, and presents an algorithm matching them up to logarithmic factors, including the first non-asymptotic near-optimal method for standard lin...