pith. sign in

arxiv: 1605.07162 · v3 · pith:RMNP3CGSnew · submitted 2016-05-23 · 💻 cs.LG · cs.DS

Pure Exploration of Multi-armed Bandit Under Matroid Constraints

classification 💻 cs.LG cs.DS
keywords problemmatroidbest-basisexplorationidentificationpurearmsbandit
0
0 comments X p. Extension
pith:RMNP3CGS Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{RMNP3CGS}

Prints a linked pith:RMNP3CGS badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

We study the pure exploration problem subject to a matroid constraint (Best-Basis) in a stochastic multi-armed bandit game. In a Best-Basis instance, we are given $n$ stochastic arms with unknown reward distributions, as well as a matroid $\mathcal{M}$ over the arms. Let the weight of an arm be the mean of its reward distribution. Our goal is to identify a basis of $\mathcal{M}$ with the maximum total weight, using as few samples as possible. The problem is a significant generalization of the best arm identification problem and the top-$k$ arm identification problem, which have attracted significant attentions in recent years. We study both the exact and PAC versions of Best-Basis, and provide algorithms with nearly-optimal sample complexities for these versions. Our results generalize and/or improve on several previous results for the top-$k$ arm identification problem and the combinatorial pure exploration problem when the combinatorial constraint is a matroid.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.