pith. sign in

arxiv: 1705.08110 · v3 · pith:JQ5YMYLYnew · submitted 2017-05-23 · 💻 cs.LG

Combinatorial Semi-Bandits with Knapsacks

classification 💻 cs.LG
keywords combinatorialbanditsalgorithmknapsackslimitedsemi-banditsactionsadditional
0
0 comments X
read the original abstract

We unify two prominent lines of work on multi-armed bandits: bandits with knapsacks (BwK) and combinatorial semi-bandits. The former concerns limited "resources" consumed by the algorithm, e.g., limited supply in dynamic pricing. The latter allows a huge number of actions but assumes combinatorial structure and additional feedback to make the problem tractable. We define a common generalization, support it with several motivating examples, and design an algorithm for it. Our regret bounds are comparable with those for BwK and combinatorial semi- bandits.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.