BCCB unifies learning of heterogeneous ad responses, exploration of uncertain users, and budget pacing into a single online process that works effectively from the first user on the Criteo Uplift dataset.
Improving Thompson sampling via information relaxation for budgeted multi-armed bandits
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Budget-Constrained Causal Bandits: Bridging Uplift Modeling and Sequential Decision-Making
BCCB unifies learning of heterogeneous ad responses, exploration of uncertain users, and budget pacing into a single online process that works effectively from the first user on the Criteo Uplift dataset.