Adaptive Monte Carlo via Bandit Allocation

András György; Csaba Szepesvári; Dale Schuurmans; James Neufeld

arxiv: 1405.3318 · v1 · pith:KPGZDQ2Nnew · submitted 2014-05-13 · 💻 cs.AI · cs.LG

keywords carlo monte adaptive allocation approaches bandit estimators problem

We consider the problem of sequentially choosing among a set of unbiased Monte Carlo estimators to minimize the mean squared error (MSE) of a final combined estimate. By reducing this task to a stochastic multi-armed bandit problem, we show that well-developed allocation strategies can be used to achieve an MSE that approaches that of the best estimator chosen in retrospect. We then extend these developments to a scenario where alternative estimators have different, possibly stochastic, costs. The outcome is a new set of adaptive Monte Carlo strategies that provide stronger guarantees than previous approaches while offering practical advantages.
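
The reduction described in the abstract can be illustrated with a minimal sketch; it is not taken from the paper. It assumes every estimator is unbiased for the same quantity, so the estimator with the smallest variance is also the one with the smallest second moment, and the squared sample can serve as a bandit loss. The function name `adaptive_mc`, the `bound` parameter (an assumed bound on the absolute value of any sample), the UCB1-style confidence index, and the two toy estimators are all hypothetical choices made for illustration.

```python
import math
import random


def adaptive_mc(estimators, budget, bound=1.0, seed=0):
    """Allocate `budget` samples among unbiased estimators, bandit style.

    estimators: list of callables taking a random.Random and returning one
                sample; all are assumed to share the same mean and to return
                values in [-bound, bound] (assumptions of this sketch).
    Returns (combined_estimate, counts_per_estimator).
    """
    rng = random.Random(seed)
    k = len(estimators)
    counts = [0] * k      # number of samples drawn from each estimator
    sq_sums = [0.0] * k   # running sums of normalized squared samples
    total = 0.0           # running sum of all samples drawn so far

    def lower_index(i, t):
        # Optimistic (lower) confidence bound on the second moment of arm i.
        mean_sq = sq_sums[i] / counts[i]
        return mean_sq - math.sqrt(2.0 * math.log(t) / counts[i])

    for t in range(1, budget + 1):
        if t <= k:
            arm = t - 1  # draw once from every estimator first
        else:
            # Choose the estimator whose second moment might be smallest.
            arm = min(range(k), key=lambda i: lower_index(i, t))
        x = estimators[arm](rng)
        counts[arm] += 1
        sq_sums[arm] += (x / bound) ** 2
        total += x

    # Combined estimate: plain average of every sample drawn.
    return total / budget, counts


# Two toy unbiased estimators of the same mean (0.2) with different variance.
def noisy(rng):
    return rng.uniform(-0.6, 1.0)


def precise(rng):
    return rng.uniform(0.1, 0.3)


estimate, counts = adaptive_mc([noisy, precise], budget=5000)
```

In this toy run most of the budget should go to `precise`, the lower-variance estimator, so the averaged estimate behaves much like one built from that estimator alone. Handling estimators with different sampling costs, as in the extension the abstract mentions, is outside the scope of this sketch.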
