Functional Bandits

Long Tran-Thanh , Jia Yuan Yu

Authors on Pith no claims yet

classification 📊 stat.ML cs.LG

keywords functionalmethodproblemachievesadditionapproacharisearm-reward

read the original abstract

We introduce the functional bandit problem, where the objective is to find an arm that optimises a known functional of the unknown arm-reward distributions. These problems arise in many settings such as maximum entropy methods in natural language processing, and risk-averse decision-making, but current best-arm identification techniques fail in these domains. We propose a new approach, that combines functional estimation and arm elimination, to tackle this problem. This method achieves provably efficient performance guarantees. In addition, we illustrate this method on a number of important functionals in risk management and information theory, and refine our generic theoretical results in those cases.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Static Bias: Adaptive Multi-Fidelity Bandits with Improving Proxies
cs.LG 2026-05 unverdicted novelty 7.0

TACC algorithm for adaptive multi-fidelity bandits with improving proxies achieves instance-dependent regret by replacing logarithmic high-fidelity pulls with bounded low-fidelity continuation for intermediate arms.