pith. machine review for the scientific record. sign in

arxiv: 1405.2432 · v1 · submitted 2014-05-10 · 📊 stat.ML · cs.LG

Recognition: unknown

Functional Bandits

Authors on Pith no claims yet
classification 📊 stat.ML cs.LG
keywords functionalmethodproblemachievesadditionapproacharisearm-reward
0
0 comments X
read the original abstract

We introduce the functional bandit problem, where the objective is to find an arm that optimises a known functional of the unknown arm-reward distributions. These problems arise in many settings such as maximum entropy methods in natural language processing, and risk-averse decision-making, but current best-arm identification techniques fail in these domains. We propose a new approach, that combines functional estimation and arm elimination, to tackle this problem. This method achieves provably efficient performance guarantees. In addition, we illustrate this method on a number of important functionals in risk management and information theory, and refine our generic theoretical results in those cases.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Beyond Static Bias: Adaptive Multi-Fidelity Bandits with Improving Proxies

    cs.LG 2026-05 unverdicted novelty 7.0

    TACC algorithm for adaptive multi-fidelity bandits with improving proxies achieves instance-dependent regret by replacing logarithmic high-fidelity pulls with bounded low-fidelity continuation for intermediate arms.