Closing the Gap on the Sample Complexity of 1-Identification

· 2026 · cs.LG · arXiv 2601.15620

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

The 1-identification problem is a fundamental pure-exploration problem in multi-armed bandits. An agent aims to determine whether there exists an arm whose mean reward exceeds a known threshold $\mu_0$, or to output \textsf{None} otherwise. The agent must guarantee correctness with probability at least $1-\delta$, while minimizing the expected number of arm pulls $\mathbb{E}[\tau]$. We study the 1-identification problem and make two main contributions. First, for instances with at least one qualified arm, we derive a new lower bound on $\mathbb{E}[\tau]$ via a novel optimization formulation. Second, we propose a new algorithm and establish upper bounds that match the lower bounds up to polynomial logarithmic factors uniformly over all instances. Our result complements the analysis of $\mathbb{E}\tau$ when there are multiple qualified arms, which is an open problem in the literature.

representative citing papers

Pure Exploration for a Good Policy in Reinforcement Learning with Bandit Feedback

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

Introduces Good Policy Identification (GPI) and BEE-GPI algorithm whose sample complexity for positive instances has log(1/δ) coefficient O(H²/(V*−μ0)²) independent of state and action space sizes.

citing papers explorer

Showing 1 of 1 citing paper.

Pure Exploration for a Good Policy in Reinforcement Learning with Bandit Feedback cs.LG · 2026-05-22 · unverdicted · none · ref 33 · internal anchor
Introduces Good Policy Identification (GPI) and BEE-GPI algorithm whose sample complexity for positive instances has log(1/δ) coefficient O(H²/(V*−μ0)²) independent of state and action space sizes.

Closing the Gap on the Sample Complexity of 1-Identification

fields

years

verdicts

representative citing papers

citing papers explorer