arXiv preprint arXiv:2503.07555

Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference · arXiv 2503.07555

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Active Context Selection Improves Simple Regret in Contextual Bandits

cs.LG · 2026-05-19 · accept · novelty 7.0

Active sampling with allocation q_j proportional to p_j to the 2/3 achieves tight regret sqrt(n/T) times norm of p to the 2/3 for known context distribution p, with improvement up to Theta(k to the 1/4) over passive sampling.

citing papers explorer

Showing 1 of 1 citing paper.

Active Context Selection Improves Simple Regret in Contextual Bandits cs.LG · 2026-05-19 · accept · none · ref 12
