A robust variant of binary search achieves regret O(C + log T) for dynamic pricing with known corruption C and O(C + log² T) when unknown.
Statistical science: a review journal of the Institute of Mathematical Statistics , volume=
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3representative citing papers
Active sampling with allocation q_j proportional to p_j to the 2/3 achieves tight regret sqrt(n/T) times norm of p to the 2/3 for known context distribution p, with improvement up to Theta(k to the 1/4) over passive sampling.
Adapts bandit algorithms to the Cox PH survival model for online treatment optimization under censoring, with theoretical sublinear regret and validation on simulations plus SEER cancer data.
citing papers explorer
-
Toward Optimal Regret in Robust Pricing: Decoupling Corruption and Time
A robust variant of binary search achieves regret O(C + log T) for dynamic pricing with known corruption C and O(C + log² T) when unknown.
-
Active Context Selection Improves Simple Regret in Contextual Bandits
Active sampling with allocation q_j proportional to p_j to the 2/3 achieves tight regret sqrt(n/T) times norm of p to the 2/3 for known context distribution p, with improvement up to Theta(k to the 1/4) over passive sampling.
-
Online Survival Analysis: A Bandit Approach under Cox PH Model
Adapts bandit algorithms to the Cox PH survival model for online treatment optimization under censoring, with theoretical sublinear regret and validation on simulations plus SEER cancer data.