Nonlinear stochastic multiarmed bandit problems with inexact oracle
classification
🧮 math.OC
keywords
inexactoraclepointproblemsconsidergeneralizemultiarmedresults
read the original abstract
In the paper we consider one point and two point multiarmed bamdit problems. In other words we consider the online stochastic convex optimization problems with oracle that return the value (realization) of the function at one point or at two points. We allow these values to be inexact, but the level of noise should be small enough. We generalize well known results for inexact oracle case. And we also generalize classical results to prox-structures differ from euclidian.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.