
arxiv: 1407.7241 · v1 · pith:W6FVUIT6new · submitted 2014-07-27 · 🧮 math.PR

Bandit problems with Levy processes

classification 🧮 math.PR
keywords problems, bandit, cut-off, levy, optimal, strategy, types, continuous

Bandit problems model the trade-off between exploration and exploitation in various decision problems. We study two-armed bandit problems in continuous time, where the risky arm can have two types: High or Low; both types yield stochastic payoffs generated by a Levy process. We show that the optimal strategy is a cut-off strategy and we provide an explicit expression for the cut-off and for the optimal payoff.
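As a rough illustration of what a cut-off strategy looks like, the sketch below treats only one special case of a Lévy process: Brownian motion with drift. That restriction, the function names `posterior_high` and `choose_arm`, and the parameters `mu_high`, `mu_low`, `sigma`, and `cutoff` are assumptions made for the example and are not taken from the paper. In particular, `cutoff` is a free input here; the paper's explicit expression for it is not reproduced.

```python
import math


def posterior_high(prior, x_t, t, mu_high, mu_low, sigma):
    """Posterior probability that the risky arm is of type High.

    Assumes (for illustration only) that cumulative payoffs follow a
    Brownian motion with drift mu_high or mu_low and volatility sigma,
    and that x_t is the cumulative payoff observed after operating the
    risky arm for total time t. Requires 0 < prior < 1.
    """
    # Log-likelihood ratio of High versus Low given the observed path end point.
    log_lr = ((mu_high - mu_low) / sigma**2) * x_t \
        - ((mu_high**2 - mu_low**2) / (2 * sigma**2)) * t
    # Bayes' rule in log-odds form, evaluated in a numerically stable way.
    log_odds = math.log(prior / (1 - prior)) + log_lr
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    e = math.exp(log_odds)
    return e / (1.0 + e)


def choose_arm(belief, cutoff):
    """Cut-off rule: use the risky arm only while the belief is at or above the cut-off."""
    return "risky" if belief >= cutoff else "safe"


if __name__ == "__main__":
    # Hypothetical numbers: prior 0.5, payoff 3.0 observed over one unit of time.
    belief = posterior_high(0.5, x_t=3.0, t=1.0, mu_high=1.0, mu_low=0.0, sigma=1.0)
    print(belief, choose_arm(belief, cutoff=0.4))
```

In this simplified setting the belief is a sufficient statistic, so the decision depends on the history only through `belief`; that is the sense in which the strategy is a cut-off rule.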
