A Linear Programming Relaxation and a Heuristic for the Restless Bandit Problem with General Switching Costs
classification
🧮 math.OC
keywords
relaxationbanditcostsheuristicproblemprogrammingrestlessswitching
read the original abstract
We extend a relaxation technique due to Bertsimas and Nino-Mora for the restless bandit problem to the case where arbitrary costs penalize switching between the bandits. We also construct a one-step lookahead policy using the solution of the relaxation. Computational experiments and a bound for approximate dynamic programming provide some empirical support for the heuristic.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.