Restless bandits: Activity allocation in a changing world

article · Whittle, P. · 1988

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach

cs.LG · 2025-02-06 · unverdicted · novelty 5.0

The framework generates training data from a numerical solver for fluid restless multi-armed bandit problems (FRMABPs), applies nonlinear feature transforms, and learns time-dependent policies via OCT-H, achieving a speed-up of up to 26 million times on test problems.

citing papers explorer

Showing 1 of 1 citing paper.

Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach · cs.LG · 2025-02-06 · unverdicted · none · ref 39
