Pareto regret in multi-objective bandits matches the single-objective case by scaling inversely with the largest objective-wise gap g†, independent of dimension d, via a new top-two races and uncertainty-greedy algorithm with matching bounds.
Evolutionary computation for large-scale multi-objective optimization: A decade of progresses
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Are Stochastic Multi-objective Bandits Harder than Single-objective Bandits?
Pareto regret in multi-objective bandits matches the single-objective case by scaling inversely with the largest objective-wise gap g†, independent of dimension d, via a new top-two races and uncertainty-greedy algorithm with matching bounds.