Lower Bound. It is also known that no bandit algorithm can perform better than a certain level, i.e., any algorithm will have to incur some minimum rate of (expected) regret
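For orientation, one classical form of such a lower bound can be written as follows. This is only a sketch, under the assumption that the statement above refers to the asymptotic, instance-dependent bound for stochastic K-armed bandits with single-parameter reward families (for example, Bernoulli rewards); the excerpt itself does not say which bound is meant. The symbols (arms a with reward distributions \nu_a and means \mu_a, best arm distribution \nu^* with mean \mu^*, gaps \Delta_a, horizon T, regret R_T) are introduced here for illustration and do not come from the source.

```latex
% Assumed setting: stochastic K-armed bandit and a consistent policy,
% i.e. expected regret o(T^\alpha) for every \alpha > 0 on every instance.
\[
  \liminf_{T \to \infty} \frac{\mathbb{E}[R_T]}{\ln T}
  \;\ge\; \sum_{a \,:\, \Delta_a > 0} \frac{\Delta_a}{\mathrm{KL}(\nu_a \,\|\, \nu^{*})},
  \qquad \Delta_a = \mu^{*} - \mu_a .
\]
```

Read this way, the "minimum rate" is logarithmic in T: under these assumptions, no consistent algorithm can have expected regret that grows more slowly than a constant times ln T, and that constant is larger when a suboptimal arm is harder to distinguish from the best one.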

that demonstrate the aforementioned nature of shape of regret curve · 1985

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Creator Incentives in Recommender Systems: A Cooperative Game-Theoretic Approach for Stable and Fair Collaboration in Multi-Agent Bandits

cs.LG · 2026-04-09 · unverdicted · novelty 7.0 · ref 6

For homogeneous agents in multi-agent linear bandits, the regret-based TU game is convex, with a non-empty core containing the Shapley value; for heterogeneous agents, a simple regret-based payout lies in the core and satisfies three Shapley axioms.

