Let B′ be the set of agent-time tuples ( a, t)-s whose samplesy a,t-s remain in the buffer at the end

It remains in the buffer at the end of time horizon t = T

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Creator Incentives in Recommender Systems: A Cooperative Game-Theoretic Approach for Stable and Fair Collaboration in Multi-Agent Bandits

cs.LG · 2026-04-09 · unverdicted · novelty 7.0

For homogeneous agents in multi-agent linear bandits the regret-based TU game is convex with non-empty core containing the Shapley value; for heterogeneous agents a simple regret-based payout lies in the core and satisfies three Shapley axioms.

citing papers explorer

Showing 1 of 1 citing paper.

Creator Incentives in Recommender Systems: A Cooperative Game-Theoretic Approach for Stable and Fair Collaboration in Multi-Agent Bandits cs.LG · 2026-04-09 · unverdicted · none · ref 13
For homogeneous agents in multi-agent linear bandits the regret-based TU game is convex with non-empty core containing the Shapley value; for heterogeneous agents a simple regret-based payout lies in the core and satisfies three Shapley axioms.

Let B′ be the set of agent-time tuples ( a, t)-s whose samplesy a,t-s remain in the buffer at the end

fields

years

verdicts

representative citing papers

citing papers explorer