Variations in user state embeddings for CMAB recommenders can improve performance more than changing the bandit algorithm, with no embedding or aggregation strategy dominating across datasets.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
Greedy linear models without exploration consistently achieve top-tier performance in over 90% of offline dataset evaluations for linear bandit recommenders, with hyperparameter tuning favoring minimal exploration and exposing biases in these protocols.
Controlled personalization combining editorial curation with modest algorithmic recommendations in legacy news increases engagement, diversity, and reduces popularity bias per an A/B test.
citing papers explorer
-
The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems
Variations in user state embeddings for CMAB recommenders can improve performance more than changing the bandit algorithm, with no embedding or aggregation strategy dominating across datasets.
-
Exploitation Over Exploration: Unmasking the Bias in Linear Bandit Recommender Offline Evaluation
Greedy linear models without exploration consistently achieve top-tier performance in over 90% of offline dataset evaluations for linear bandit recommenders, with hyperparameter tuning favoring minimal exploration and exposing biases in these protocols.
-
Controlled Personalization in Legacy Media Online Services: A Case Study in News Recommendation
Controlled personalization combining editorial curation with modest algorithmic recommendations in legacy news increases engagement, diversity, and reduces popularity bias per an A/B test.