Reinforcement learning based recommender systems: A survey. ACM Computing Surveys, 55(7):1–38, 2022

M Mehdi Afsar, Trafford Crump, Behrouz Far · 2022

3 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

PREFER: Personalized Review Summarization with Online Preference Learning

cs.AI · 2026-05-07 · unverdicted · novelty 6.0

PREFER is an online preference learning system that generates personalized review summaries and improves alignment with user interests in simulations on Amazon review data.

Corruption-Tolerant Asynchronous Q-Learning with Near-Optimal Rates

cs.LG · 2025-09-10 · unverdicted · novelty 6.0

A novel robust asynchronous Q-learning algorithm achieves finite-time convergence rates that match clean-data bounds up to an additive term proportional to the corruption fraction, with a matching information-theoretic lower bound.

Time-Constrained Recommendations: Reinforcement Learning Strategies for E-Commerce

cs.LG · 2025-12-13 · unverdicted · novelty 4.0

Reinforcement learning policies for time-constrained slate recommendations improve engagement over contextual bandits in e-commerce settings.

citing papers explorer

Showing 3 of 3 citing papers.

PREFER: Personalized Review Summarization with Online Preference Learning
cs.AI · 2026-05-07 · unverdicted · none · ref 2
Corruption-Tolerant Asynchronous Q-Learning with Near-Optimal Rates
cs.LG · 2025-09-10 · unverdicted · none · ref 3
Time-Constrained Recommendations: Reinforcement Learning Strategies for E-Commerce
cs.LG · 2025-12-13 · unverdicted · none · ref 5
