OPLS is a new off-policy method for contextual bandits with limited supply that outperforms greedy approaches by prioritizing items with higher relative expected rewards for the current user.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
dataset 1polarities
use dataset 1representative citing papers
APG4RecSim automatically generates realistic user profiles for LLM-based recommendation simulations, outperforming manual baselines by up to 7% in nDCG@10 and 8% in JSD on three benchmark datasets.
DPAA mitigates popularity bias in GNN-based collaborative filtering by integrating adaptive embedding-aware interaction weighting stabilized from pre-trained embeddings and layer-wise amplification of higher-order neighborhoods, outperforming prior debiasing methods on real and semi-synthetic data.
citing papers explorer
-
Off-Policy Learning with Limited Supply
OPLS is a new off-policy method for contextual bandits with limited supply that outperforms greedy approaches by prioritizing items with higher relative expected rewards for the current user.
-
Task-Aware Automated User Profile Generation for Recommendation Simulation Using Large Language Models
APG4RecSim automatically generates realistic user profiles for LLM-based recommendation simulations, outperforming manual baselines by up to 7% in nDCG@10 and 8% in JSD on three benchmark datasets.
-
Debiasing Message Passing to Mitigate Popularity Bias in GNN-based Collaborative Filtering
DPAA mitigates popularity bias in GNN-based collaborative filtering by integrating adaptive embedding-aware interaction weighting stabilized from pre-trained embeddings and layer-wise amplification of higher-order neighborhoods, outperforming prior debiasing methods on real and semi-synthetic data.