Retrieval coverage limits LLM rerankers in cold-start recommendation; a learned hybrid fusion improves pool quality but LLM reranking often degrades end-to-end performance while simpler rankers exploit the pool.
hub Mixed citations
Maxwell Harper and Joseph A
Mixed citation behavior. Most common role is background (50%).
hub tools
citation-role summary
citation-polarity summary
representative citing papers
DINOSAUR augments ANN indices with sampled embeddings to marginalize uncertainty, recovering standard retrieval at zero uncertainty while expanding coverage for uncertain items.
F-GRPO factorizes group-relative policy optimization into generation and ranking phases within one autoregressive sequence, using order-invariant coverage and position-aware utility rewards to improve top-ranked performance on recommendation and multi-hop QA tasks.
A Bayesian predictive model adaptively selects martingale factors to construct asymptotically log-optimal confidence sequences for bounded means while preserving anytime validity under misspecification.
InvariRank achieves permutation-invariant listwise reranking for LLM-based recommendations via a structured attention mask that blocks cross-candidate interactions and shared positional framing under RoPE, enabling stable rankings in one forward pass.
GREW uses a secret-key-driven green-red item partition and three ranking-integrated modules to embed verifiable watermarks in recommender systems that resist extraction attacks without data injection.
HORIZON creates a cross-domain, long-horizon user modeling benchmark from Amazon Reviews that tests generalization across time, domains, and unseen users, exposing gaps in sequential and LLM-based recommendation models.
For homogeneous agents in multi-agent linear bandits the regret-based TU game is convex with non-empty core containing the Shapley value; for heterogeneous agents a simple regret-based payout lies in the core and satisfies three Shapley axioms.
LLMAR applies LLM reasoning with a self-correction reflection loop to generate semantic user motives for tuning-free recommendations, showing up to 54.6% nDCG@10 gains on a sparse industrial dataset over trained baselines.
Scoring functions are sub-optimal for all utility-fairness trade-offs in ranking under a generic fairness formulation, but semi-greedy post-processing can approach the performance of exhaustive post-processing.
Develops TWSF estimator for causal forecasting in panel data by combining synthetic controls with time-series models under low-rank latent factor assumptions, providing finite-sample bounds and asymptotic normality.
SOLANET is a distributed GPU toolkit for neighbor graph construction that reports 11X speedup on 512 APUs for 1B points and 6.9X for 2B points.
Interventions in LLM-simulated user experiments induce distribution shifts in latent attributes that create confounding bias, diagnosable with negative control outcomes and partially mitigated by adding setting-relevant persona details.
APG4RecSim automatically generates realistic user profiles for LLM-based recommendation simulations, outperforming manual baselines by up to 7% in nDCG@10 and 8% in JSD on three benchmark datasets.
Develops COF algorithm for MAB-CS that intelligently checks cheap arm feasibility by pooling samples, with generalized instance-dependent lower bounds and matching upper bounds on cumulative cost and quality regret.
Graphify automates synthesis of type-safe graph backends via a formal GraphQL-to-Gremlin mapping and O(S) recursive transpilation algorithm supporting CRUD and nested queries.
GloRank reformulates list-wise reranking as token generation over a global item identifier space, using supervised pre-training followed by reinforcement learning to maximize list-wise utility and outperforming baselines on benchmarks and industrial data.
WPGRec is a new sequential recommender that performs multi-scale temporal modeling via stationary wavelet packets and injects high-order collaborative information through scale-aligned graph propagation with energy-aware gated fusion.
A2G-DiffRec applies adaptive autoguidance in diffusion recommenders, learning to balance main and weak model outputs via fairness-aware regularization to improve item exposure fairness with only marginal accuracy loss.
Introduces bounded fake data injection attacks that force a class of stochastic bandit algorithms to select a target arm in nearly all rounds at sublinear attack cost.
Group RC-DMC extends RC-DMC by adding Set-Transformer group aggregation, low-rank regularization via nuclear-norm proximal steps, and a low-rank decoder to improve group-level RMSE on MovieLens and Goodbooks while staying competitive on precision, recall, and F1.
ILASP approximates neural networks for recipe preference learning as both global and local models, using weak constraints and PCA to maintain fidelity and interpretability.
SAILRec uses dual-side semantic alignment and hierarchical attention steering to improve how LLMs incorporate collaborative embeddings for recommendations, outperforming baselines on MovieLens-1M and Amazon-Book datasets.
The thesis identifies theoretical, empirical, and conceptual flaws in offline fairness measures for recommender systems and contributes new evaluation methods and practical guidelines.
citing papers explorer
No citing papers match the current filters.