hub

Unifying generative and dense retrieval for sequential recommendation

Liu Yang, Fabian Paischer, Kaveh Hassani, Jiacheng Li, Shuai Shao, Zhang Gabriel Li, Yun He, Xue Feng, Nima Noorshams, Sem Park, et al · 2024 · arXiv 2411.18814

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

read on arXiv browse 12 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3

citation-polarity summary

background 2 unclear 1

representative citing papers

TokenMinds: Pretrained User Tokens and Embeddings for User Understanding in Large Recommender Systems

cs.IR · 2026-06-23 · unverdicted · novelty 7.0

TokenMinds extends Semantic ID tokenization from items to users, producing paired discrete tokens and dense embeddings via an LLM-adapted encoder-decoder for industrial recommendation.

MLPs are Efficient Distilled Generative Recommenders

cs.IR · 2026-05-12 · unverdicted · novelty 7.0

SID-MLP distills autoregressive generative recommenders into efficient position-specific MLP heads for Semantic ID tasks, achieving 8.74x faster inference with matching accuracy.

Expressiveness Limits of Autoregressive Semantic ID Generation in Generative Recommendation

cs.IR · 2026-05-07 · unverdicted · novelty 7.0

Autoregressive semantic ID generation creates tree-induced probability correlations that prevent generative recommenders from capturing simple patterns; Latte adds latent tokens to relax these correlations.

GenRecEdit: Adapting Model Editing for Generative Recommendation with Cold-Start Items

cs.IR · 2026-03-15 · conditional · novelty 7.0

GenRecEdit injects cold-start items into generative recommendation models via context-aware token editing and interference-reducing triggers, boosting cold-start accuracy while using only 9.5% of retraining time.

UniPinRec: Unifying Generative Retrieval and Ranking at Pinterest Scale

cs.IR · 2026-05-29 · unverdicted · novelty 6.0

UniPinRec unifies retrieval and ranking into a single model and pipeline deployed at Pinterest, reporting +1% engagement lift, 11.1% lower latency, and 63.6% higher QPS.

Conditional Memory Enhanced Item Representation for Generative Recommendation

cs.IR · 2026-05-12 · unverdicted · novelty 6.0

ComeIR introduces dual-level Engram memory and memory-restoring prediction to reconstruct SID-token embeddings and restore token granularity in generative recommendation.

Bridging Textual Profiles and Latent User Embeddings for Personalization

cs.IR · 2026-05-07 · unverdicted · novelty 6.0

BLUE aligns LLM-generated textual user profiles with embedding-based recommendation objectives via reinforcement learning and next-item text supervision, yielding better zero-shot performance and cross-domain transfer than baselines.

CapsID: Soft-Routed Variable-Length Semantic IDs for Generative Recommendation

cs.IR · 2026-05-06 · unverdicted · novelty 6.0

CapsID uses probabilistic capsule routing and confidence-based termination to generate variable-length semantic IDs, improving recall by 9.6% over strong baselines with half the latency of dual-representation systems.

MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches

cs.LG · 2026-04-24 · unverdicted · novelty 6.0

MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.

LWGR: Lagrangian-Constrained Personalized World Knowledge for Generative Recommendation

cs.IR · 2026-04-16 · conditional · novelty 6.0

LWGR applies personalized soft instructions for LLM knowledge extraction and Lagrangian primal-dual optimization to selectively fuse beneficial world knowledge into generative recommendation while bounding degradation.

Sequential Data Augmentation for Generative Recommendation

cs.LG · 2025-09-17 · conditional · novelty 6.0

GenPAS unifies common data augmentation strategies for generative recommendation as special cases of a bias-controlled stochastic sampling process and demonstrates gains in accuracy, data efficiency, and parameter efficiency on benchmarks and industrial data.

Mitigating Collaborative Semantic ID Staleness in Generative Retrieval

cs.IR · 2026-04-14 · unverdicted · novelty 5.0

A model-agnostic SID alignment update mitigates staleness from temporal drift in user-item interactions for generative retrievers, improving Recall@K and nDCG@K while reducing compute by 8-9x versus full retraining.

citing papers explorer

Showing 9 of 9 citing papers after filters.

TokenMinds: Pretrained User Tokens and Embeddings for User Understanding in Large Recommender Systems cs.IR · 2026-06-23 · unverdicted · none · ref 37
TokenMinds extends Semantic ID tokenization from items to users, producing paired discrete tokens and dense embeddings via an LLM-adapted encoder-decoder for industrial recommendation.
MLPs are Efficient Distilled Generative Recommenders cs.IR · 2026-05-12 · unverdicted · none · ref 29
SID-MLP distills autoregressive generative recommenders into efficient position-specific MLP heads for Semantic ID tasks, achieving 8.74x faster inference with matching accuracy.
Expressiveness Limits of Autoregressive Semantic ID Generation in Generative Recommendation cs.IR · 2026-05-07 · unverdicted · none · ref 54
Autoregressive semantic ID generation creates tree-induced probability correlations that prevent generative recommenders from capturing simple patterns; Latte adds latent tokens to relax these correlations.
UniPinRec: Unifying Generative Retrieval and Ranking at Pinterest Scale cs.IR · 2026-05-29 · unverdicted · none · ref 30
UniPinRec unifies retrieval and ranking into a single model and pipeline deployed at Pinterest, reporting +1% engagement lift, 11.1% lower latency, and 63.6% higher QPS.
Conditional Memory Enhanced Item Representation for Generative Recommendation cs.IR · 2026-05-12 · unverdicted · none · ref 47
ComeIR introduces dual-level Engram memory and memory-restoring prediction to reconstruct SID-token embeddings and restore token granularity in generative recommendation.
Bridging Textual Profiles and Latent User Embeddings for Personalization cs.IR · 2026-05-07 · unverdicted · none · ref 17
BLUE aligns LLM-generated textual user profiles with embedding-based recommendation objectives via reinforcement learning and next-item text supervision, yielding better zero-shot performance and cross-domain transfer than baselines.
CapsID: Soft-Routed Variable-Length Semantic IDs for Generative Recommendation cs.IR · 2026-05-06 · unverdicted · none · ref 37
CapsID uses probabilistic capsule routing and confidence-based termination to generate variable-length semantic IDs, improving recall by 9.6% over strong baselines with half the latency of dual-representation systems.
MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches cs.LG · 2026-04-24 · unverdicted · none · ref 38
MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.
Mitigating Collaborative Semantic ID Staleness in Generative Retrieval cs.IR · 2026-04-14 · unverdicted · none · ref 34
A model-agnostic SID alignment update mitigates staleness from temporal drift in user-item interactions for generative retrievers, improving Recall@K and nDCG@K while reducing compute by 8-9x versus full retraining.

Unifying generative and dense retrieval for sequential recommendation

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer