GenRec combines page-wise NTP, token compression, and GRPO-SR reinforcement learning to scale generative retrieval, delivering 9.5% click and 8.7% transaction gains in production A/B tests on the JD App.
Title resolution pending
10 Pith papers cite this work. Polarity classification is still indexing.
years
2026 10representative citing papers
TokenFormer unifies multi-field and sequential recommendation modeling via bottom-full-top-sliding attention and non-linear interaction representations to avoid sequential collapse and deliver state-of-the-art performance.
IAT compresses each historical interaction instance into a unified embedding token via temporal-order or user-order schemes, allowing standard sequence models to learn long-range preferences with better performance and transferability.
MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.
UniSID jointly optimizes embeddings and Semantic IDs end-to-end with multi-granularity contrastive learning and summary-based reconstruction, outperforming RQ-based methods by up to 4.62% in Hit Rate for ad recommendation.
GrowthGR combines ItemLTV counterfactual prediction with MultiGR generative retrieval and MoPO optimization to deliver 5.3% new item GMV lift and 0.3% overall GMV gain on Taobao production.
SIF encodes entire historical raw samples as tokens via hierarchical group-adaptive quantization and token/sample-level mixing to overcome partial encoding and feature heterogeneity limits in scaled recommender models.
A re-ranking system for rich-media search that plans query intents from sessions, adds visual signals from VLMs, and uses an LLM to score results on multiple facets before multi-task RL adaptation, with reported gains in engagement after industrial deployment.
UniScale couples entire-space data construction with a hierarchical fusion transformer to improve scaling behavior and deliver 1.70% purchase and 2.04% GMV lifts in large-scale e-commerce search A/B tests.
citing papers explorer
-
GenRec: A Preference-Oriented Generative Framework for Large-Scale Recommendation
GenRec combines page-wise NTP, token compression, and GRPO-SR reinforcement learning to scale generative retrieval, delivering 9.5% click and 8.7% transaction gains in production A/B tests on the JD App.
-
TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds
TokenFormer unifies multi-field and sequential recommendation modeling via bottom-full-top-sliding attention and non-linear interaction representations to avoid sequential collapse and deliver state-of-the-art performance.
-
IAT: Instance-As-Token Compression for Historical User Sequence Modeling in Industrial Recommender Systems
IAT compresses each historical interaction instance into a unified embedding token via temporal-order or user-order schemes, allowing standard sequence models to learn long-range preferences with better performance and transferability.
-
MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches
MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.
-
End-to-End Semantic ID Generation for Generative Advertisement Recommendation
UniSID jointly optimizes embeddings and Semantic IDs end-to-end with multi-granularity contrastive learning and summary-based reconstruction, outperforming RQ-based methods by up to 4.62% in Hit Rate for ad recommendation.
-
Towards Sustainable Growth: A Multi-Value-Aware Retrieval Framework for E-Commerce Search
GrowthGR combines ItemLTV counterfactual prediction with MultiGR generative retrieval and MoPO optimization to deliver 5.3% new item GMV lift and 0.3% overall GMV gain on Taobao production.
-
Sample Is Feature: Beyond Item-Level, Toward Sample-Level Tokens for Unified Large Recommender Models
SIF encodes entire historical raw samples as tokens via hierarchical group-adaptive quantization and token/sample-level mixing to overcome partial encoding and feature heterogeneity limits in scaled recommender models.
-
Rich-Media Re-Ranker: A User Satisfaction-Driven LLM Re-ranking Framework for Rich-Media Search
A re-ranking system for rich-media search that plans query intents from sessions, adds visual signals from VLMs, and uses an LLM to score results on multiple facets before multi-task RL adaptation, with reported gains in engagement after industrial deployment.
-
Joint Model Parameter Scaling and Universal-Domain Data Integration for E-commerce Search Ranking
UniScale couples entire-space data construction with a hierarchical fusion transformer to improve scaling behavior and deliver 1.70% purchase and 2.04% GMV lifts in large-scale e-commerce search A/B tests.
- Deep Interest Mining for Intent-Enriched Semantic IDs in Multimodal Generative Recommendation