Farewell to item ids: Unlocking the scaling potential of large ranking models via semantic tokens

Zhen Zhao, Tong Zhang, Jie Xu, Qingliang Cai, Qile Zhang, Leyuan Yang, Daorui Xiao, Xiaojia Chang · 2026 · arXiv 2601.22694

3 Pith papers cite this work. Polarity classification for these citations is still being indexed.

representative citing papers

On the Equivalence Between Auto-Regressive Next Token Prediction and Full-Item-Vocabulary Maximum Likelihood Estimation in Generative Recommendation--A Short Note

cs.IR · 2026-04-17 · accept · novelty 7.0

Auto-regressive next-token prediction is strictly equivalent to full-vocabulary maximum likelihood estimation in generative recommendation under bijective item-to-token-sequence mapping.

UxSID: Semantic-Aware User Interests Modeling for Ultra-Long Sequence

cs.AI · 2026-05-09 · unverdicted · novelty 5.0 · 3 refs

UxSID models ultra-long user sequences with semantic-group shared interest memory using Semantic IDs and dual-level attention, achieving state-of-the-art performance and a 0.337% revenue lift in advertising A/B tests.

Sample Is Feature: Beyond Item-Level, Toward Sample-Level Tokens for Unified Large Recommender Models

cs.IR · 2026-04-17 · unverdicted · novelty 5.0 · 2 refs

SIF encodes entire historical raw samples as tokens via hierarchical group-adaptive quantization and token/sample-level mixing to overcome partial encoding and feature heterogeneity limits in scaled recommender models.

citing papers explorer

Showing 3 of 3 citing papers.

On the Equivalence Between Auto-Regressive Next Token Prediction and Full-Item-Vocabulary Maximum Likelihood Estimation in Generative Recommendation--A Short Note

cs.IR · 2026-04-17 · accept · none · ref 15

UxSID: Semantic-Aware User Interests Modeling for Ultra-Long Sequence

cs.AI · 2026-05-09 · unverdicted · none · ref 40 · 3 links

Sample Is Feature: Beyond Item-Level, Toward Sample-Level Tokens for Unified Large Recommender Models

cs.IR · 2026-04-17 · unverdicted · none · ref 28 · 2 links

