TokenFormer unifies multi-field and sequential recommendation modeling via bottom-full-top-sliding attention and non-linear interaction representations to avoid sequential collapse and deliver state-of-the-art performance.
Title resolution pending
6 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 6verdicts
UNVERDICTED 6roles
background 1polarities
background 1representative citing papers
UG-Separation framework disentangles user-side and item-side flows in TokenMixer dense-interaction models to enable reusable user computations, cutting inference latency up to 20% in ByteDance production scenarios.
MBGR is a new generative recommendation framework using business-aware semantic IDs, multi-business prediction, and label dynamic routing to handle multiple businesses without seesaw effects or representation confusion, validated by experiments and deployed at Meituan.
OCARM uses teacher-student distillation to let retention models learn from inaccessible post-conversion content without feature leakage, yielding improvements in offline experiments and online A/B tests.
A2Gen models temporal user action sequences with context-aware attention and autoregressive generation to improve short video recommendation accuracy, showing gains in watch time and retention on large-scale tests.
UniScale couples entire-space data construction with a hierarchical fusion transformer to improve scaling behavior and deliver 1.70% purchase and 2.04% GMV lifts in large-scale e-commerce search A/B tests.
citing papers explorer
-
TokenFormer: Unify the Multi-Field and Sequential Recommendation Worlds
TokenFormer unifies multi-field and sequential recommendation modeling via bottom-full-top-sliding attention and non-linear interaction representations to avoid sequential collapse and deliver state-of-the-art performance.
-
Compute Only Once: UG-Separation for Efficient Large Recommendation Models
UG-Separation framework disentangles user-side and item-side flows in TokenMixer dense-interaction models to enable reusable user computations, cutting inference latency up to 20% in ByteDance production scenarios.
-
MBGR: Multi-Business Prediction for Generative Recommendation at Meituan
MBGR is a new generative recommendation framework using business-aware semantic IDs, multi-business prediction, and label dynamic routing to handle multiple businesses without seesaw effects or representation confusion, validated by experiments and deployed at Meituan.
-
Break the Inaccessible Boundary: Distilling Post-Conversion Content for User Retention Modeling
OCARM uses teacher-student distillation to let retention models learn from inaccessible post-conversion content without feature leakage, yielding improvements in offline experiments and online A/B tests.
-
Action-Aware Generative Sequence Modeling for Short Video Recommendation
A2Gen models temporal user action sequences with context-aware attention and autoregressive generation to improve short video recommendation accuracy, showing gains in watch time and retention on large-scale tests.
-
Joint Model Parameter Scaling and Universal-Domain Data Integration for E-commerce Search Ranking
UniScale couples entire-space data construction with a hierarchical fusion transformer to improve scaling behavior and deliver 1.70% purchase and 2.04% GMV lifts in large-scale e-commerce search A/B tests.