Onepiece: Bringing context engineering and reasoning to industrial cascade ranking system.arXiv preprint arXiv:2509.18091

Sunhao Dai, Jiakai Tang, Jiahua Wu, Kun Wang, Yuxuan Zhu, Bingjun Chen, Bangyang Hong, Yu Zhao, Cong Fu, Kangle Wu, et al · 2025 · arXiv 2509.18091

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

read on arXiv browse 11 citing papers

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

Beyond Static Best-of-N: Bayesian List-wise Alignment for LLM-based Recommendation

cs.IR · 2026-05-06 · conditional · novelty 7.0

BLADE uses Bayesian list-wise alignment with dynamic estimation to create a self-evolving target that overcomes limitations of static references in LLM-based recommendation, yielding sustained gains in ranking and complex metrics.

Limitations of LTI Koopman Modeling for Nonlinear Control Systems

math.OC · 2026-04-28 · unverdicted · novelty 7.0

Exact LTI Koopman models for nonlinear control systems require affine linear dynamics under controllability and coordinate inclusion assumptions.

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

cs.IR · 2026-04-21 · unverdicted · novelty 7.0

LoopCTR trains CTR models with recursive layer reuse and process supervision so that zero-loop inference outperforms baselines on public and industrial datasets.

GenRec: A Preference-Oriented Generative Framework for Large-Scale Recommendation

cs.IR · 2026-04-16 · unverdicted · novelty 7.0

GenRec combines page-wise NTP, token compression, and GRPO-SR reinforcement learning to scale generative retrieval, delivering 9.5% click and 8.7% transaction gains in production A/B tests on the JD App.

S$^2$GR: Stepwise Semantic-Guided Reasoning in Latent Space for Generative Recommendation

cs.IR · 2026-01-26 · unverdicted · novelty 7.0

S²GR adds stepwise thinking tokens with contrastive supervision on codebook clusters to balance computational focus and ground reasoning paths in generative recommendation.

Intuition-Guided Latent Reasoning for LLM-Based Recommendation

cs.IR · 2026-06-26 · unverdicted · novelty 6.0

IntuRec anchors LLM latent reasoning for recommendation by deriving an intuition embedding from top-K candidates via self- and cross-attention to initialize more accurate trajectories.

UniPinRec: Unifying Generative Retrieval and Ranking at Pinterest Scale

cs.IR · 2026-05-29 · unverdicted · novelty 6.0

UniPinRec unifies retrieval and ranking into a single model and pipeline deployed at Pinterest, reporting +1% engagement lift, 11.1% lower latency, and 63.6% higher QPS.

From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space

cs.IR · 2026-04-28 · unverdicted · novelty 6.0

GloRank reformulates list-wise reranking as token generation over a global item identifier space, using supervised pre-training followed by reinforcement learning to maximize list-wise utility and outperforming baselines on benchmarks and industrial data.

MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches

cs.LG · 2026-04-24 · unverdicted · novelty 6.0

MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.

SSRLive: Live Streaming Recommendation with Dynamic Semantic ID

cs.IR · 2026-06-05 · unverdicted · novelty 5.0

SSRLive combines generative and discriminative modules with dynamic semantic IDs to improve live streaming recommendations, reporting gains of +3.38% watch time, +0.72% GMV, +3.12% follower growth, and +2.92% interaction volume in online A/B tests.

Token Factory: Efficiently Integrating Diverse Signals into Large Recommendation Models

cs.IR · 2026-06-17 · unverdicted · novelty 4.0

Token Factory transforms traditional signals into soft tokens for efficient integration and compression into Large Recommendation Models, avoiding prompt length explosion while enhancing performance.

citing papers explorer

Showing 11 of 11 citing papers.

Beyond Static Best-of-N: Bayesian List-wise Alignment for LLM-based Recommendation cs.IR · 2026-05-06 · conditional · none · ref 12
BLADE uses Bayesian list-wise alignment with dynamic estimation to create a self-evolving target that overcomes limitations of static references in LLM-based recommendation, yielding sustained gains in ranking and complex metrics.
Limitations of LTI Koopman Modeling for Nonlinear Control Systems math.OC · 2026-04-28 · unverdicted · none · ref 8
Exact LTI Koopman models for nonlinear control systems require affine linear dynamics under controllability and coordinate inclusion assumptions.
LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction cs.IR · 2026-04-21 · unverdicted · none · ref 3
LoopCTR trains CTR models with recursive layer reuse and process supervision so that zero-loop inference outperforms baselines on public and industrial datasets.
GenRec: A Preference-Oriented Generative Framework for Large-Scale Recommendation cs.IR · 2026-04-16 · unverdicted · none · ref 4
GenRec combines page-wise NTP, token compression, and GRPO-SR reinforcement learning to scale generative retrieval, delivering 9.5% click and 8.7% transaction gains in production A/B tests on the JD App.
S$^2$GR: Stepwise Semantic-Guided Reasoning in Latent Space for Generative Recommendation cs.IR · 2026-01-26 · unverdicted · none · ref 8
S²GR adds stepwise thinking tokens with contrastive supervision on codebook clusters to balance computational focus and ground reasoning paths in generative recommendation.
Intuition-Guided Latent Reasoning for LLM-Based Recommendation cs.IR · 2026-06-26 · unverdicted · none · ref 11
IntuRec anchors LLM latent reasoning for recommendation by deriving an intuition embedding from top-K candidates via self- and cross-attention to initialize more accurate trajectories.
UniPinRec: Unifying Generative Retrieval and Ranking at Pinterest Scale cs.IR · 2026-05-29 · unverdicted · none · ref 11
UniPinRec unifies retrieval and ranking into a single model and pipeline deployed at Pinterest, reporting +1% engagement lift, 11.1% lower latency, and 63.6% higher QPS.
From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space cs.IR · 2026-04-28 · unverdicted · none · ref 8
GloRank reformulates list-wise reranking as token generation over a global item identifier space, using supervised pre-training followed by reinforcement learning to maximize list-wise utility and outperforming baselines on benchmarks and industrial data.
MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches cs.LG · 2026-04-24 · unverdicted · none · ref 6
MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.
SSRLive: Live Streaming Recommendation with Dynamic Semantic ID cs.IR · 2026-06-05 · unverdicted · none · ref 6
SSRLive combines generative and discriminative modules with dynamic semantic IDs to improve live streaming recommendations, reporting gains of +3.38% watch time, +0.72% GMV, +3.12% follower growth, and +2.92% interaction volume in online A/B tests.
Token Factory: Efficiently Integrating Diverse Signals into Large Recommendation Models cs.IR · 2026-06-17 · unverdicted · none · ref 3
Token Factory transforms traditional signals into soft tokens for efficient integration and compression into Large Recommendation Models, avoiding prompt length explosion while enhancing performance.

Onepiece: Bringing context engineering and reasoning to industrial cascade ranking system.arXiv preprint arXiv:2509.18091

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer