Title resolution pending

Songpei Xu, Shijia Wang, Da Guo, Xianwen Guo, Qiang Xiao, Bin Huang, Guanlin Wu, Chuanjiang Luo · 2025

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Compute Only Once: UG-Separation for Efficient Large Recommendation Models

cs.IR · 2026-02-11 · unverdicted · novelty 7.0

UG-Separation framework disentangles user-side and item-side flows in TokenMixer dense-interaction models to enable reusable user computations, cutting inference latency up to 20% in ByteDance production scenarios.

MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches

cs.LG · 2026-04-24 · unverdicted · novelty 6.0

MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.

citing papers explorer

Showing 2 of 2 citing papers.

Compute Only Once: UG-Separation for Efficient Large Recommendation Models cs.IR · 2026-02-11 · unverdicted · none · ref 27
UG-Separation framework disentangles user-side and item-side flows in TokenMixer dense-interaction models to enable reusable user computations, cutting inference latency up to 20% in ByteDance production scenarios.
MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches cs.LG · 2026-04-24 · unverdicted · none · ref 37
MTServe achieves up to 3.1x speedup for generative recommendation model serving by using hierarchical caches with host RAM and system optimizations while keeping cache hit ratios above 98.5%.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer