Actions speak louder than words: Trillion-parameter sequential transducers for generative recommendations, 2024a

Zhai, J · 2025

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

FreeScale: Distributed Training for Sequence Recommendation Models with Minimal Scaling Cost

cs.LG · 2026-04-27 · unverdicted · novelty 4.0

FreeScale reduces computational bubbles by up to 90.3% in distributed training of sequence recommendation models on 256 H100 GPUs via load balancing, prioritized embedding overlap, and SM-Free communication.

citing papers explorer

Showing 1 of 1 citing paper.

FreeScale: Distributed Training for Sequence Recommendation Models with Minimal Scaling Cost

cs.LG · 2026-04-27 · unverdicted · none · ref 4
