Seq2Slate: Re-ranking and Slate Optimization with RNNs

Irwan Bello, Sayali Kulkarni, Sagar Jain, Craig Boutilier, Ed Chi, Elad Eban, Xiyang Luo, Alan Mackey, Ofer Meshi

classification cs.IR cs.LG stat.ML

keywords items, model, slate, ranking, item, seq2slate, task, account

Ranking is a central task in machine learning and information retrieval. In this task, it is especially important to present the user with a slate of items that is appealing as a whole. This in turn requires taking into account interactions between items, since intuitively, placing an item on the slate affects the decision of which other items should be placed alongside it. In this work, we propose a sequence-to-sequence model for ranking called seq2slate. At each step, the model predicts the next "best" item to place on the slate given the items already selected. The sequential nature of the model allows complex dependencies between the items to be captured directly in a flexible and scalable way. We show how to learn the model end-to-end from weak supervision in the form of easily obtained click-through data. We further demonstrate the usefulness of our approach in experiments on standard ranking benchmarks as well as in a real-world recommendation system.
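
The step-by-step selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' model: the learned RNN is replaced by a hand-written scoring function, and every name here (greedy_slate, score_next, toy_score, RELEVANCE) is hypothetical. The sketch only shows the idea that the score of a candidate depends on the items already placed on the slate.

```python
from typing import Callable, List, Sequence


def greedy_slate(
    items: Sequence[str],
    score_next: Callable[[List[str], str], float],
    slate_size: int,
) -> List[str]:
    """Build a slate one position at a time, conditioning on earlier picks."""
    slate: List[str] = []
    remaining = list(items)
    while remaining and len(slate) < slate_size:
        # Score each candidate given the slate so far, then take the best one.
        best = max(remaining, key=lambda item: score_next(slate, item))
        slate.append(best)
        remaining.remove(best)
    return slate


# Assumed toy data: item ids whose first letter is a category label.
RELEVANCE = {"a1": 0.9, "a2": 0.8, "b1": 0.7, "c1": 0.4}


def toy_score(slate: List[str], item: str) -> float:
    """Stand-in for a learned model: relevance minus a penalty for each
    already-selected item from the same category."""
    same_category = sum(1 for chosen in slate if chosen[0] == item[0])
    return RELEVANCE[item] - 0.5 * same_category


example_slate = greedy_slate(list(RELEVANCE), toy_score, slate_size=3)
print(example_slate)
```

With this toy scorer, the second-most relevant item "a2" is left off the slate because "a1" is already on it, which is the kind of item interaction that scoring each item independently would miss.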

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

UniRank: Unified List-wise Reranking via Confidence-Ordered Denoising
cs.IR 2026-05 unverdicted novelty 7.0

UniRank unifies autoregressive and non-autoregressive list-wise reranking via bidirectional modeling in a confidence-ordered iterative denoising process, outperforming baselines on datasets and online tests.
Next-Scale Generative Reranking: A Tree-based Generative Rerank Method at Meituan
cs.IR 2026-04 unverdicted novelty 7.0

NSGR is a tree-structured generative reranker that progressively generates optimal lists via next-scale expansion and multi-scale neighbor loss to balance perspectives and align training signals.
From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space
cs.IR 2026-04 unverdicted novelty 6.0

GloRank reformulates list-wise reranking as token generation over a global item identifier space, using supervised pre-training followed by reinforcement learning to maximize list-wise utility and outperforming baseli...
Dual-Rerank: Fusing Causality and Utility for Industrial Generative Reranking
cs.IR 2026-04 unverdicted novelty 4.0

Dual-Rerank fuses autoregressive and non-autoregressive generative reranking via knowledge distillation and uses list-wise decoupled RL optimization to improve whole-page utility and cut latency in industrial video search.