Seq2slate: Re-ranking and slate optimization with rnns

Irwan Bello, Sayali Kulkarni, Sagar Jain, Craig Boutilier, Ed Chi, Elad Eban, Xiyang Luo, Alan Mackey, Ofer Meshi · 2018 · cs.IR · arXiv 1810.02019

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

open full Pith review browse 6 citing papers arXiv PDF

abstract

Ranking is a central task in machine learning and information retrieval. In this task, it is especially important to present the user with a slate of items that is appealing as a whole. This in turn requires taking into account interactions between items, since intuitively, placing an item on the slate affects the decision of which other items should be placed alongside it. In this work, we propose a sequence-to-sequence model for ranking called seq2slate. At each step, the model predicts the next `best' item to place on the slate given the items already selected. The sequential nature of the model allows complex dependencies between the items to be captured directly in a flexible and scalable way. We show how to learn the model end-to-end from weak supervision in the form of easily obtained click-through data. We further demonstrate the usefulness of our approach in experiments on standard ranking benchmarks as well as in a real-world recommendation system.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

UniRank: Unified List-wise Reranking via Confidence-Ordered Denoising

cs.IR · 2026-05-11 · unverdicted · novelty 7.0

UniRank unifies autoregressive and non-autoregressive list-wise reranking via bidirectional modeling in a confidence-ordered iterative denoising process, outperforming baselines on datasets and online tests.

Next-Scale Generative Reranking: A Tree-based Generative Rerank Method at Meituan

cs.IR · 2026-04-07 · unverdicted · novelty 7.0

NSGR is a tree-structured generative reranker that progressively generates optimal lists via next-scale expansion and multi-scale neighbor loss to balance perspectives and align training signals.

SCASRec: A Self-Correcting and Auto-Stopping Model for Generative Route List Recommendation

cs.IR · 2026-02-03 · unverdicted · novelty 6.0

SCASRec unifies ranking and redundancy elimination for route lists via stepwise corrective rewards and an adaptive end-of-recommendation token, claiming SOTA results on two datasets and real deployment.

From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space

cs.IR · 2026-04-28 · unverdicted · novelty 6.0

GloRank reformulates list-wise reranking as token generation over a global item identifier space, using supervised pre-training followed by reinforcement learning to maximize list-wise utility and outperforming baselines on benchmarks and industrial data.

Rich-Media Re-Ranker: A User Satisfaction-Driven LLM Re-ranking Framework for Rich-Media Search

cs.IR · 2026-02-05 · unverdicted · novelty 5.0

A re-ranking system for rich-media search that plans query intents from sessions, adds visual signals from VLMs, and uses an LLM to score results on multiple facets before multi-task RL adaptation, with reported gains in engagement after industrial deployment.

Dual-Rerank: Fusing Causality and Utility for Industrial Generative Reranking

cs.IR · 2026-04-08 · unverdicted · novelty 4.0

Dual-Rerank fuses autoregressive and non-autoregressive generative reranking via knowledge distillation and uses list-wise decoupled RL optimization to improve whole-page utility and cut latency in industrial video search.

citing papers explorer

Showing 6 of 6 citing papers.

UniRank: Unified List-wise Reranking via Confidence-Ordered Denoising cs.IR · 2026-05-11 · unverdicted · none · ref 6
UniRank unifies autoregressive and non-autoregressive list-wise reranking via bidirectional modeling in a confidence-ordered iterative denoising process, outperforming baselines on datasets and online tests.
Next-Scale Generative Reranking: A Tree-based Generative Rerank Method at Meituan cs.IR · 2026-04-07 · unverdicted · none · ref 4
NSGR is a tree-structured generative reranker that progressively generates optimal lists via next-scale expansion and multi-scale neighbor loss to balance perspectives and align training signals.
SCASRec: A Self-Correcting and Auto-Stopping Model for Generative Route List Recommendation cs.IR · 2026-02-03 · unverdicted · none · ref 4 · internal anchor
SCASRec unifies ranking and redundancy elimination for route lists via stepwise corrective rewards and an adaptive end-of-recommendation token, claiming SOTA results on two datasets and real deployment.
From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space cs.IR · 2026-04-28 · unverdicted · none · ref 3
GloRank reformulates list-wise reranking as token generation over a global item identifier space, using supervised pre-training followed by reinforcement learning to maximize list-wise utility and outperforming baselines on benchmarks and industrial data.
Rich-Media Re-Ranker: A User Satisfaction-Driven LLM Re-ranking Framework for Rich-Media Search cs.IR · 2026-02-05 · unverdicted · none · ref 4 · internal anchor
A re-ranking system for rich-media search that plans query intents from sessions, adds visual signals from VLMs, and uses an LLM to score results on multiple facets before multi-task RL adaptation, with reported gains in engagement after industrial deployment.
Dual-Rerank: Fusing Causality and Utility for Industrial Generative Reranking cs.IR · 2026-04-08 · unverdicted · none · ref 2
Dual-Rerank fuses autoregressive and non-autoregressive generative reranking via knowledge distillation and uses list-wise decoupled RL optimization to improve whole-page utility and cut latency in industrial video search.

Seq2slate: Re-ranking and slate optimization with rnns

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer