The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models

Jimmy Lin; Rodrigo Nogueira; Ronak Pradeep

arxiv: 2101.05667 · v1 · pith:SKSVQLJZnew · submitted 2021-01-14 · 💻 cs.IR · cs.CL

The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models

Ronak Pradeep , Rodrigo Nogueira , Jimmy Lin This is my paper

classification 💻 cs.IR cs.CL

keywords designrankingpatterntasksdocumentexpando-mono-duokeywordmodel

0 comments

read the original abstract

We propose a design pattern for tackling text ranking problems, dubbed "Expando-Mono-Duo", that has been empirically validated for a number of ad hoc retrieval tasks in different domains. At the core, our design relies on pretrained sequence-to-sequence models within a standard multi-stage ranking architecture. "Expando" refers to the use of document expansion techniques to enrich keyword representations of texts prior to inverted indexing. "Mono" and "Duo" refer to components in a reranking pipeline based on a pointwise model and a pairwise model that rerank initial candidates retrieved using keyword search. We present experimental results from the MS MARCO passage and document ranking tasks, the TREC 2020 Deep Learning Track, and the TREC-COVID challenge that validate our design. In all these tasks, we achieve effectiveness that is at or near the state of the art, in some cases using a zero-shot approach that does not exploit any training data from the target task. To support replicability, implementations of our design pattern are open-sourced in the Pyserini IR toolkit and PyGaggle neural reranking library.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 10 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Sensitivity-Aware Test Collection for Search Among Personal Information
cs.IR 2026-06 accept novelty 7.0

A new sensitivity-labeled test collection is released from Enron emails with crowdsourced queries, relevance judgments, and LLM extensions for evaluating sensitivity-aware search.
Whole-Pool Setwise Reranking with Long-Context Language Models
cs.IR 2026-06 unverdicted novelty 7.0

DualEnd enables whole-pool setwise reranking of 100 candidates using 50 serial LLM calls by simultaneously selecting top and bottom passages with long-context models.
F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking
cs.LG 2026-05 unverdicted novelty 7.0

F-GRPO factorizes group-relative policy optimization into generation and ranking phases within one autoregressive sequence, using order-invariant coverage and position-aware utility rewards to improve top-ranked perfo...
Led to Mislead: Adversarial Content Injection for Attacks on Neural Ranking Models
cs.IR 2026-05 unverdicted novelty 7.0

CRAFT is a supervised LLM framework using retrieval-augmented generation, self-refinement, fine-tuning, and preference optimization to create fluent adversarial content that boosts target ranks in neural ranking model...
BracketRank: Large Language Model Document Ranking via Reasoning-based Competitive Elimination
cs.IR 2026-04 conditional novelty 7.0

BracketRank reranks documents via LLM-driven bracket-style competitive elimination with mandatory reasoning explanations, reaching 26.56 nDCG@10 on BRIGHT and outperforming RankGPT-4 and Rank-R1-14B.
MIRA: An LLM-Assisted Benchmark for Multi-Category Integrated Retrieval
cs.IR 2026-05 unverdicted novelty 6.0

MIRA is a new benchmark for multi-category integrated retrieval built from real queries on a social science platform, with LLM assistance for topic descriptions and relevance labeling across four item categories.
RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze!
cs.IR 2023-12 conditional novelty 6.0

RankZephyr is a new open-source LLM that closes the effectiveness gap with GPT-4 for zero-shot listwise reranking while showing robustness to input ordering and document count.
Trie-based Experiment Plans for Efficient IR Pipeline Experiments
cs.IR 2026-07 unverdicted novelty 5.0

Trie-based experiment plans reduce the duration of comparative evaluations of IR pipelines by 26% versus linear plans in a BM25-MonoT5-DuoT5 demonstration on MSMARCO v2.
Mask-to-Correct$^+$: Leveraging Retriever Diversity for Masking-guided Faithful Fact Correction
cs.IR 2026-04 unverdicted novelty 5.0

Mask-to-Correct and M2C+ use diversity-aware masking in RAG to identify erroneous claim spans and produce faithful corrections, outperforming baselines by up to 14% SARI without gold evidence.
Dynamic Ranked List Truncation for Reranking Pipelines via LLM-generated Reference-Documents
cs.IR 2026-04 unverdicted novelty 5.0

LLM-generated reference documents enable dynamic ranked list truncation and adaptive batching for listwise reranking, outperforming prior RLT methods and accelerating processing by up to 66% on TREC benchmarks.