STORM trains lexical query rewriters via reward-guided beam search that converts retrieval metrics into stepwise token signals, enabling 0.6B-8B models to rival dense retrievers on TREC, BEIR and MIRACL without index changes.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.IR 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
STORM: Stepwise Token Optimization with Reward-Guided Beam Search
STORM trains lexical query rewriters via reward-guided beam search that converts retrieval metrics into stepwise token signals, enabling 0.6B-8B models to rival dense retrievers on TREC, BEIR and MIRACL without index changes.