Reward-rag: Enhancing rag with reward driven supervision

Thang Nguyen, Peter Chin, Yu-Wing Tai · 2024 · arXiv 2410.03780

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Supervising the search process produces reliable and generalizable information-seeking agents

cs.CL · 2025-02-19 · unverdicted · novelty 6.0

Process supervision via RAG-Gym produces more reliable and generalizable search agents, with gains driven by higher-quality queries on out-of-domain multi-hop tasks.

Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation

cs.CL · 2025-05-27 · unverdicted · novelty 5.0

RioRAG uses nugget-centric verification with cross-source checks to create dense verifiable rewards for RL-based optimization of long-form RAG, yielding higher factual recall and faithfulness on LongFact and RAGChecker.

citing papers explorer

Showing 2 of 2 citing papers.

Supervising the search process produces reliable and generalizable information-seeking agents cs.CL · 2025-02-19 · unverdicted · none · ref 51
Process supervision via RAG-Gym produces more reliable and generalizable search agents, with gains driven by higher-quality queries on out-of-domain multi-hop tasks.
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation cs.CL · 2025-05-27 · unverdicted · none · ref 20
RioRAG uses nugget-centric verification with cross-source checks to create dense verifiable rewards for RL-based optimization of long-form RAG, yielding higher factual recall and faithfulness on LongFact and RAGChecker.

Reward-rag: Enhancing rag with reward driven supervision

fields

years

verdicts

representative citing papers

citing papers explorer