hub Canonical reference

A Survey on RAG Meeting LLMs: Towards Retrieval- Augmented Large Language Models

Yashar Deldjoo, Zhankui He, Julian McAuley, Anton Korikov, Scott Sanner, Arnau Ramisa, René Vidal, Maheswaran Sathiamoorthy, Atoosa Kasirzadeh, Silvia Milano · 2024 · arXiv 7528.367147

Canonical reference. 86% of citing Pith papers cite this work as background.

18 Pith papers citing it

Background 86% of classified citations

read on arXiv browse 18 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 6 method 1

citation-polarity summary

background 6 use method 1

representative citing papers

From Schema to Signal: Retrieval-Augmented Modeling for Relational Data Analytics

cs.DB · 2026-05-14 · unverdicted · novelty 7.0

RAM augments relational graph models with attribute-semantic retrieval via random-walk documents and two contrastive augmentations (ATRA, ETRA) to achieve state-of-the-art results on five real-world databases.

VulKey: Automated Vulnerability Repair Guided by Domain-Specific Repair Patterns

cs.CR · 2026-05-03 · unverdicted · novelty 7.0

VulKey reaches 31.5% repair accuracy on real C/C++ vulnerabilities by matching hierarchical expert patterns to guide LLM patch generation, beating prior baselines by 7.6%.

Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation

cs.AI · 2026-03-03 · unverdicted · novelty 7.0

PURE reduces preference-inconsistent explanations in LLM recommenders by selecting user-aligned evidence paths and injecting them into generation, while preserving accuracy.

Model Context Protocol (MCP): Landscape, Security Threats, and Future Research Directions

cs.CR · 2025-03-30 · unverdicted · novelty 7.0

MCP lifecycle is defined with four phases and 16 activities; a threat taxonomy of 16 scenarios is constructed, validated via case studies, and paired with phase-specific safeguards.

PipeANN-Filter: An Efficient Filtered Vector Search System on SSD

cs.OS · 2026-05-18 · unverdicted · novelty 6.0

PipeANN-Filter improves filtered vector search latency and throughput on SSD by exploring a superset of valid vectors identified via probabilistic filters and verifying attributes only after selecting top-k candidates.

FT-RAG: A Fine-grained Retrieval-Augmented Generation Framework for Complex Table Reasoning

cs.CL · 2026-05-02 · unverdicted · novelty 6.0

FT-RAG introduces a fine-grained graph-based retrieval framework for tables plus a new 9870-pair benchmark, reporting 23.5% and 59.2% gains in table- and cell-level hit rates and 62.2% higher exact-value recall over baselines.

Efficient Rationale-based Retrieval: On-policy Distillation from Generative Rerankers based on JEPA

cs.IR · 2026-04-25 · unverdicted · novelty 6.0

Rabtriever distills a generative reranker into an efficient bi-encoder using on-policy JEPA to achieve near-reranker accuracy with linear complexity on rationale-based retrieval.

Adaptive Defense Orchestration for RAG: A Sentinel-Strategist Architecture against Multi-Vector Attacks

cs.CR · 2026-04-22 · unverdicted · novelty 6.0

A context-aware Sentinel-Strategist system for RAG selectively applies defenses to block membership inference and data poisoning while recovering most retrieval utility compared to always-on defense stacks.

A Case Study on the Impact of Anonymization Along the RAG Pipeline

cs.CR · 2026-04-17 · unverdicted · novelty 6.0

Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.

ClusterRAG: Cluster-Based Collaborative Filtering for Personalized Retrieval-Augmented Generation

cs.IR · 2026-04-14 · unverdicted · novelty 6.0

ClusterRAG applies density-based clustering to user profiles for collaborative retrieval in personalized RAG and reports best performance on LaMP tasks by combining target and similar-user profiles.

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

cs.AI · 2025-09-02 · accept · novelty 6.0

Survey that defines agentic RL for LLMs via POMDPs, introduces a taxonomy of planning/tool-use/memory/reasoning capabilities and domains, and compiles open environments from over 500 papers.

TreeRanker: Fast and Model-agnostic Ranking System for Code Suggestions in IDEs

cs.SE · 2025-08-04 · unverdicted · novelty 6.0

TreeRanker ranks static code completions by organizing candidates in a prefix tree and collecting token scores via a single greedy language-model decoding pass.

Training LLMs with Reinforcement Learning for Intent-Aware Personalized Question Answering

cs.CL · 2026-05-12 · unverdicted · novelty 5.0

IAP uses RL to train LLMs to explicitly infer and apply implicit user intent in single-turn personalized QA, achieving ~7.5% average macro-score gains over baselines on LaMP-QA.

All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG

cs.CL · 2026-04-22 · unverdicted · novelty 5.0

Multilingual RAG rerankers exhibit language bias that limits cross-lingual evidence use, and the proposed LAURA method aligns ranking with downstream generation utility to reduce the bias and improve performance.

Align Documents to Questions: Question-Oriented Document Rewriting for Retrieval-Augmented Generation

cs.CL · 2026-04-19 · unverdicted · novelty 5.0

QREAM rewrites documents to question-focused style using iterative ICL and distilled FT models, boosting RAG performance by up to 8% relative improvement.

GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs

cs.IR · 2025-11-10 · unverdicted · novelty 5.0

GroupRank uses groupwise LLM reranking with answer-free data synthesis and a group-ranking reward to reach 65.2 NDCG@10 on BRIGHT while providing 6.4x faster inference than listwise baselines.

Rethinking Agentic Reinforcement Learning In Large Language Models

cs.AI · 2026-04-30 · unverdicted · novelty 3.0

The paper reviews conceptual foundations, methodological innovations, effective designs, critical challenges, and future directions for LLM-based Agentic Reinforcement Learning.

Controlled Personalization in Legacy Media Online Services: A Case Study in News Recommendation

cs.IR · 2025-10-10 · unverdicted · novelty 3.0

Controlled personalization combining editorial curation with modest algorithmic recommendations in legacy news increases engagement, diversity, and reduces popularity bias per an A/B test.

citing papers explorer

Showing 18 of 18 citing papers.

From Schema to Signal: Retrieval-Augmented Modeling for Relational Data Analytics cs.DB · 2026-05-14 · unverdicted · none · ref 14
RAM augments relational graph models with attribute-semantic retrieval via random-walk documents and two contrastive augmentations (ATRA, ETRA) to achieve state-of-the-art results on five real-world databases.
VulKey: Automated Vulnerability Repair Guided by Domain-Specific Repair Patterns cs.CR · 2026-05-03 · unverdicted · none · ref 23
VulKey reaches 31.5% repair accuracy on real C/C++ vulnerabilities by matching hierarchical expert patterns to guide LLM patch generation, beating prior baselines by 7.6%.
Beyond Factual Correctness: Mitigating Preference-Inconsistent Explanations in Explainable Recommendation cs.AI · 2026-03-03 · unverdicted · none · ref 6
PURE reduces preference-inconsistent explanations in LLM recommenders by selecting user-aligned evidence paths and injecting them into generation, while preserving accuracy.
Model Context Protocol (MCP): Landscape, Security Threats, and Future Research Directions cs.CR · 2025-03-30 · unverdicted · none · ref 24
MCP lifecycle is defined with four phases and 16 activities; a threat taxonomy of 16 scenarios is constructed, validated via case studies, and paired with phase-specific safeguards.
PipeANN-Filter: An Efficient Filtered Vector Search System on SSD cs.OS · 2026-05-18 · unverdicted · none · ref 13
PipeANN-Filter improves filtered vector search latency and throughput on SSD by exploring a superset of valid vectors identified via probabilistic filters and verifying attributes only after selecting top-k candidates.
FT-RAG: A Fine-grained Retrieval-Augmented Generation Framework for Complex Table Reasoning cs.CL · 2026-05-02 · unverdicted · none · ref 3
FT-RAG introduces a fine-grained graph-based retrieval framework for tables plus a new 9870-pair benchmark, reporting 23.5% and 59.2% gains in table- and cell-level hit rates and 62.2% higher exact-value recall over baselines.
Efficient Rationale-based Retrieval: On-policy Distillation from Generative Rerankers based on JEPA cs.IR · 2026-04-25 · unverdicted · none · ref 3
Rabtriever distills a generative reranker into an efficient bi-encoder using on-policy JEPA to achieve near-reranker accuracy with linear complexity on rationale-based retrieval.
Adaptive Defense Orchestration for RAG: A Sentinel-Strategist Architecture against Multi-Vector Attacks cs.CR · 2026-04-22 · unverdicted · none · ref 3
A context-aware Sentinel-Strategist system for RAG selectively applies defenses to block membership inference and data poisoning while recovering most retrieval utility compared to always-on defense stacks.
A Case Study on the Impact of Anonymization Along the RAG Pipeline cs.CR · 2026-04-17 · unverdicted · none · ref 6
Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.
ClusterRAG: Cluster-Based Collaborative Filtering for Personalized Retrieval-Augmented Generation cs.IR · 2026-04-14 · unverdicted · none · ref 66
ClusterRAG applies density-based clustering to user profiles for collaborative retrieval in personalized RAG and reports best performance on LaMP tasks by combining target and similar-user profiles.
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey cs.AI · 2025-09-02 · accept · none · ref 272
Survey that defines agentic RL for LLMs via POMDPs, introduces a taxonomy of planning/tool-use/memory/reasoning capabilities and domains, and compiles open environments from over 500 papers.
TreeRanker: Fast and Model-agnostic Ranking System for Code Suggestions in IDEs cs.SE · 2025-08-04 · unverdicted · none · ref 12
TreeRanker ranks static code completions by organizing candidates in a prefix tree and collecting token scores via a single greedy language-model decoding pass.
Training LLMs with Reinforcement Learning for Intent-Aware Personalized Question Answering cs.CL · 2026-05-12 · unverdicted · none · ref 17
IAP uses RL to train LLMs to explicitly infer and apply implicit user intent in single-turn personalized QA, achieving ~7.5% average macro-score gains over baselines on LaMP-QA.
All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG cs.CL · 2026-04-22 · unverdicted · none · ref 44
Multilingual RAG rerankers exhibit language bias that limits cross-lingual evidence use, and the proposed LAURA method aligns ranking with downstream generation utility to reduce the bias and improve performance.
Align Documents to Questions: Question-Oriented Document Rewriting for Retrieval-Augmented Generation cs.CL · 2026-04-19 · unverdicted · none · ref 127
QREAM rewrites documents to question-focused style using iterative ICL and distilled FT models, boosting RAG performance by up to 8% relative improvement.
GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs cs.IR · 2025-11-10 · unverdicted · none · ref 17
GroupRank uses groupwise LLM reranking with answer-free data synthesis and a group-ranking reward to reach 65.2 NDCG@10 on BRIGHT while providing 6.4x faster inference than listwise baselines.
Rethinking Agentic Reinforcement Learning In Large Language Models cs.AI · 2026-04-30 · unverdicted · none · ref 18
The paper reviews conceptual foundations, methodological innovations, effective designs, critical challenges, and future directions for LLM-based Agentic Reinforcement Learning.
Controlled Personalization in Legacy Media Online Services: A Case Study in News Recommendation cs.IR · 2025-10-10 · unverdicted · none · ref 9
Controlled personalization combining editorial curation with modest algorithmic recommendations in legacy news increases engagement, diversity, and reduces popularity bias per an A/B test.

A Survey on RAG Meeting LLMs: Towards Retrieval- Augmented Large Language Models

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer