hub Canonical reference

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

Aditi Singh, Abul Ehtesham, Saket Kumar, Tala Talaei Khoei, Athanasios V. Vasilakos · 2025 · cs.AI · arXiv 2501.09136

Canonical reference. 91% of citing Pith papers cite this work as background.

67 Pith papers citing it

Background 91% of classified citations

open full Pith review browse 67 citing papers arXiv PDF

abstract

Large Language Models (LLMs) have advanced artificial intelligence by enabling human-like text generation and natural language understanding. However, their reliance on static training data limits their ability to respond to dynamic, real-time queries, resulting in outdated or inaccurate outputs. Retrieval-Augmented Generation (RAG) has emerged as a solution, enhancing LLMs by integrating real-time data retrieval to provide contextually relevant and up-to-date responses. Despite its promise, traditional RAG systems are constrained by static workflows and lack the adaptability required for multi-step reasoning and complex task management. Agentic Retrieval-Augmented Generation (Agentic RAG) transcends these limitations by embedding autonomous AI agents into the RAG pipeline. These agents leverage agentic design patterns reflection, planning, tool use, and multi-agent collaboration to dynamically manage retrieval strategies, iteratively refine contextual understanding, and adapt workflows through operational structures ranging from sequential steps to adaptive collaboration. This integration enables Agentic RAG systems to deliver flexibility, scalability, and context-awareness across diverse applications. This paper presents an analytical survey of Agentic RAG systems. It traces the evolution of RAG paradigms, introduces a principled taxonomy of Agentic RAG architectures based on agent cardinality, control structure, autonomy, and knowledge representation, and provides a comparative analysis of design trade-offs across existing frameworks. The survey examines applications in healthcare, finance, education, and enterprise document processing, and distills practical lessons for system designers and practitioners. Finally, it identifies key open research challenges related to evaluation, coordination, memory management, efficiency, and governance, outlining directions for future research.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 11

citation-polarity summary

background 10 unclear 1

representative citing papers

ChartWalker: Benchmarking the Cross-Chart RAG Task with Hierarchical Knowledge Graphs

cs.IR · 2026-06-22 · unverdicted · novelty 7.0

ChartWalker provides a hierarchical knowledge graph construction method and structure-aware sampling to generate cross-chart RAG benchmarks, releasing ChartWalker-Bench that exposes performance gaps across RAG paradigms.

Chatbots Output Meaningful (but Problematic) Language

cs.CL · 2026-06-02 · unverdicted · novelty 7.0

LLM outputs are meaningful according to standard theories of human language, without requiring anthropomorphic assumptions about the models.

CuSearch: Curriculum Rollout Sampling via Search Depth for Agentic RAG

cs.AI · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

CuSearch reallocates rollout budget in RLVR toward deeper-search trajectories as a proxy for retrieval supervision density, yielding up to 11.8 exact-match gains over uniform GRPO sampling on ZeroSearch.

LatentRAG: Latent Reasoning and Retrieval for Efficient Agentic RAG

cs.CL · 2026-05-07 · unverdicted · novelty 7.0

LatentRAG performs agentic RAG by generating latent tokens for thoughts and subqueries in one forward pass, matching explicit methods' accuracy on seven benchmarks while reducing latency by ~90%.

SCOUT: Active Information Foraging for Long-Text Understanding with Decoupled Epistemic States

cs.CL · 2026-05-06 · unverdicted · novelty 7.0

SCOUT achieves state-of-the-art long-text understanding with up to 8x lower token use by actively foraging for sparse query-relevant information and updating a compact provenance-grounded epistemic state.

RAG-Reflect: Agentic Retrieval-Augmented Generation with Reflections for Comment-Driven Code Maintenance on Stack Overflow

cs.SE · 2026-04-24 · unverdicted · novelty 7.0

RAG-Reflect achieves F1=0.78 on valid comment-edit prediction using retrieval-augmented reasoning and self-reflection, outperforming baselines and approaching fine-tuned models without retraining.

A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding

cs.AI · 2026-04-21 · unverdicted · novelty 7.0

A-MAR decomposes art queries into reasoning plans to condition retrieval, leading to improved explanation quality and multi-step reasoning on art benchmarks compared to baselines.

Don't Retrieve, Navigate: Distilling Enterprise Knowledge into Navigable Agent Skills for QA and RAG

cs.IR · 2026-04-16 · unverdicted · novelty 7.0 · 2 refs

Corpus2Skill converts document corpora into navigable hierarchical skill directories for LLM agents, improving QA and RAG quality on single-domain enterprise data but not on open-domain or tabular corpora.

E2E-REME: Towards End-to-End Microservices Auto-Remediation via Experience-Simulation Reinforcement Fine-Tuning

cs.SE · 2026-04-13 · unverdicted · novelty 7.0

E2E-REME outperforms nine LLMs in accuracy and efficiency for end-to-end microservice remediation by using experience-simulation reinforcement fine-tuning on a new benchmark called MicroRemed.

DOTRAG: Retrieval-Time Reasoning Along Paths

cs.IR · 2026-04-06 · unverdicted · novelty 7.0

DotRAG reformulates graph retrieval as query-guided path reasoning with Division of Thought, reporting SOTA results on MetaQA and UltraDomain for multi-hop tasks.

Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training

cs.LG · 2025-11-10 · unverdicted · novelty 7.0

Q-RAG trains embedders via RL for multi-step retrieval and reports state-of-the-art results on BabiLong and RULER benchmarks for contexts up to 10M tokens.

HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation

cs.CL · 2025-10-09 · unverdicted · novelty 7.0

HiPRAG adds hierarchical process rewards to RL training for agentic RAG, reducing over-search to 2.3% and achieving 65.4-67.2% accuracy on seven QA benchmarks across 3B and 7B models.

KidnapRAG: A Black-Box Attack for Hijacking Reasoning in Agentic Retrieval-Augmented Generation Systems

cs.CR · 2026-07-01 · unverdicted · novelty 6.0

KidnapRAG is a sequential black-box poisoning attack on Agentic RAG systems using Bait, Chain-Link, and Mal-Ins documents to redirect retrieval and reasoning, outperforming prior baselines.

MedEvoEval: Evaluating Continual Evolution of Doctor Agents through Simulated Clinical Episodes

cs.AI · 2026-06-27 · unverdicted · novelty 6.0

MedEvoEval is an executable longitudinal evaluation framework that converts medical cases into action-gated simulated episodes to track how doctor agents evolve decision-making, resource use, and experience across multiple encounters.

Is GraphRAG Needed? From Basic RAG to Graph-/Agentic Solutions with Context Optimization

cs.CL · 2026-06-24 · unverdicted · novelty 6.0

Evaluates 9 RAG scenarios across variants, proposes context engineering reducing token usage 19-53%, and identifies a retrieval-generation gap where more retrieval does not improve generation proportionally.

To Isolate or to Score? Model-Adaptive Assessment for Cost-Efficient Multi-Agent RAG

cs.AI · 2026-06-23 · unverdicted · novelty 6.0

Empirical study finds isolation drives gains for weak models in multi-agent RAG while scoring matters for strong ones, enabling MADARA for cost-efficient adaptive assessment.

Only Ask What You Don't Know: Grounded Delta Planning for Efficient Multi-step RAG

cs.CL · 2026-06-21 · unverdicted · novelty 6.0 · 2 refs

GDP-RAG targets only information deltas in multi-hop RAG through preliminary grounding, gap-conditioned prompts, and skeletal trajectories, reaching 60.63% accuracy at 0.51 cost-of-pass on HotpotQA, 2WikiMultiHopQA, and MuSiQue.

SHACR: A Graph-Augmented Semi-Autonomous Framework for Multi-Class Conflict Resolution in Smart Home IoT Automation

cs.NI · 2026-06-21 · unverdicted · novelty 6.0

SHACR is a graph-augmented framework that grounds LLMs in a formal knowledge graph to unify logical, semantic, and physical conflict detection in IoT automation, raising F1 from 0.59 to 0.95 on a 203-rule testbed.

EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

cs.CL · 2026-06-19 · unverdicted · novelty 6.0

EvoEmbedding generates evolvable embeddings via a latent memory updated during sequential processing, outperforming larger models on long-context retrieval and generalizing to 10x longer contexts in downstream tasks.

SHIFT: Semantic Harmonization via Index-side Feature Transformation for Multilingual Information Retrieval

cs.IR · 2026-06-17 · unverdicted · novelty 6.0

SHIFT mitigates language bias in MLIR by subtracting estimated relative language vectors from document embeddings during indexing using parallel translation pairs.

Rethinking RAG in Long Videos: What to Retrieve and How to Use It?

cs.AI · 2026-06-11 · unverdicted · novelty 6.0

Introduces V-RAGBench benchmark and CARVE method that selects per-chunk retrieval configurations via parallel retrievers and adaptive reranking, outperforming eight VideoRAG baselines.

Agentic Hybrid RAG for Evidence-Grounded Muon Collider Analysis

hep-ex · 2026-06-09 · unverdicted · novelty 6.0

Agentic hybrid RAG with a new muon collider benchmark outperforms baselines in retrieval effectiveness, answer quality, evidence coverage, and factual grounding.

MARDoc: A Memory-Aware Refinement Agent Framework for Multimodal Long Document QA

cs.CL · 2026-06-04 · unverdicted · novelty 6.0

MARDoc introduces a three-agent framework (Explorer, Refiner, Reflector) with dynamically updated structured memory to improve multi-hop reasoning in multimodal long-document QA, outperforming baselines on MMLongBench-Doc and DocBench.

HMARS: A Hierarchical Multi-Agent Memory System for Long-Context Reasoning

cs.IR · 2026-06-03 · unverdicted · novelty 6.0

HMARS introduces a hierarchical multi-agent memory system that outperforms standard retrieval and other baselines on long-document and multi-turn reasoning tasks through improved evidence coverage.

citing papers explorer

Showing 1 of 1 citing paper after filters.

SHACR: A Graph-Augmented Semi-Autonomous Framework for Multi-Class Conflict Resolution in Smart Home IoT Automation cs.NI · 2026-06-21 · unverdicted · none · ref 40 · internal anchor
SHACR is a graph-augmented framework that grounds LLMs in a formal knowledge graph to unify logical, semantic, and physical conflict detection in IoT automation, raising F1 from 0.59 to 0.95 on a 203-rule testbed.

Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer