Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models

· 2025 · cs.CL · arXiv 2508.15396

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

The increasing adoption of large language models (LLMs) has raised serious concerns about their reliability and trustworthiness. As a result, a growing body of research focuses on evidence-based text generation with LLMs, aiming to link model outputs to supporting evidence to ensure traceability and verifiability. However, the field is fragmented due to inconsistent terminology, isolated evaluation practices, and a lack of unified benchmarks. To bridge this gap, we systematically analyze 134 papers, introduce a unified taxonomy of evidence-based text generation with LLMs, and investigate 300 evaluation metrics across seven key dimensions. Thereby, we focus on approaches that use citations, attribution, or quotations for evidence-based text generation. Building on this, we examine the distinctive characteristics and representative methods in the field. Finally, we highlight open challenges and outline promising directions for future work.

representative citing papers

From Agent Traces to Trust: A Survey of Evidence Tracing and Execution Provenance in LLM Agents

cs.CR · 2026-06-03 · unverdicted · novelty 5.0 · 2 refs

This survey defines execution provenance as a typed graph of agent execution and evidence tracing as its projection onto evidence-support relations, then reviews methods, taxonomy, benchmarks, and challenges for auditable LLM agents.

Are Finer Citations Always Better? Rethinking Granularity for Attributed Generation

cs.CL · 2026-04-01 · unverdicted · novelty 5.0

Enforcing sentence-level citations degrades LLM attribution quality by 16-276% versus paragraph-level, with larger models penalized more due to disrupted semantic synthesis.

Explicit Evidence Grounding via Structured Inline Citation Generation

cs.CL · 2026-06-05 · unverdicted · novelty 4.0

FullCite introduces three strategies for structured inline citation generation in QA and finds LLMs identify relevant documents well but struggle with precise evidence spans on ASQA, BioASQ, and ExpertQA.

citing papers explorer

Showing 1 of 1 citing paper after filters.

From Agent Traces to Trust: A Survey of Evidence Tracing and Execution Provenance in LLM Agents cs.CR · 2026-06-03 · unverdicted · none · ref 15 · 2 links · internal anchor
This survey defines execution provenance as a typed graph of agent execution and evidence tracing as its projection onto evidence-support relations, then reviews methods, taxonomy, benchmarks, and challenges for auditable LLM agents.

Attribution, Citation, and Quotation: A Survey of Evidence-based Text Generation with Large Language Models

fields

years

verdicts

representative citing papers

citing papers explorer