Generative Large Language Models in Automated Fact-Checking: A Survey

Ivan Vykopal; Mari\'an \v{S}imko; Mat\'u\v{s} Pikuliak; Simon Ostermann

arxiv: 2407.02351 · v3 · pith:UAAKD236new · submitted 2024-07-02 · 💻 cs.CL

Generative Large Language Models in Automated Fact-Checking: A Survey

Ivan Vykopal , Mat\'u\v{s} Pikuliak , Simon Ostermann , Mari\'an \v{S}imko This is my paper

classification 💻 cs.CL

keywords fact-checkinggenerativellmsautomatedresearchsurveycurrentlanguage

0 comments

read the original abstract

The rapid spread of false and misleading information on online platforms poses a growing societal challenge, overwhelming the capacity of manual fact-checking and increasing the demand for scalable, reliable automation. Recent advances in generative large language models (LLMs) have broadened the scope of automated fact-checking beyond accuracy-driven prediction. LLMs are now integral components of fact-checking pipelines, supporting tasks such as generating new data, performing and assisting with fact verification, and shaping how fact-checking systems are evaluated. This survey provides a comprehensive overview of the role of generative LLMs in automated fact-checking, based on a systematic review of 199 research papers. We introduce a unifying taxonomy that captures how generative LLMs are integrated into fact-checking workflows and analyze their use across core fact-checking tasks, dataset construction and augmentation strategies, task formulations, and evaluation practices. Additionally, we investigate the impact of generative LLMs in multilingual and low-resource settings in fact-checking, highlighting trends, limitations, and gaps in current research. By consolidating fragmented research efforts and identifying methodological patterns, limitations, and open challenges, this survey maps the current state of generative LLMs in automated fact-checking. It aims to support researchers in developing more reliable, interpretable, and inclusive fact-checking systems, while outlining promising directions for future research in this rapidly evolving field.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

TSVer: A Benchmark for Fact Verification Against Time-Series Evidence
cs.CL 2025-11 unverdicted novelty 7.0

TSVer is a new benchmark dataset for fact verification against time-series evidence, with 304 annotated real-world claims, 400 time series, verdicts, and justifications, plus baseline results showing current models struggle.
Multilingual Fact-Checking at Scale: Fine-Tuned Compact Models vs LLMs
cs.CL 2026-06 unverdicted novelty 4.0

Fine-tuned compact models achieve strong multilingual performance and large efficiency gains over LLMs on production data from 114 languages for claim detection and 28 for veracity prediction.