Hallucination Detection in LLMs with Topological Divergence on Attention Graphs

2025 · cs.CL · arXiv 2504.10063

2 Pith papers cite this work. Polarity classification is still indexing.


abstract

Hallucination, i.e., generating factually incorrect content, remains a critical challenge for large language models (LLMs). We introduce TOHA, a TOpology-based HAllucination detector in the RAG setting, which leverages a topological divergence metric to quantify the structural properties of graphs induced by attention matrices. Examining the topological divergence between prompt and response subgraphs reveals consistent patterns: higher divergence values in specific attention heads correlate with hallucinated outputs, independent of the dataset. Extensive experiments, including evaluation on question answering and summarization tasks, show that our approach achieves state-of-the-art or competitive results on several benchmarks while requiring minimal annotated data and computational resources. Our findings suggest that analyzing the topological structure of attention matrices can serve as an efficient and robust indicator of factual reliability in LLMs.
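
illustrative sketch

The abstract does not define the divergence metric, so the following is only a rough stand-in meant to make the idea of comparing prompt and response subgraphs concrete. It assumes a single head's attention matrix is turned into a weighted graph with distance 1 - max(a_ij, a_ji), and it scores a response by the total weight of the cheapest edges needed to attach every response token to the prompt (a minimum spanning tree with the prompt collapsed to one vertex). The function names `attention_distances` and `prompt_response_divergence`, the distance definition, and the MST-based score are all assumptions for illustration, not TOHA's published procedure.

```python
import numpy as np


def attention_distances(attn: np.ndarray) -> np.ndarray:
    """Turn an attention matrix into symmetric distances (assumed convention)."""
    sym = np.maximum(attn, attn.T)   # undirected edge weight: stronger of the two directions
    dist = 1.0 - sym                 # strong attention -> short distance
    np.fill_diagonal(dist, 0.0)
    return dist


def prompt_response_divergence(attn, n_prompt: int) -> float:
    """Cost of attaching all response tokens to the prompt (hypothetical score).

    Runs Prim's algorithm starting from the whole prompt, which is treated
    as a single already-connected component.
    """
    attn = np.asarray(attn, dtype=float)
    n = attn.shape[0]
    if attn.shape != (n, n) or not 0 < n_prompt < n:
        raise ValueError("expected a square matrix and 0 < n_prompt < n")

    dist = attention_distances(attn)
    in_tree = np.zeros(n, dtype=bool)
    in_tree[:n_prompt] = True
    # Cheapest known edge from the current tree to every token.
    best = dist[:n_prompt, :].min(axis=0)

    total = 0.0
    for _ in range(n - n_prompt):
        candidates = np.where(in_tree, np.inf, best)
        j = int(np.argmin(candidates))   # closest response token not yet attached
        total += float(candidates[j])
        in_tree[j] = True
        best = np.minimum(best, dist[j])
    return total


# Toy example: token 0 is the prompt, tokens 1 and 2 are the response.
grounded = np.array([[1.0, 0.0, 0.0],
                     [0.8, 0.2, 0.0],
                     [0.1, 0.9, 0.0]])
detached = np.eye(3)  # every token attends only to itself

score_grounded = prompt_response_divergence(grounded, n_prompt=1)
score_detached = prompt_response_divergence(detached, n_prompt=1)
```

Under these assumptions, a response whose tokens attend strongly to the prompt (directly or through other response tokens) gets a low score, while a response that ignores the prompt gets a high one. A detector along the lines the abstract describes would compute such a score per attention head and keep the heads whose scores separate hallucinated from grounded outputs.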

representative citing papers

Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models

cs.CL · 2026-04-12 · unverdicted · novelty 6.0

SinkProbe detects hallucinations in LLMs by analyzing attention sinks in attention maps, showing they indicate transitions to prior-dominated computation and achieving state-of-the-art results.

Topological Data Analysis Applications in Natural Language Processing: A Survey

cs.CL · 2024-11-15 · accept · novelty 6.0

This survey compiles 137 papers on Topological Data Analysis in NLP, categorizing them into theoretical explanations of language and practical integrations into ML systems while noting open challenges.

citing papers explorer


Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models

cs.CL · 2026-04-12 · unverdicted · none · ref 2 · internal anchor

Topological Data Analysis Applications in Natural Language Processing: A Survey

cs.CL · 2024-11-15 · accept · none · ref 1 · internal anchor
