Grapheval: A knowledge-graph based llm hallucination evaluation framework

· 2024 · arXiv 2407.10793

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

MedHal-Loc: Are "Explainable-by-Architecture" Medical Hallucination Detectors Faithful Localizers? A Localization Benchmark

cs.CL · 2026-06-19 · unverdicted · novelty 7.0

MedHal-Loc benchmark shows KG-triple hallucination detectors localize errors no better than chance on controlled medical statements due to entity extraction limits, while NLI and consistency methods succeed above chance, and real hallucinations are mostly diffuse conclusion changes.

Integrating Graphs, Large Language Models, and Agents: Reasoning and Retrieval

cs.AI · 2026-04-17 · unverdicted · novelty 6.0

A structured survey organizing graph-LLM integration methods by purpose, modality, and strategy across application domains.

From Personas to Plot: Character-Grounded Multi-Agent Story Generation for Long-Form Narratives

cs.CL · 2026-07-01 · unverdicted · novelty 5.0

MAGNET multi-agent generation with persona grounding and ATLAS graph verification yields 34-50% fewer hallucinations and annotations than single-model or IBSEN baselines at 100-page scale.

Cross Paraphrastic Invariance Learning for Hallucination Detection

cs.CL · 2026-06-06 · unverdicted · novelty 5.0

CPIL is a contrastive two-stage method that enforces paraphrase invariance on limited labeled data to outperform baselines in hallucination detection across 11 tasks.

Position: How can Graphs Help Large Language Models?

cs.AI · 2026-05-04 · unverdicted · novelty 3.0

Graphs can help LLMs reduce hallucinations, boost reasoning via prompting techniques, and better process structured data.

citing papers explorer

Showing 5 of 5 citing papers after filters.

MedHal-Loc: Are "Explainable-by-Architecture" Medical Hallucination Detectors Faithful Localizers? A Localization Benchmark cs.CL · 2026-06-19 · unverdicted · none · ref 16
MedHal-Loc benchmark shows KG-triple hallucination detectors localize errors no better than chance on controlled medical statements due to entity extraction limits, while NLI and consistency methods succeed above chance, and real hallucinations are mostly diffuse conclusion changes.
Integrating Graphs, Large Language Models, and Agents: Reasoning and Retrieval cs.AI · 2026-04-17 · unverdicted · none · ref 34
A structured survey organizing graph-LLM integration methods by purpose, modality, and strategy across application domains.
From Personas to Plot: Character-Grounded Multi-Agent Story Generation for Long-Form Narratives cs.CL · 2026-07-01 · unverdicted · none · ref 43
MAGNET multi-agent generation with persona grounding and ATLAS graph verification yields 34-50% fewer hallucinations and annotations than single-model or IBSEN baselines at 100-page scale.
Cross Paraphrastic Invariance Learning for Hallucination Detection cs.CL · 2026-06-06 · unverdicted · none · ref 22
CPIL is a contrastive two-stage method that enforces paraphrase invariance on limited labeled data to outperform baselines in hallucination detection across 11 tasks.
Position: How can Graphs Help Large Language Models? cs.AI · 2026-05-04 · unverdicted · none · ref 40
Graphs can help LLMs reduce hallucinations, boost reasoning via prompting techniques, and better process structured data.

Grapheval: A knowledge-graph based llm hallucination evaluation framework

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer