Hallucination Inspector: A Fact-Checking Judge for API Migration

2026 · cs.SE · arXiv 2604.20202

1 Pith paper cites this work. Polarity classification is still indexing.


abstract

Large Language Models (LLMs) are increasingly deployed in automated software engineering for tasks such as API migration. While LLMs are able to identify migration patterns, they often make mistakes and fail to produce correct glue code to invoke the new API in place of the old one. We call this issue Scaffolding Hallucination, a failure mode where models generate incorrect calling contexts by inventing Phantom Symbols -- such as imaginary imports, constructors, and constants -- that do not exist in the API specification. In this paper, we show that standard metrics cannot be relied upon to detect these instances of hallucination. We propose Hallucination Inspector, a static analysis tool to detect Scaffolding Hallucination in LLM-generated code. Our approach includes a lightweight evaluation framework that verifies symbols extracted from the abstract syntax tree against a knowledge base derived directly from software documentation for the API. A preliminary evaluation on Android API migrations demonstrates that our approach successfully identifies hallucinations and significantly reduces false positives compared to standard metrics and probabilistic judges.
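The core check the abstract describes (extract symbols from the abstract syntax tree, then look each one up in a knowledge base built from API documentation) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses Python's standard `ast` module as a stand-in for an Android (Java/Kotlin) parser, and the module name `newapi`, its symbol list, and the function `find_phantom_symbols` are all hypothetical names introduced only for illustration.

```python
import ast

# Hypothetical knowledge base: module name -> documented public symbols.
# Assumption: in practice this would be derived from API documentation;
# here it is hand-written for the example.
KNOWLEDGE_BASE = {
    "newapi": {"Client", "connect", "DEFAULT_TIMEOUT"},
}


def find_phantom_symbols(source, kb):
    """Return qualified names used in `source` that `kb` does not document."""
    tree = ast.parse(source)
    aliases = {}      # local name -> knowledge-base module it refers to
    phantoms = set()

    # Pass 1: inspect import statements.
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            for alias in node.names:
                if alias.name in kb:
                    aliases[alias.asname or alias.name] = alias.name
        elif isinstance(node, ast.ImportFrom) and node.module in kb:
            for alias in node.names:
                if alias.name not in kb[node.module]:
                    phantoms.add(f"{node.module}.{alias.name}")

    # Pass 2: inspect attribute access on imported modules (e.g. newapi.X).
    for node in ast.walk(tree):
        if isinstance(node, ast.Attribute) and isinstance(node.value, ast.Name):
            module = aliases.get(node.value.id)
            if module is not None and node.attr not in kb[module]:
                phantoms.add(f"{module}.{node.attr}")

    return sorted(phantoms)


# Example "generated" migration code containing two undocumented symbols.
generated = """
import newapi
from newapi import Client, RetryPolicy

client = Client()
newapi.connect(client, timeout=newapi.MAX_TIMEOUT)
"""

print(find_phantom_symbols(generated, KNOWLEDGE_BASE))
```

Because the lookup is a deterministic set-membership test rather than a model judgment, a symbol is flagged only when it is absent from the knowledge base. The sketch covers imports and module-level attribute access only; constructor signatures and other calling-context checks are out of its scope.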

representative citing papers

Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries

cs.SE · 2026-05-22 · unverdicted · novelty 5.0

Develops a section-aware hallucination detection method for LLM bug report summaries using synthetic injection on the BugsRepo dataset from Mozilla projects, reporting up to 0.89 Macro-F1 at report level.

citing papers explorer

Showing 1 of 1 citing paper.

Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries

cs.SE · 2026-05-22 · unverdicted · none · ref 34 · internal anchor
