PRISM: Probing Reasoning, Instruction, and Source Memory in LLM Hallucinations

· 2026 · cs.CL · arXiv 2604.16909

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

As large language models (LLMs) evolve from conversational assistants into agents capable of handling complex tasks, they are increasingly deployed in high-risk domains. However, existing benchmarks largely rely on mixed queries and posterior evaluation, output-level scoring, which quantifies hallucination severity but offers limited insight into where and why hallucinations arise in the generation pipeline. We therefore reformulate hallucination evaluation as a diagnostic problem and propose PRISM, a controlled benchmark that disentangles hallucinations into four dimensions: knowledge missing, knowledge errors, reasoning errors, and instruction-following errors, grounded in three stages of generation (memory, instruction, and reasoning). PRISM contains 9,448 instances across 65 tasks and supports fine-grained, stage-aware diagnostic evaluation. Evaluating 24 mainstream open-source and proprietary LLMs, we uncover consistent trade-offs across instruction following, memory retrieval, and logical reasoning, showing that mitigation strategies often improve specific dimensions at the expense of others. We hope PRISM provides a framework for understanding the specific mechanisms behind LLMs hallucinations, ultimately accelerating the development of trustworthy large language models.

representative citing papers

Three Regimes of Context-Parametric Conflict: A Predictive Framework and Empirical Validation

cs.CL · 2026-05-12 · conditional · novelty 6.0

A three-regime framework resolves contradictions in LLM context vs. parametric knowledge conflicts by distinguishing single-source updating, competitive integration, and task-appropriate selection, with empirical confirmation of certainty gradients and task effects across five models.

citing papers explorer

Showing 1 of 1 citing paper.

Three Regimes of Context-Parametric Conflict: A Predictive Framework and Empirical Validation cs.CL · 2026-05-12 · conditional · none · ref 18 · internal anchor
A three-regime framework resolves contradictions in LLM context vs. parametric knowledge conflicts by distinguishing single-source updating, competitive integration, and task-appropriate selection, with empirical confirmation of certainty gradients and task effects across five models.

PRISM: Probing Reasoning, Instruction, and Source Memory in LLM Hallucinations

fields

years

verdicts

representative citing papers

citing papers explorer