ACM Computing Surveys , volume=

Survey of hallucination in natural language generation , author=

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer

cs.CL · 2026-05-21 · unverdicted · novelty 6.0

Larger LLMs hallucinate more often despite having the correct concept available because instruction tuning causes probability mass to disperse across alternative surface forms instead of concentrating on one.

How Language Models Process Out-of-Distribution Inputs: A Two-Pathway Framework

cs.CL · 2026-04-30 · unverdicted · novelty 6.0

LLM OOD detectors are length-confounded; a two-pathway embedding-plus-trajectory framework detects covert OOD inputs at 0.721 average AUROC and 0.850 on jailbreaks.

citing papers explorer

Showing 2 of 2 citing papers.

Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer cs.CL · 2026-05-21 · unverdicted · none · ref 48
Larger LLMs hallucinate more often despite having the correct concept available because instruction tuning causes probability mass to disperse across alternative surface forms instead of concentrating on one.
How Language Models Process Out-of-Distribution Inputs: A Two-Pathway Framework cs.CL · 2026-04-30 · unverdicted · none · ref 20
LLM OOD detectors are length-confounded; a two-pathway embedding-plus-trajectory framework detects covert OOD inputs at 0.721 average AUROC and 0.850 on jailbreaks.

ACM Computing Surveys , volume=

fields

years

verdicts

representative citing papers

citing papers explorer