pith:MCYNG54F
A Survey on Hallucination in Large Vision-Language Models
Large vision-language models generate text that conflicts with input images, and this survey defines the problem while reviewing its symptoms, benchmarks, causes, and fixes.
arxiv:2402.00253 v2 · 2024-02-01 · cs.CV · cs.CL · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{MCYNG54FMM74BLASM5OMVQKGDV}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Hallucination, defined as the misalignment between factual visual content and corresponding textual generation, poses a significant challenge of utilizing LVLMs, and this survey establishes an overview to facilitate future mitigation.
That hallucinations can be consistently defined and isolated across diverse LVLM architectures and tasks without significant overlap with other error types such as reasoning failures.
This survey reviews the definition, symptoms, evaluation benchmarks, root causes, and mitigation methods for hallucinations in large vision-language models.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-18T04:33:37.170404Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
60b0d37785633fc0ac12675ccac1461d5c6257f36b2a5a10468ee6d6560931d5
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/MCYNG54FMM74BLASM5OMVQKGDV \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 60b0d37785633fc0ac12675ccac1461d5c6257f36b2a5a10468ee6d6560931d5
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "58e98477621291f8569dc0c279bd252a0b496b574d21a26e9188f320d8d2a5b9",
"cross_cats_sorted": [
"cs.CL",
"cs.LG"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2024-02-01T00:33:21Z",
"title_canon_sha256": "33a9a211515ced3e215194accef8df66483c182165a90f379bf8d027480135ce"
},
"schema_version": "1.0",
"source": {
"id": "2402.00253",
"kind": "arxiv",
"version": 2
}
}