pith:WYVVK7UF
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
Finetuning on a dataset with both positive and negative visual instructions reduces hallucinations in large multi-modal models.
arxiv:2306.14565 v4 · 2023-06-26 · cs.CV · cs.AI · cs.CE · cs.CL · cs.MM
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{WYVVK7UFB4B2WG5HOU57Q6FZLE}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We successfully mitigate hallucination by finetuning MiniGPT4 and mPLUG-Owl on LRV-Instruction while improving performance on several public datasets compared to state-of-the-art methods.
That GPT-4-generated negative instructions at the three semantic levels accurately capture the hallucination behaviors that matter in real deployments and that the GAVIE GPT-4 judge produces scores that align with human judgment.
A new dataset of 400k visual instructions including negative examples at three semantic levels reduces hallucinations in models like MiniGPT-4 when used for fine-tuning while improving benchmark performance.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:39:22.305051Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
b62b557e850f03ab1ba7753bf878b959185b9fecd2660f4f01f58fff3d4ad2e3
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/WYVVK7UFB4B2WG5HOU57Q6FZLE \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: b62b557e850f03ab1ba7753bf878b959185b9fecd2660f4f01f58fff3d4ad2e3
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "f37358e2dda34cc97dd49e498d272bd67d03fb70082bf90c64217715d5ba88bd",
"cross_cats_sorted": [
"cs.AI",
"cs.CE",
"cs.CL",
"cs.MM"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2023-06-26T10:26:33Z",
"title_canon_sha256": "9b26ffcce0d957b6538a8e449e36f9f06417f87d058c00bbf90192962d56c9d3"
},
"schema_version": "1.0",
"source": {
"id": "2306.14565",
"kind": "arxiv",
"version": 4
}
}