pith:FABDGC42
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence
Multimodal document models often produce correct answers while citing the wrong evidence regions.
arxiv:2605.12882 v1 · 2026-05-13 · cs.CL · cs.CV
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{FABDGC42UOCVILIWPJ2DHRXX7G}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Auditing 20 MLLMs reveals a pervasive Attribution Hallucination: models frequently produce the right answer while citing the wrong region. The strongest system (Gemini-3.1-Pro-Preview) achieves an SAA of only 76.0, and the strongest open-source MLLM reaches just 22.5.
The automated masking-ablation pipeline plus expert review produces accurate ground-truth element-level citations that correctly identify the minimal sufficient evidence regions for each question.
CiteVQA requires models to cite specific document regions with bounding boxes alongside answers and finds that even the strongest MLLMs frequently cite the wrong region, with top SAA scores of only 76.0 for closed models and 22.5 for open-source ones.
References
Receipt and verification
| First computed | 2026-05-18T03:09:11.069821Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2802330b9aa385542d167a7433c6f7f988bdff8191b1a30147780678fe5216fa
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/FABDGC42UOCVILIWPJ2DHRXX7G \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2802330b9aa385542d167a7433c6f7f988bdff8191b1a30147780678fe5216fa
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "d7abf45806e966aac184e30ced91b414828d7d4249608ffb512b9b9d741dd2c2",
"cross_cats_sorted": [
"cs.CV"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-13T01:54:42Z",
"title_canon_sha256": "49cb8290ff597ff16029388c750aa4d45f1f8610be6c8b5e4d294800cbc5f66c"
},
"schema_version": "1.0",
"source": {
"id": "2605.12882",
"kind": "arxiv",
"version": 1
}
}