pith:7YTNYMNU
OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models
OCRBench evaluates large multimodal models on 29 OCR datasets to expose their specific weaknesses in text recognition tasks.
arxiv:2305.07895 v7 · 2023-05-13 · cs.CV · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{7YTNYMNUELMI4SR5F3BAH7D5KB}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
To facilitate the assessment of Optical Character Recognition (OCR) capabilities in Large Multimodal Models, we propose OCRBench, a comprehensive evaluation benchmark. OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available.
That the 29 chosen datasets together form a representative and non-redundant sample of all text-related visual challenges that large multimodal models will encounter in practice.
OCRBench provides the largest evaluation suite yet for OCR capabilities in large multimodal models, revealing gaps in multilingual, handwritten, and mathematical text handling.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:14.445847Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
fe26dc31b422d88e4a3d2ec203fc7d5061286d2fad3a439a3f0fa3536dda8d6a
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/7YTNYMNUELMI4SR5F3BAH7D5KB \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: fe26dc31b422d88e4a3d2ec203fc7d5061286d2fad3a439a3f0fa3536dda8d6a
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "29d2b49f308511804352e18309295d4a3fa4576924d2b23a7b2a012b54df781f",
"cross_cats_sorted": [
"cs.CL"
],
"license": "http://creativecommons.org/licenses/by-sa/4.0/",
"primary_cat": "cs.CV",
"submitted_at": "2023-05-13T11:28:37Z",
"title_canon_sha256": "752f574bb06201a05cecfb174a1559187447237c80e2df7daf18a0cc7aad28f9"
},
"schema_version": "1.0",
"source": {
"id": "2305.07895",
"kind": "arxiv",
"version": 7
}
}