Pith Number
pith:NDIP27JO
pith:2020:NDIP27JOQ2GI5TEW5OSJGK6HCP
not attested
not anchored
not stored
refs resolved
PathVQA: 30000+ Questions for Medical Visual Question Answering
The first pathology visual question answering dataset is created with 32,799 manually verified questions from 4,998 images.
arxiv:2003.10286 v1 · 2020-03-07 · cs.CL · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{NDIP27JOQ2GI5TEW5OSJGK6HCP}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
1
Bitcoin timestamp
2
Internet Archive
3
Author claim
· sign in to
claim
4
Citations
5
Replications
✓
Portable graph bundle live · download bundle · merged
state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same
current state with the deterministic merge algorithm.
Claims
C1strongest claim
To our best knowledge, this is the first dataset for pathology VQA.
C2weakest assumption
That questions automatically generated from image captions using NLP, followed by manual checks, produce medically accurate and representative questions that pathologists would actually ask when viewing the images.
C3one line summary
PathVQA is the first public dataset of over 32,000 questions on nearly 5,000 pathology images for medical visual question answering.
References
[1] Vqa: Visual question answering
[2] A multi-world approach to question answering about real-world scenes based on uncertain input
[3] Image question answering: A visual semantic embedding model and a new dataset
[4] Clevr: A diagnostic dataset for compositional language and elementary visual reasoning
[5] Making the v in vqa matter: Elevating the role of image understanding in visual question answering
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:53.418891Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
68d0fd7d2e868c8ecc96eba4932bc713fa6fcd7c8a770e0219c54df830b848ba
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/NDIP27JOQ2GI5TEW5OSJGK6HCP \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 68d0fd7d2e868c8ecc96eba4932bc713fa6fcd7c8a770e0219c54df830b848ba
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "7d90650c41118c44b4714d4f5e346c41bc98b2570e8867fb21fd37f16b77d852",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2020-03-07T17:55:41Z",
"title_canon_sha256": "81632873afec56f9215e42e974206a080dc643bae35d6ca32526915cfdae1b6e"
},
"schema_version": "1.0",
"source": {
"id": "2003.10286",
"kind": "arxiv",
"version": 1
}
}