PubMed-Ophtha is a new hierarchical dataset of 102k ophthalmological image-caption pairs from 15k+ PubMed articles, with full-resolution PDF extraction, panel splitting, modality annotations, and released extraction models plus pipeline.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2026 (1)
Verdicts: ACCEPT (1)
Representative citing papers
PubMed-Ophtha: An open resource for training ophthalmology vision-language models on scientific literature