pith:ML5KRKZ2
Capabilities of Gemini Models in Medicine
Med-Gemini models reach 91.1 percent accuracy on USMLE medical questions and surpass GPT-4 on medical benchmarks.
arxiv:2404.18416 v2 · 2024-04-29 · cs.AI · cs.CL · cs.CV · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{ML5KRKZ2U3SKXDRGO4CBBASGI4}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Our best-performing Med-Gemini model achieves SoTA performance of 91.1% accuracy on MedQA (USMLE) using a novel uncertainty-guided search strategy, surpasses the GPT-4 model family on every benchmark where direct comparison is viable, and improves over GPT-4V by an average relative margin of 44.5% on 7 multimodal benchmarks.
That benchmark accuracy on curated medical datasets (MedQA, NEJM Image Challenges, MMMU health subset, etc.) will translate to reliable performance and safety in real clinical workflows with noisy, incomplete, or out-of-distribution patient data.
Med-Gemini sets new records on 10 of 14 medical benchmarks including 91.1% on MedQA-USMLE, beats GPT-4V by 44.5% on multimodal tasks, and surpasses humans on medical text summarization.
References
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:50.766478Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
62faa8ab3aa6e4ab8e2677041082464732608b331e66262eae1e44c5fcb3f97a
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/ML5KRKZ2U3SKXDRGO4CBBASGI4 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 62faa8ab3aa6e4ab8e2677041082464732608b331e66262eae1e44c5fcb3f97a
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "caee117c34f877923c1935488b44d1956325b13f0bab70c31f5478b87fe76cc7",
"cross_cats_sorted": [
"cs.CL",
"cs.CV",
"cs.LG"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.AI",
"submitted_at": "2024-04-29T04:11:28Z",
"title_canon_sha256": "cb50dbe911bb83a2d23526805ba180095681824fbf212e77b5fbe93c1962eff6"
},
"schema_version": "1.0",
"source": {
"id": "2404.18416",
"kind": "arxiv",
"version": 2
}
}