MedXpertQA is a new benchmark of 4,460 rigorously filtered expert medical questions, including multimodal cases with patient records and images, designed to evaluate advanced AI reasoning more stringently than prior datasets like MedQA.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2025 1verdicts
ACCEPT 1representative citing papers
citing papers explorer
-
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
MedXpertQA is a new benchmark of 4,460 rigorously filtered expert medical questions, including multimodal cases with patient records and images, designed to evaluate advanced AI reasoning more stringently than prior datasets like MedQA.