A new expanded Polish medical exam benchmark with structural modifications shows top LLMs drop 28-31 percentage points, indicating standard MCQA overestimates true medical competence.
In Healthcare, volume 12, page 1637
1 Pith paper cites this work.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: CONDITIONAL (1)
Representative citing papers:
- Reassessing High-Performing LLMs on Polish Medical Exams: True Competence or Bias-Driven Performance?