A pipeline generates 1,732 valid FHIR bundles from clinician cases, revealing lower LLM diagnostic accuracy on structured inputs than on plain text.
arXiv:2603.11413 [cs]
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
MedCase-Structured: A Text-to-FHIR Dataset for Benchmarking Diagnostic Reasoning in Clinically Realistic EHR Settings
A pipeline generates 1,732 valid FHIR bundles from clinician cases, revealing lower LLM diagnostic accuracy on structured inputs than on plain text.