Trustworthy medical question answering: An evaluation-centric survey

[Wanget al · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Quantifying Hallucinations in Language Language Models on Medical Textbooks

cs.CL · 2026-02-12 · conditional · novelty 5.0

LLMs hallucinate in 19.7% of textbook-grounded medical QA answers despite high plausibility scores, indicating they remain unfit for unsupervised clinical use.

citing papers explorer

Showing 1 of 1 citing paper.

Quantifying Hallucinations in Language Language Models on Medical Textbooks cs.CL · 2026-02-12 · conditional · none · ref 20
LLMs hallucinate in 19.7% of textbook-grounded medical QA answers despite high plausibility scores, indicating they remain unfit for unsupervised clinical use.

Trustworthy medical question answering: An evaluation-centric survey

fields

years

verdicts

representative citing papers

citing papers explorer