PatientSafeBench: Evaluating the Safety of Medical LLMs for Patient Use

Kim, M · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

CareGuardAI: Context-Aware Multi-Agent Guardrails for Clinical Safety & Hallucination Mitigation in Patient-Facing LLMs

cs.CY · 2026-04-07 · unverdicted · novelty 5.0

CareGuardAI introduces dual risk assessments (SRA and HRA) and a multi-stage agent pipeline that only releases LLM responses when both risks score at or below 2, outperforming GPT-4o-mini on PatientSafeBench, MedSafetyBench, and MedHallu.

citing papers explorer

Showing 1 of 1 citing paper.

CareGuardAI: Context-Aware Multi-Agent Guardrails for Clinical Safety & Hallucination Mitigation in Patient-Facing LLMs cs.CY · 2026-04-07 · unverdicted · none · ref 20
CareGuardAI introduces dual risk assessments (SRA and HRA) and a multi-stage agent pipeline that only releases LLM responses when both risks score at or below 2, outperforming GPT-4o-mini on PatientSafeBench, MedSafetyBench, and MedHallu.

PatientSafeBench: Evaluating the Safety of Medical LLMs for Patient Use

fields

years

verdicts

representative citing papers

citing papers explorer