Safe-Child-LLM supplies a new developmental benchmark dataset and evaluation protocol that exposes safety gaps in leading LLMs when handling child and adolescent users.
66.Evaluating the accuracy and readabil-ity of ChatGPT in providing parental guidance for ade-noidectomy, tonsillectomy, and ventilation tube insertion surgery,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CY 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Safe-Child-LLM: A Developmental Benchmark for Evaluating LLM Safety in Child-LLM Interactions
Safe-Child-LLM supplies a new developmental benchmark dataset and evaluation protocol that exposes safety gaps in leading LLMs when handling child and adolescent users.