Large Language Models in Mental Health Care: a Scoping Review

Andrew Beam; David A. Clifton; Fenglin Liu; Hongbin Na; John Torous; Kailai Yang; Lauren V. Moran; Peilin Zhou; Sophia Ananiadou; Yi-han Sheu

arxiv: 2401.02984 · v3 · pith:U6UFQGV3new · submitted 2024-01-01 · 💻 cs.CL · cs.AI

Large Language Models in Mental Health Care: a Scoping Review

Yining Hua , Fenglin Liu , Kailai Yang , Zehan Li , Hongbin Na , Yi-han Sheu , Peilin Zhou , Lauren V. Moran

show 4 more authors

Sophia Ananiadou David A. Clifton Andrew Beam John Torous

This is my paper

classification 💻 cs.CL cs.AI

keywords mentalcarehealthllmsreviewwerelanguageapplications

0 comments

read the original abstract

Objectieve:This review aims to deliver a comprehensive analysis of Large Language Models (LLMs) utilization in mental health care, evaluating their effectiveness, identifying challenges, and exploring their potential for future application. Materials and Methods: A systematic search was performed across multiple databases including PubMed, Web of Science, Google Scholar, arXiv, medRxiv, and PsyArXiv in November 2023. The review includes all types of original research, regardless of peer-review status, published or disseminated between October 1, 2019, and December 2, 2023. Studies were included without language restrictions if they employed LLMs developed after T5 and directly investigated research questions within mental health care settings. Results: Out of an initial 313 articles, 34 were selected based on their relevance to LLMs applications in mental health care and the rigor of their reported outcomes. The review identified various LLMs applications in mental health care, including diagnostics, therapy, and enhancing patient engagement. Key challenges highlighted were related to data availability and reliability, the nuanced handling of mental states, and effective evaluation methods. While LLMs showed promise in improving accuracy and accessibility, significant gaps in clinical applicability and ethical considerations were noted. Conclusion: LLMs hold substantial promise for enhancing mental health care. For their full potential to be realized, emphasis must be placed on developing robust datasets, development and evaluation frameworks, ethical guidelines, and interdisciplinary collaborations to address current limitations.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

One Year Later...The Harms Persist, But So Do We!
cs.CL 2026-06 unverdicted novelty 5.0

Evaluation of eight LLMs across 16 DSM-5 conditions shows safety guardrails succeed only for suicide/self-harm while failing at rates up to 100% for other clinical issues, supported by a new eight-dimension harm taxonomy.
One Year Later...The Harms Persist, But So Do We!
cs.CL 2026-06 unverdicted novelty 5.0

LLM safety guardrails fail for most mental health conditions with up to 100% failure rates for eating disorders, substance use disorder, and major depressive disorder, while holding only for suicide and self-harm.
DySRec: Dynamic Context-Aware Psychometric Scale Recommendation via Multi-Agent Collaboration
cs.HC 2026-05 unverdicted novelty 5.0

DySRec is a multi-agent conversational system that dynamically recommends psychometric scales by integrating user context, behaviors, and risk signals through interactive dialogue and closed-loop refinement.
One Year Later...The Harms Persist, But So Do We!
cs.CL 2026-06 unverdicted novelty 4.0

Six proprietary LLMs were tested across 16 DSM-5 conditions with four adversarial variants; safeguards succeeded only for suicide/self-harm while failing at high rates for eating disorders, substance use, and major de...
Beyond Accuracy: Interpreting Topic Representation in Suicide Ideation Detection Models
cs.LG 2026-06 unverdicted novelty 4.0

Topic-aware augmentation makes psychosocial risk factors such as immigration, family issues, and financial crisis more distinct and coherent in the internal representations of suicide ideation detection models.