Large Language Models in Mental Health Care: a Scoping Review
Objective: This review aims to deliver a comprehensive analysis of the use of Large Language Models (LLMs) in mental health care, evaluating their effectiveness, identifying challenges, and exploring their potential for future application. Materials and Methods: A systematic search was performed across multiple databases including PubMed, Web of Science, Google Scholar, arXiv, medRxiv, and PsyArXiv in November 2023. The review includes all types of original research, regardless of peer-review status, published or disseminated between October 1, 2019, and December 2, 2023. Studies were included without language restrictions if they employed LLMs developed after T5 and directly investigated research questions within mental health care settings. Results: Out of an initial 313 articles, 34 were selected based on their relevance to LLM applications in mental health care and the rigor of their reported outcomes. The review identified various LLM applications in mental health care, including diagnostics, therapy, and enhancing patient engagement. Key challenges highlighted were related to data availability and reliability, the nuanced handling of mental states, and effective evaluation methods. While LLMs showed promise in improving accuracy and accessibility, significant gaps in clinical applicability and ethical considerations were noted. Conclusion: LLMs hold substantial promise for enhancing mental health care. For their full potential to be realized, emphasis must be placed on developing robust datasets, development and evaluation frameworks, ethical guidelines, and interdisciplinary collaborations to address current limitations.
Forward citations
Cited by 5 Pith papers
-
One Year Later...The Harms Persist, But So Do We!
Evaluation of eight LLMs across 16 DSM-5 conditions shows safety guardrails succeed only for suicide/self-harm while failing at rates up to 100% for other clinical issues, supported by a new eight-dimension harm taxonomy.
-
One Year Later...The Harms Persist, But So Do We!
LLM safety guardrails fail for most mental health conditions with up to 100% failure rates for eating disorders, substance use disorder, and major depressive disorder, while holding only for suicide and self-harm.
-
DySRec: Dynamic Context-Aware Psychometric Scale Recommendation via Multi-Agent Collaboration
DySRec is a multi-agent conversational system that dynamically recommends psychometric scales by integrating user context, behaviors, and risk signals through interactive dialogue and closed-loop refinement.
-
One Year Later...The Harms Persist, But So Do We!
Six proprietary LLMs were tested across 16 DSM-5 conditions with four adversarial variants; safeguards succeeded only for suicide/self-harm while failing at high rates for eating disorders, substance use, and major de...
-
Beyond Accuracy: Interpreting Topic Representation in Suicide Ideation Detection Models
Topic-aware augmentation makes psychosocial risk factors such as immigration, family issues, and financial crisis more distinct and coherent in the internal representations of suicide ideation detection models.