Seven clinician-informed safety criteria enable LLM-as-a-Judge to reach substantial agreement with human consensus (Cohen's κ up to 0.75) on evaluating LLM responses to users demonstrating psychosis.
Title resolution pending
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 1polarities
background 1representative citing papers
CNSocialDepress is a new benchmark dataset containing 44,178 Chinese social media posts annotated by experts with binary depression risk labels and multidimensional psychological attributes for fine-grained analysis.
EmBot combines wearable-triggered stress detection with LLM conversational support and was probed via expert interviews to surface design considerations for daily stress management.
PsychAgent combines memory-augmented planning, trajectory-based skill evolution, and rejection fine-tuning to create a self-improving AI psychological counselor that outperforms general LLMs in multi-session evaluations.
citing papers explorer
-
Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis
Seven clinician-informed safety criteria enable LLM-as-a-Judge to reach substantial agreement with human consensus (Cohen's κ up to 0.75) on evaluating LLM responses to users demonstrating psychosis.
-
CNSocialDepress: A Chinese Social Media Dataset for Depression Risk Detection and Structured Analysis
CNSocialDepress is a new benchmark dataset containing 44,178 Chinese social media posts annotated by experts with binary depression risk labels and multidimensional psychological attributes for fine-grained analysis.
-
Exploring Expert Perspectives on Wearable-Triggered LLM Conversational Support for Daily Stress Management
EmBot combines wearable-triggered stress detection with LLM conversational support and was probed via expert interviews to surface design considerations for daily stress management.
-
PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor
PsychAgent combines memory-augmented planning, trajectory-based skill evolution, and rejection fine-tuning to create a self-improving AI psychological counselor that outperforms general LLMs in multi-session evaluations.