pith. sign in

arxiv: 2504.09689 · v3 · pith:WOQ54IMEnew · submitted 2025-04-13 · 💻 cs.AI · cs.CL· cs.CY· cs.HC· cs.LG

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

classification 💻 cs.AI cs.CLcs.CYcs.HCcs.LG
keywords mentalemoagentusersdeteriorationhealthinteractionspsychologicalrisks
0
0 comments X
read the original abstract

The rise of LLM-driven AI characters raises safety concerns, particularly for vulnerable human users with psychological disorders. To address these risks, we propose EmoAgent, a multi-agent AI framework designed to evaluate and mitigate mental health hazards in human-AI interactions. EmoAgent comprises two components: EmoEval simulates virtual users, including those portraying mentally vulnerable individuals, to assess mental health changes before and after interactions with AI characters. It uses clinically proven psychological and psychiatric assessment tools (PHQ-9, PDI, PANSS) to evaluate mental risks induced by LLM. EmoGuard serves as an intermediary, monitoring users' mental status, predicting potential harm, and providing corrective feedback to mitigate risks. Experiments conducted in popular character-based chatbots show that emotionally engaging dialogues can lead to psychological deterioration in vulnerable users, with mental state deterioration in more than 34.4% of the simulations. EmoGuard significantly reduces these deterioration rates, underscoring its role in ensuring safer AI-human interactions. Our code is available at: https://github.com/1akaman/EmoAgent

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Structure Matters: Evaluating Multi-Agents Orchestration in Generative Therapeutic Chatbots

    cs.HC 2026-02 unverdicted novelty 6.0

    A multi-agent system with finite state machine for therapeutic stages was perceived as significantly more natural and human-like than single-agent or unguided LLM versions in an RCT with 66 participants.