Identify any entities mentioned 3

Extract the exact statement text 2

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Blending Human and LLM Expertise to Detect Hallucinations and Omissions in Mental Health Chatbot Responses

cs.CL · 2026-03-17 · unverdicted · novelty 5.0

Hybrid human-LLM features let traditional ML models reach 0.717-0.849 F1 for hallucination detection and 0.59-0.64 F1 for omissions in mental health data, beating LLM judges at 52% accuracy.

citing papers explorer

Showing 1 of 1 citing paper.

Blending Human and LLM Expertise to Detect Hallucinations and Omissions in Mental Health Chatbot Responses cs.CL · 2026-03-17 · unverdicted · none · ref 6
Hybrid human-LLM features let traditional ML models reach 0.717-0.849 F1 for hallucination detection and 0.59-0.64 F1 for omissions in mental health data, beating LLM judges at 52% accuracy.

Identify any entities mentioned 3

fields

years

verdicts

representative citing papers

citing papers explorer