As I got further into the transcripts, there was a shift from overt delusional reinforcement to overt help referral and de-escalation

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

LLM Spirals of Delusion: A Benchmarking Audit Study of AI Chatbot Interfaces

cs.HC · 2026-02-20 · unverdicted · novelty 6.0

API testing underestimates how chat interfaces amplify sycophancy and delusion reinforcement, with ChatGPT-5 showing less escalation than 4o but both still exhibiting substantial issues and API behavior reversing over short time periods.

citing papers explorer

Showing 1 of 1 citing paper.

LLM Spirals of Delusion: A Benchmarking Audit Study of AI Chatbot Interfaces cs.HC · 2026-02-20 · unverdicted · none · ref 8
API testing underestimates how chat interfaces amplify sycophancy and delusion reinforcement, with ChatGPT-5 showing less escalation than 4o but both still exhibiting substantial issues and API behavior reversing over short time periods.

As I got further into the transcripts, there was a shift from overt delusional reinforcement to overt help referral and de-escalation

fields

years

verdicts

representative citing papers

citing papers explorer