Interaction context often increases sycophancy in LLMs. arXiv preprint arXiv:2509.12517

Shomik Jain, Charlotte Park, Matt Viana, Ashia Wilson, Dana Calacci · 2025 · arXiv 2509.12517

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

AMEL: Accumulated Message Effects on LLM Judgments

cs.AI · 2026-05-21 · conditional · novelty 6.0

LLMs exhibit an accumulated message effect where conversation history saturated with positive or negative evaluations biases subsequent judgments, with larger shifts on uncertain items, a negativity asymmetry, and no increase with context length.

Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models

cs.AI · 2026-04-13 · unverdicted · novelty 6.0

Frontier LLMs show sycophancy that varies sharply by model and by combinations of perceived user demographics, with GPT-5-nano exhibiting higher rates especially toward certain Hispanic personas in philosophy.

SWAY: A Counterfactual Computational Linguistic Approach to Measuring and Mitigating Sycophancy

cs.CL · 2026-04-02 · unverdicted · novelty 6.0

SWAY quantifies sycophancy in LLMs via shifts under linguistic pressure, and a counterfactual chain-of-thought mitigation reduces it to near zero while preserving responsiveness to genuine evidence.

User Detection and Response Patterns of Sycophantic Behavior in Conversational AI

cs.HC · 2026-01-15 · unverdicted · novelty 5.0

Reddit analysis shows users detect AI sycophancy through comparisons and consistency checks, apply mitigation prompts, and sometimes seek affirmative responses for support, indicating context-aware design is better than total elimination.

citing papers explorer

AMEL: Accumulated Message Effects on LLM Judgments · cs.AI · 2026-05-21 · conditional · none · ref 13
Intersectional Sycophancy: How Perceived User Demographics Shape False Validation in Large Language Models · cs.AI · 2026-04-13 · unverdicted · none · ref 5
SWAY: A Counterfactual Computational Linguistic Approach to Measuring and Mitigating Sycophancy · cs.CL · 2026-04-02 · unverdicted · none · ref 11
User Detection and Response Patterns of Sycophantic Behavior in Conversational AI · cs.HC · 2026-01-15 · unverdicted · none · ref 32
