Sycophancy in vision-language models: A systematic analysis and an inference-time mitigation framework.arXiv preprint arXiv:2408.11261

Yunpu Zhao, Rui Zhang, Junbin Xiao, Changxin Ke, Ruibo Hou, Yifan Hao, Ling Li · arXiv 2408.11261

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Beyond Social Pressure: Benchmarking Epistemic Attack in Large Language Models

cs.CL · 2026-04-09 · unverdicted · novelty 7.0

PPT-Bench measures how LLMs change answers under epistemic, value, authority, and identity pressures at baseline, single-turn, and multi-turn levels, finding separable inconsistency patterns across five models.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Social Pressure: Benchmarking Epistemic Attack in Large Language Models cs.CL · 2026-04-09 · unverdicted · none · ref 22
PPT-Bench measures how LLMs change answers under epistemic, value, authority, and identity pressures at baseline, single-turn, and multi-turn levels, finding separable inconsistency patterns across five models.

Sycophancy in vision-language models: A systematic analysis and an inference-time mitigation framework.arXiv preprint arXiv:2408.11261

fields

years

verdicts

representative citing papers

citing papers explorer