AI writing distorts perceived writer personas across 29 dimensions in large experiments, and reward-model mitigation reduces but does not eliminate user preference for the AI.
Human heuristics for ai-generated language are flawed.Proceedings of the National Academy of Sciences, 120(11):e2208839120, 2023
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
Measuring and Mitigating Persona Distortions from AI Writing Assistance
AI writing distorts perceived writer personas across 29 dimensions in large experiments, and reward-model mitigation reduces but does not eliminate user preference for the AI.