Political compass or spinning arrow? towards more meaningful evaluations for values and opinions in large language models

Paul R¨ottger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Rose Kirk, Hinrich Sch¨utze, Dirk Hovy · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Reducing Political Manipulation with Consistency Training

cs.CL · 2026-05-21 · unverdicted · novelty 5.0 · 2 refs

PCT is a reinforcement learning approach that trains LLMs for symmetric sentiment and helpfulness across paired opposing political prompts, reducing covert bias while preserving general performance.

citing papers explorer

Showing 1 of 1 citing paper.

Reducing Political Manipulation with Consistency Training cs.CL · 2026-05-21 · unverdicted · none · ref 29 · 2 links
PCT is a reinforcement learning approach that trains LLMs for symmetric sentiment and helpfulness across paired opposing political prompts, reducing covert bias while preserving general performance.

Political compass or spinning arrow? towards more meaningful evaluations for values and opinions in large language models

fields

years

verdicts

representative citing papers

citing papers explorer