Introduces Political Consistency Training (PCT) with sentiment and helpfulness consistency objectives to reduce covert political bias in LLMs while preserving helpfulness.
The self-perception and political biases of ChatGPT.Human Behavior and Emerging Technologies, 2024
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2verdicts
UNVERDICTED 2representative citing papers
Training data for open LLMs is systematically left-leaning, with pre-training corpora containing more political material than post-training data and model stances aligning with data distributions.
citing papers explorer
-
Reducing Political Manipulation with Consistency Training
Introduces Political Consistency Training (PCT) with sentiment and helpfulness consistency objectives to reduce covert political bias in LLMs while preserving helpfulness.
-
What Is The Political Content in LLMs' Pre- and Post-Training Data?
Training data for open LLMs is systematically left-leaning, with pre-training corpora containing more political material than post-training data and model stances aligning with data distributions.