A Bayesian framework disentangles topic, agreement, and anchoring biases from interaction effects in LLM multi-turn dialogues, revealing convergence to attractors that shift with fine-tuning.
Strategies for integrating disparate social information.Proceedings of the Royal Society B, 287 (1939):20202413, 2020
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
physics.soc-ph 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Disentangling Interaction and Bias Effects in Opinion Dynamics of Large Language Models
A Bayesian framework disentangles topic, agreement, and anchoring biases from interaction effects in LLM multi-turn dialogues, revealing convergence to attractors that shift with fine-tuning.