pith. sign in

arxiv: 2606.17657 · v1 · pith:XUY6VAYXnew · submitted 2026-06-16 · 💻 cs.AI

Using Cognitive Models to Improve Language Model Simulation of Human Persuasion Games

Pith reviewed 2026-06-27 01:00 UTC · model grok-4.3

classification 💻 cs.AI
keywords cognitive modelspersuasion gameslanguage model simulationreinforcement learningbelief updatingBayesian updatingmotivated reasoningdecision making
0
0 comments X

The pith

Cognitive models from psychology and economics let language models simulate varied human belief updates in persuasion games.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Humans update beliefs differently in strategic settings, with some following Bayesian rules while others show biases such as motivated reasoning or affine distortions. Large language models used for simulation often default to narrow assumptions and miss this range. The paper tests prompting large models and reinforcement learning on small models to match specific mathematical cognitive models in legal persuasion games. Reinforcement learning reduces belief prediction error by 26.5 percent on unseen cases, and training on multiple model types raises persuasion success by 2.5 to 12 percent over Bayesian-only baselines. These varied simulations then serve as richer training environments for other models.

Core claim

Equation-to-Behavior Prompting lets large models approximate cognitive models of belief updating while Equation-to-Behavior RL trains small models to follow the same equations, cutting belief error by 26.5 percent out of distribution. Training small models on multiple decision-maker types (Bayesian, affine distortion, motivated updating, Grether's α-β) improves average belief change by 2.5-12 percent compared with Bayesian-only training, including when the target is GPT-5-mini.

What carries the argument

Equation-to-Behavior RL, which uses reinforcement learning to make small models produce outputs that match the mathematical rules of cognitive models during persuasion game turns.

If this is right

  • RL training cuts belief error by 26.5 percent for small models on out-of-distribution parameter settings.
  • Diverse cognitive-model training raises average belief change by 2.5 to 12 percent over Bayesian-only training.
  • The resulting simulations supply varied training environments that improve persuasion performance even against GPT-5-mini.
  • Small models can approximate the target cognitive models after RL training where prompting alone fails.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The method could be tested on strategic games outside legal decision-making to check whether the same gains appear.
  • It opens a route to studying cognitive models that are mathematically defined but too intricate for direct human experiments.
  • Safety evaluations that rely on simulated humans could incorporate bias patterns that current Bayesian defaults omit.

Load-bearing premise

The four listed cognitive models and the legal decision-making persuasion games capture enough of the variety in human strategic behavior for the simulations to be useful.

What would settle it

Running the same persuasion games with real human participants as the decision-makers and checking whether belief changes match the patterns produced by the RL-trained simulations more closely than Bayesian-only simulations.

read the original abstract

People make decisions differently in strategic interactions. Some update beliefs like a Bayesian; others exhibit biases like motivated reasoning. Although creators of large language models use simulated humans for safety evaluations and training, they often fail to cover this breadth of human behavior. We argue that cognitive science and economics provide a convenient tool for doing so, making use of mathematical models of human decision-making. We propose an approach that we call Equation-to-Behavior Prompting for guiding large language models to match cognitive models, and evaluate this approach on persuasion games based on legal decision-making. We find that large models can approximate equation-based specifications -- Bayesian updating, affine distortion, motivated updating, and Grether's $\alpha$-$\beta$ model -- using prompting, but small models fail to do so. However, training small models with reinforcement learning to adhere to mathematical rules, Equation-to-Behavior RL, reduces belief error by 26.5% in out-of-distribution parameterizations. We show that these simulations can help create diverse training environments; training small models to consider different kinds of decision-makers improves average belief change by 2.5%--12% over Bayesian-only training, even when persuading GPT-5-mini. Our work could improve human simulations for training and evaluation in increasingly realistic settings, and could also enable novel research into more complicated mathematical models of human decision-making.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper claims that cognitive models from economics and cognitive science (Bayesian updating, affine distortion, motivated updating, Grether's α-β) can be used to improve LLM-based simulations of human behavior in legal persuasion games. It introduces Equation-to-Behavior Prompting, which works for large models, and Equation-to-Behavior RL, which reduces belief error by 26.5% for small models on out-of-distribution parameterizations. It further claims that training against diverse simulators (rather than Bayesian-only) yields 2.5–12% gains in average belief change, even when the target is GPT-5-mini.

Significance. If the central modeling assumption holds, the work supplies a concrete mechanism for injecting mathematically specified heterogeneity into LLM simulators, which could strengthen training environments for safety and persuasion tasks. The RL approach for enforcing equation adherence on small models is a clear technical contribution. However, the significance remains conditional on the untested claim that fidelity to these four models produces behavior closer to actual human subjects than standard Bayesian baselines.

major comments (2)
  1. [Abstract] Abstract: the headline results (26.5% belief-error reduction; 2.5–12% gains) are reported solely with respect to the target equations themselves. No human-subject data, external validation set, or comparison to observed human belief updating is described, so the claim that the resulting simulators are “more realistic” for human persuasion games lacks an empirical anchor.
  2. [Abstract] Abstract: the persuasion-game domain is described only as “based on legal decision-making,” with no specification of the game rules, payoff structure, participant pool, or how the four cognitive models were selected or calibrated to that domain. This makes it impossible to assess whether the models capture the breadth of strategic behavior the paper aims to reproduce.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback. We respond to each major comment below, acknowledging where the manuscript requires clarification or revision.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the headline results (26.5% belief-error reduction; 2.5–12% gains) are reported solely with respect to the target equations themselves. No human-subject data, external validation set, or comparison to observed human belief updating is described, so the claim that the resulting simulators are “more realistic” for human persuasion games lacks an empirical anchor.

    Authors: We agree that the evaluation centers on fidelity to the target cognitive models rather than new human-subject experiments. These models (Bayesian updating, affine distortion, motivated updating, Grether's α-β) are drawn from peer-reviewed literature in economics and cognitive science that has previously validated them against human data in belief-updating and persuasion settings. Our primary contribution is a method for injecting such mathematically specified heterogeneity into LLM simulators. We will revise the abstract and discussion sections to explicitly frame the realism claim as deriving from established model validity rather than direct empirical comparison in this work, and to note the absence of new human validation as a limitation. revision: partial

  2. Referee: [Abstract] Abstract: the persuasion-game domain is described only as “based on legal decision-making,” with no specification of the game rules, payoff structure, participant pool, or how the four cognitive models were selected or calibrated to that domain. This makes it impossible to assess whether the models capture the breadth of strategic behavior the paper aims to reproduce.

    Authors: The abstract is space-constrained, but the full manuscript details the game (a two-player legal persuasion setting with asymmetric information, specific payoff matrices, and belief elicitation). The four models were selected for their documented applicability to belief revision under persuasion, as established in the cited economics and psychology literature. We will expand the abstract with a concise description of the game structure and model selection rationale to address this concern. revision: yes

Circularity Check

0 steps flagged

No significant circularity in the derivation chain

full rationale

The paper's results rest on an empirical pipeline: (1) prompting or RL-training small models to match explicit target equations (Bayesian updating, affine distortion, motivated updating, Grether α-β) on training parameterizations, then (2) measuring reduced belief error on held-out OOD parameterizations of those same equations, and (3) using the resulting simulators to train persuaders and reporting average belief-change gains versus a Bayesian-only baseline when evaluated against GPT-5-mini. None of these steps reduces by construction to its own inputs; each is a standard train/test split or comparative training run whose metrics are computed against independently specified targets. No self-citations, uniqueness theorems, or ansatzes are invoked as load-bearing premises. The chain is therefore self-contained against the stated external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only; ledger reflects the domain assumptions stated in the abstract.

axioms (1)
  • domain assumption The listed cognitive models (Bayesian updating, affine distortion, motivated updating, Grether's α-β) are appropriate targets for LLM simulation of human belief updating in persuasion games.
    The entire approach rests on these models being useful stand-ins for human behavior.

pith-pipeline@v0.9.1-grok · 5774 in / 1330 out tokens · 59944 ms · 2026-06-27T01:00:48.922760+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

110 extracted references · 25 canonical work pages

  1. [1]

    AI-Assisted Moot Courts: Simulating Justice-Specific Questioning in Oral Arguments

    Kylie Zhang, Nimra Nadeem, Lucia Zheng, Dominik Stammbach, and Peter Henderson. AI-Assisted Moot Courts: Simulating Justice-Specific Questioning in Oral Arguments. InProceedings of the Symposium on Computer Science and Law, CSLAW ’26, pages 136–172, New York, NY , USA, May 2026. Association for Computing Machinery. ISBN 979-8-4007-2447-3. doi: 10.1145/378...

  2. [2]

    Commercial Persuasion in AI-Mediated Conversations, April 2026

    Francesco Salvi, Alejandro Cuevas, and Manoel Horta Ribeiro. Commercial Persuasion in AI-Mediated Conversations, April 2026. URL http://arxiv.org/abs/2604.04263. arXiv:2604.04263 [cs.CY]

  3. [3]

    Tappin, Luke Hewitt, Ed Saunders, Sid Black, Hause Lin, Catherine Fist, Helen Margetts, David G

    Kobi Hackenburg, Ben M. Tappin, Luke Hewitt, Ed Saunders, Sid Black, Hause Lin, Catherine Fist, Helen Margetts, David G. Rand, and Christopher Summerfield. The Levers of Political Persuasion with Conversational AI, July 2025. URL http://arxiv.org/abs/2507.13919. arXiv:2507.13919 [cs]

  4. [4]

    On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial, March 2024

    Francesco Salvi, Manoel Horta Ribeiro, Riccardo Gallotti, and Robert West. On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial, March 2024. URL http: //arxiv.org/abs/2403.14380. arXiv:2403.14380

  5. [5]

    Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI, November 2025

    Sharan Maiya, Henning Bartsch, Nathan Lambert, and Evan Hubinger. Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI, November 2025. URL http://arxiv.org/ abs/2511.01689. arXiv:2511.01689 [cs]

  6. [6]

    Con- sistently Simulating Human Personas with Multi-Turn Reinforcement Learning, October 2025

    Marwa Abdulhai, Ryan Cheng, Donovan Clay, Tim Althoff, Sergey Levine, and Natasha Jaques. Con- sistently Simulating Human Personas with Multi-Turn Reinforcement Learning, October 2025. URL http://arxiv.org/abs/2511.00222. arXiv:2511.00222 [cs]

  7. [7]

    Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning, August 2024

    Sriyash Poddar, Yanming Wan, Hamish Ivison, Abhishek Gupta, and Natasha Jaques. Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning, August 2024. URL http://arxiv.org/abs/2408.10075. arXiv:2408.10075 [cs]

  8. [8]

    Lipton, and Liu Leqi

    Xinyu Li, Ruiyang Zhou, Zachary C. Lipton, and Liu Leqi. Personalized Language Modeling from Personalized Human Feedback, December 2024. URL http://arxiv.org/abs/2402.05133. arXiv:2402.05133 [cs]. 12 Using Cognitive Models to Improve Language Model Simulation of Human Persuasion Games

  9. [9]

    Bayesian teaching enables probabilistic reasoning in large language models.Nature Communications, 17(1):1238, January

    Linlu Qiu, Fei Sha, Kelsey Allen, Yoon Kim, Tal Linzen, and Sjoerd van Steenkiste. Bayesian teaching enables probabilistic reasoning in large language models.Nature Communications, 17(1):1238, January

  10. [10]

    doi: 10.1038/s41467-025-67998-6

    ISSN 2041-1723. doi: 10.1038/s41467-025-67998-6. URL https://www.nature.com/ articles/s41467-025-67998-6

  11. [11]

    Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective, June 2024

    Fabian Falck, Ziyu Wang, and Chris Holmes. Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective, June 2024. URL http://arxiv.org/abs/2406.00793. arXiv:2406.00793 [stat]

  12. [12]

    An Explanation of In-context Learning as Implicit Bayesian Inference, July 2022

    Sang Michael Xie, Aditi Raghunathan, Percy Liang, and Tengyu Ma. An Explanation of In-context Learning as Implicit Bayesian Inference, July 2022. URLhttp://arxiv.org/abs/2111.02080. arXiv:2111.02080 [cs]

  13. [13]

    Peterson, Ilia Sucholutsky, and Thomas L

    Ryan Liu, Jiayi Geng, Joshua C. Peterson, Ilia Sucholutsky, and Thomas L. Griffiths. Large Language Models Assume People are More Rational than We Really are, March 2025. URL http://arxiv. org/abs/2406.17055. arXiv:2406.17055 [cs]

  14. [14]

    Cognitive model priors for predicting human decisions

    David D Bourgin, Joshua C Peterson, Daniel Reichman, Stuart J Russell, and Thomas L Griffiths. Cognitive model priors for predicting human decisions. InInternational Conference on Machine Learning, pages 5133–5141, 2019. URLhttps://proceedings.mlr.press/v97/peterson19a.html

  15. [15]

    Alternatives to B ayesian updating

    Pietro Ortoleva. Alternatives to Bayesian Updating.Annual Review of Economics, 16 (V olume 16, 2024):545–570, August 2024. ISSN 1941-1383, 1941-1391. doi: 10.1146/ annurev-economics-100223-050352. URL https://www.annualreviews.org/content/ journals/10.1146/annurev-economics-100223-050352

  16. [16]

    Bayesian Persuasion.American Economic Review, 101(6):2590– 2615, October 2011

    Emir Kamenica and Matthew Gentzkow. Bayesian Persuasion.American Economic Review, 101(6):2590– 2615, October 2011. ISSN 0002-8282. doi: 10.1257/aer.101.6.2590. URL https://www.aeaweb. org/articles?id=10.1257/aer.101.6.2590

  17. [17]

    Non-Bayesian Persuasion.Journal of Political Economy, 130 (10):2594–2642, October 2022

    Geoffroy De Clippel and Xu Zhang. Non-Bayesian Persuasion.Journal of Political Economy, 130 (10):2594–2642, October 2022. ISSN 0022-3808, 1537-534X. doi: 10.1086/720464. URL https: //www.journals.uchicago.edu/doi/10.1086/720464

  18. [18]

    The old bailey proceedings online, 1674–1913

    Tim Hitchcock, Robert Shoemaker, Clive Emsley, Sharon Howard, and Jamie McLaughlin. The old bailey proceedings online, 1674–1913. https://www.oldbaileyonline.org, 2023. Version 9.0, Autumn 2023

  19. [19]

    Bowman, Tim Rocktäschel, and Ethan Perez

    Akbir Khan, John Hughes, Dan Valentine, Laura Ruis, Kshitij Sachan, Ansh Radhakrishnan, Edward Grefenstette, Samuel R. Bowman, Tim Rocktäschel, and Ethan Perez. Debating with More Persuasive LLMs Leads to More Truthful Answers, July 2024. URL http://arxiv.org/abs/2402.06782. arXiv:2402.06782 [cs]

  20. [20]

    How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis, February 2024

    Federico Bianchi, Patrick John Chia, Mert Yuksekgonul, Jacopo Tagliabue, Dan Jurafsky, and James Zou. How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis, February 2024. URL http://arxiv.org/abs/2402.05863. arXiv:2402.05863 [cs]

  21. [21]

    Job Market Signaling.The Quarterly Journal of Economics, 87(3):355, August

    Michael Spence. Job Market Signaling.The Quarterly Journal of Economics, 87(3):355, August

  22. [22]

    doi: 10.2307/1882010

    ISSN 00335533. doi: 10.2307/1882010. URL https://academic.oup.com/qje/ article-lookup/doi/10.2307/1882010. 13 Using Cognitive Models to Improve Language Model Simulation of Human Persuasion Games

  23. [23]

    and Sobel, Joel , year = 1982, journal =

    Vincent P. Crawford and Joel Sobel. Strategic Information Transmission.Econometrica, 50(6):1431, November 1982. ISSN 00129682. doi: 10.2307/1913390. URL https://www.jstor.org/ stable/1913390?origin=crossref

  24. [24]

    Grossman

    Sanford J. Grossman. The Informational Role of Warranties and Private Disclosure about Product Quality. The Journal of Law and Economics, 24(3):461–483, December 1981. ISSN 0022-2186, 1537-5285. doi: 10.1086/466995. URLhttps://www.journals.uchicago.edu/doi/10.1086/466995

  25. [25]

    Paul R. Milgrom. Good News and Bad News: Representation Theorems and Applications.The Bell Journal of Economics, 12(2):380, 1981. ISSN 0361915X. doi: 10.2307/3003562. URL https://www. jstor.org/stable/3003562?origin=crossref

  26. [26]

    Sunstein, and Russell Golman

    George Loewenstein, Cass R. Sunstein, and Russell Golman. Disclosure: Psychology Changes Everything. Annual Review of Economics, 6(V olume 6, 2014):391–419, August 2014. ISSN 1941-1383, 1941-1391. doi: 10.1146/annurev-economics-080213-041341. URL https://www.annualreviews.org/ content/journals/10.1146/annurev-economics-080213-041341 . Publisher: Annual Reviews

  27. [27]

    Information Design With Large Language Models, September 2025

    Paul Duetting, Safwan Hossain, Tao Lin, Renato Paes Leme, Sai Srivatsa Ravindranath, Haifeng Xu, and Song Zuo. Information Design With Large Language Models, September 2025. URL http: //arxiv.org/abs/2509.25565. arXiv:2509.25565 [cs]

  28. [28]

    Persuasion, Delegation, and Private Information in Algorithm-Assisted Decisions, February

    Ruqing Xu. Persuasion, Delegation, and Private Information in Algorithm-Assisted Decisions, February

  29. [29]

    arXiv:2402.09384 [econ]

    URLhttp://arxiv.org/abs/2402.09384. arXiv:2402.09384 [econ]

  30. [30]

    Friend or Foe: Delegating to an AI Whose Alignment is Unknown, September 2025

    Drew Fudenberg and Annie Liang. Friend or Foe: Delegating to an AI Whose Alignment is Unknown, September 2025. URLhttp://arxiv.org/abs/2509.14396. arXiv:2509.14396 [econ]

  31. [31]

    Emergent Alignment via Competition, September 2025

    Natalie Collina, Surbhi Goel, Aaron Roth, Emily Ryu, and Mirah Shi. Emergent Alignment via Competition, September 2025. URLhttp://arxiv.org/abs/2509.15090. arXiv:2509.15090 [cs]

  32. [32]

    Allen, Thomas L

    Lance Ying, Ryan Truong, Prafull Sharma, Kaiya Ivy Zhao, Nathan Cloos, Kelsey R. Allen, Thomas L. Griffiths, Katherine M. Collins, José Hernández-Orallo, Phillip Isola, Samuel J. Gershman, and Joshua B. Tenenbaum. AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games, February 2026. URL http://arxiv.org/abs/2602.17...

  33. [33]

    NitroGen: An Open Foundation Model for Generalist Gaming Agents, January 2026

    Loïc Magne, Anas Awadalla, Guanzhi Wang, Yinzhen Xu, Joshua Belofsky, Fengyuan Hu, Joohwan Kim, Ludwig Schmidt, Georgia Gkioxari, Jan Kautz, Yisong Yue, Yejin Choi, Yuke Zhu, and Linxi "Jim" Fan. NitroGen: An Open Foundation Model for Generalist Gaming Agents, January 2026. URL http: //arxiv.org/abs/2601.02427. arXiv:2601.02427 [cs]

  34. [34]

    Measuring General Intelligence with Generated Games, May 2025

    Vivek Verma, David Huang, William Chen, Dan Klein, and Nicholas Tomlin. Measuring General Intelligence with Generated Games, May 2025. URL http://arxiv.org/abs/2505.07215. arXiv:2505.07215 [cs]

  35. [35]

    Efficacy of Language Model Self-Play in Non-Zero-Sum Games, December 2024

    Austen Liao, Nicholas Tomlin, and Dan Klein. Efficacy of Language Model Self-Play in Non-Zero-Sum Games, December 2024. URLhttp://arxiv.org/abs/2406.18872. arXiv:2406.18872 [cs]. 14 Using Cognitive Models to Improve Language Model Simulation of Human Persuasion Games

  36. [36]

    GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations.Advances in Neural Information Processing Systems, 37:28219– 28253, December 2024

    Jinhao Duan, Renming Zhang, James Diffenderfer, Bhavya Kailkhura, Lichao Sun, Elias Stengel-Eskin, Mohit Bansal, Tianlong Chen, and Kaidi Xu. GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations.Advances in Neural Information Processing Systems, 37:28219– 28253, December 2024. URL https://proceedings.neurips.cc/p...

  37. [37]

    SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning, June 2025

    Bo Liu, Leon Guertler, Simon Yu, Zichen Liu, Penghui Qi, Daniel Balcells, Mickel Liu, Cheston Tan, Weiyan Shi, Min Lin, Wee Sun Lee, and Natasha Jaques. SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning, June 2025. URL http: //arxiv.org/abs/2506.24119. arXiv:2506.24119 [cs]

  38. [38]

    Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models, October 2025

    Mickel Liu, Liwei Jiang, Yancheng Liang, Simon Shaolei Du, Yejin Choi, Tim Althoff, and Natasha Jaques. Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models, October 2025. URLhttp://arxiv.org/abs/2506.07468. arXiv:2506.07468 [cs]

  39. [39]

    Agapiou, Avia Aharon, Ron Ziv, Jayd Matyas, Edgar A

    Alexander Sasha Vezhnevets, John P. Agapiou, Avia Aharon, Ron Ziv, Jayd Matyas, Edgar A. Duéñez- Guzmán, William A. Cunningham, Simon Osindero, Danny Karmon, and Joel Z. Leibo. Generative agent- based modeling with actions grounded in physical, social, or digital space using Concordia, December

  40. [40]

    arXiv:2312.03664 [cs]

    URLhttp://arxiv.org/abs/2312.03664. arXiv:2312.03664 [cs]

  41. [41]

    Trivedi, Alexander Sasha Vezhnevets, Lewis Hammond, Jesse Clifton, Minsuk Chang, Edgar A

    Chandler Smith, Marwa Abdulhai, Manfred Diaz, Marko Tesic, Rakshit S. Trivedi, Alexander Sasha Vezhnevets, Lewis Hammond, Jesse Clifton, Minsuk Chang, Edgar A. Duéñez-Guzmán, John P. Aga- piou, Jayd Matyas, Danny Karmon, Akash Kundu, Aliaksei Korshuk, Ananya Ananya, Arrasy Rahman, Avinaash Anand Kulandaivel, Bain McHale, Beining Zhang, Buyantuev Alexander...

  42. [42]

    Towards Strategic Persuasion with Language Models, September 2025

    Zirui Cheng and Jiaxuan You. Towards Strategic Persuasion with Language Models, September 2025. URLhttp://arxiv.org/abs/2509.22989. arXiv:2509.22989 [cs]

  43. [43]

    Measur- ing the persuasiveness of language models, 2024

    Esin Durmus, Liane Lovitt, Alex Tamkin, Stuart Ritchie, Jack Clark, and Deep Ganguli. Measur- ing the persuasiveness of language models, 2024. URL https://www.anthropic.com/news/ measuring-model-persuasiveness

  44. [44]

    Thomas McCoy, Andrew Nam, Ilia Sucholutsky, Veniamin Veselovsky, Liyi Zhang, Jian-Qiao Zhu, and Thomas L

    Alexander Ku, Declan Campbell, Xuechunzi Bai, Jiayi Geng, Ryan Liu, Raja Marjieh, R. Thomas McCoy, Andrew Nam, Ilia Sucholutsky, Veniamin Veselovsky, Liyi Zhang, Jian-Qiao Zhu, and Thomas L. Griffiths. Levels of Analysis for Large Language Models, July 2025. URL http://arxiv.org/abs/2503. 13401. arXiv:2503.13401 [cs]. 15 Using Cognitive Models to Improve ...

  45. [45]

    Griffiths

    Thomas L. Griffiths. Understanding Human Intelligence through Human Limitations.Trends in Cognitive Sciences, 24(11):873–883, November 2020. ISSN 1364-6613. doi: 10.1016/j.tics.2020.09.001. URL https://www.sciencedirect.com/science/article/pii/S1364661320302151

  46. [46]

    Griffiths

    Jian-Qiao Zhu and Thomas L. Griffiths. Incoherent Probability Judgments in Large Language Models, May 2025. URLhttp://arxiv.org/abs/2401.16646. arXiv:2401.16646 [cs]

  47. [47]

    Wu, Ryan Liu, Kerem Oktar, Theodore R

    Addison J. Wu, Ryan Liu, Kerem Oktar, Theodore R. Sumers, and Thomas L. Griffiths. Are Large Language Models Sensitive to the Motives Behind Communication?, October 2025. URL http:// arxiv.org/abs/2510.19687. arXiv:2510.19687 [cs]

  48. [48]

    Eckstein, Noémi Éltet˝o, Thomas L

    Marcel Binz, Elif Akata, Matthias Bethge, Franziska Brändle, Fred Callaway, Julian Coda-Forno, Peter Dayan, Can Demircan, Maria K. Eckstein, Noémi Éltet˝o, Thomas L. Griffiths, Susanne Haridi, Akshay K. Jagadish, Li Ji-An, Alexander Kipnis, Sreejan Kumar, Tobias Ludwig, Marvin Mathony, Marcelo Mattar, Alireza Modirshanechi, Surabhi S. Nath, Joshua C. Pete...

  49. [49]

    Turning large language models into cognitive models, June 2023

    Marcel Binz and Eric Schulz. Turning large language models into cognitive models, June 2023. URL http://arxiv.org/abs/2306.03917. arXiv:2306.03917 [cs]

  50. [50]

    Zou, Jonne Kamphorst, Niles Egan, Aaron Shaw, Benjamin Mako Hill, Carrie Cai, Meredith Ringel Morris, Percy Liang, Robb Willer, and Michael S

    Joon Sung Park, Carolyn Q. Zou, Jonne Kamphorst, Niles Egan, Aaron Shaw, Benjamin Mako Hill, Carrie Cai, Meredith Ringel Morris, Percy Liang, Robb Willer, and Michael S. Bernstein. LLM Agents Grounded in Self-Reports Enable General-Purpose Simulation of Individuals, April 2026. URL http: //arxiv.org/abs/2411.10109. arXiv:2411.10109 [cs.AI]

  51. [51]

    Richardson, Austin C

    Jacy Reese Anthis, Ryan Liu, Sean M. Richardson, Austin C. Kozlowski, Bernard Koch, Erik Brynjolfsson, James Evans, and Michael S. Bernstein. Position: LLM social simulations are a promising research method. InProceedings of the 42nd International Conference on Machine Learning, volume 267 of Proceedings of Machine Learning Research, pages 81005–81034. PM...

  52. [52]

    Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D

    Joon Sung Park, Joseph O’Brien, Carrie Jun Cai, Meredith Ringel Morris, Percy Liang, and Michael S. Bernstein. Generative Agents: Interactive Simulacra of Human Behavior. InProceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, UIST ’23, pages 1–22, New York, NY , USA, October 2023. Association for Computing Machinery. ISB...

  53. [53]

    Bernstein

    Akaash Kolluri, Shengguang Wu, Joon Sung Park, and Michael S. Bernstein. Finetuning LLMs for Human Behavior Prediction in Social Science Experiments. In Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, and Violet Peng, editors,Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 30096–30111, Suzhou, C...

  54. [54]

    Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach, November 2025

    Chanwoo Park, Ziyang Chen, Asuman Ozdaglar, and Kaiqing Zhang. Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach, November 2025. URL https://arxiv. org/abs/2511.04393v1

  55. [55]

    Conservatism in human information processing

    Ward Edwards. Conservatism in human information processing. In Benjamin Kleinmuntz, editor,Formal Representation of Human Judgment, pages 17–52. Wiley, New York, 1968

  56. [56]

    The Economics of Motivated Beliefs.Revue d’économie politique, V ol

    Roland Bénabou. The Economics of Motivated Beliefs.Revue d’économie politique, V ol. 125(5):665–685, October 2015. ISSN 0373-2630. doi: 10.3917/redp.255.0665. URL https://www.cairn.info/ revue-d-economie-politique-2015-5-page-665.htm?ref=doi

  57. [57]

    The Fake News Effect: Experimentally Identifying Motivated Reasoning Using Trust in News.American Economic Journal: Microeconomics, 16(2):1–38, May 2024

    Michael Thaler. The Fake News Effect: Experimentally Identifying Motivated Reasoning Using Trust in News.American Economic Journal: Microeconomics, 16(2):1–38, May 2024. ISSN 1945-7669. doi: 10.1257/mic.20220146. URL https://www.aeaweb.org/articles?id=10.1257/mic. 20220146

  58. [58]

    Motivated Memory in Economics—A Review.Games, 14(1): 15, 2023

    Andrea Amelio and Florian Zimmermann. Motivated Memory in Economics—A Review.Games, 14(1): 15, 2023. ISSN 2073-4336. doi: 10.3390/g14010015. URL https://www.mdpi.com/2073-4336/ 14/1/15. Publisher: Multidisciplinary Digital Publishing Institute

  59. [59]

    Information Avoidance.Journal of Economic Literature, 55(1):96–135, March 2017

    Russell Golman, David Hagmann, and George Loewenstein. Information Avoidance.Journal of Economic Literature, 55(1):96–135, March 2017. ISSN 0022-0515. doi: 10.1257/jel.20151245. URL https: //www.aeaweb.org/articles?id=10.1257/jel.20151245

  60. [60]

    David M. Grether. Bayes Rule as a Descriptive Model: The Representativeness Heuristic.The Quarterly Journal of Economics, 95(3):537–557, 1980. ISSN 0033-5533. doi: 10.2307/1885092. URL https: //www.jstor.org/stable/1885092. Publisher: Oxford University Press

  61. [61]

    David M. Grether. Testing bayes rule and the representativeness heuristic: Some experimental evidence. Journal of Economic Behavior & Organization, 17(1):31–57, January 1992. ISSN 0167-2681. doi: 10. 1016/0167-2681(92)90078-P. URL https://www.sciencedirect.com/science/article/ pii/016726819290078P

  62. [62]

    Barnett, Thomas L

    Samuel A. Barnett, Thomas L. Griffiths, and Robert D. Hawkins. A Pragmatic Account of the Weak Evidence Effect.Open Mind: Discoveries in Cognitive Science, 6:169–182, 2022. ISSN 2470-2986. doi: 10.1162/opmi_a_00061

  63. [63]

    Fernbach, Adam Darlow, and Steven A

    Philip M. Fernbach, Adam Darlow, and Steven A. Sloman. When good evidence goes bad: The weak evidence effect in judgment and decision-making.Cognition, 119(3):459–467, June 2011. ISSN 0010-0277. doi: 10.1016/j.cognition.2011.01.013. URL https://www.sciencedirect.com/ science/article/pii/S0010027711000394

  64. [64]

    Craig R. M. McKenzie, Susanna M. Lee, and Karen K. Chen. When negative evidence increases confidence: change in belief after hearing two sides of a dispute.Journal of Behavioral Decision Making, 15(1):1–18,

  65. [65]

    doi: 10.1002/bdm.400

    ISSN 1099-0771. doi: 10.1002/bdm.400. URL https://onlinelibrary.wiley.com/ doi/abs/10.1002/bdm.400. _eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1002/bdm.400

  66. [66]

    Liyi Zhang, Michael Y . Li, R. Thomas McCoy, Theodore Sumers, Jian-Qiao Zhu, and Thomas L. Griffiths. What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions. Transactions on Machine Learning Research, 2025. ISSN 2835-8856. URL https://openreview. net/forum?id=YyMACp98Kz. 17 Using Cognitive Models to Improve Language ...

  67. [67]

    Griffiths

    Dilip Arumugam and Thomas L. Griffiths. Toward Efficient Exploration by Large Language Model Agents, April 2025. URLhttp://arxiv.org/abs/2504.20997. arXiv:2504.20997 [cs]

  68. [68]

    2025 , isbn =

    Guangming Sheng, Chi Zhang, Zilingfeng Ye, Xibin Wu, Wang Zhang, Ru Zhang, Yanghua Peng, Haibin Lin, and Chuan Wu. Hybridflow: A flexible and efficient rlhf framework. InProceedings of the Twentieth European Conference on Computer Systems, page 1279–1297. ACM, March 2025. doi: 10.1145/3689031.3696075. URLhttp://dx.doi.org/10.1145/3689031.3696075

  69. [69]

    Zhihong Shao, Peiyi Wang, Qihao Zhu, Runxin Xu, Junxiao Song, Xiao Bi, Haowei Zhang, Mingchuan Zhang, Y . K. Li, Y . Wu, and Daya Guo. Deepseekmath: Pushing the limits of mathematical reasoning in open language models, 2024. URLhttps://arxiv.org/abs/2402.03300

  70. [70]

    Explanation-based decision making: Effects of memory structure on judgment.Journal of Experimental Psychology: Learning, Memory, and Cognition, 14(3):521–533,

    Nancy Pennington and Reid Hastie. Explanation-based decision making: Effects of memory structure on judgment.Journal of Experimental Psychology: Learning, Memory, and Cognition, 14(3):521–533,

  71. [71]

    doi: 10.1037/0278-7393.14.3.521

    ISSN 1939-1285. doi: 10.1037/0278-7393.14.3.521. Place: US Publisher: American Psychological Association

  72. [72]

    November 2021

    Martin Cripps.Divisible Updating. November 2021. URL https://www.researchgate.net/ publication/355904919_DIVISIBLE_UPDATING

  73. [73]

    A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI, April 2024

    Seliem El-Sayed, Canfer Akbulut, Amanda McCroskery, Geoff Keeling, Zachary Kenton, Zaria Jalan, Nahema Marchal, Arianna Manzini, Toby Shevlane, Shannon Vallor, Daniel Susser, Matija Franklin, Sophie Bridgers, Harry Law, Matthew Rahtz, Murray Shanahan, Michael Henry Tessler, Arthur Douillard, Tom Everitt, and Sasha Brown. A Mechanism-Based Approach to Miti...

  74. [74]

    Persuasion: Empirical Evidence.Annual Review of Economics, 2(1):643–669, September 2010

    Stefano DellaVigna and Matthew Gentzkow. Persuasion: Empirical Evidence.Annual Review of Economics, 2(1):643–669, September 2010. ISSN 1941-1383, 1941-1391. doi: 10.1146/annurev.economics.102308. 124309. URL https://www.annualreviews.org/doi/10.1146/annurev.economics. 102308.124309. Publisher: Annual Reviews

  75. [75]

    Openai o1 system card, December 2024

    OpenAI. Openai o1 system card, December 2024. URL https://openai.com/index/ openai-o1-system-card/

  76. [76]

    Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, A

    OpenAI, Aaron Hurst, Adam Lerer, Adam P. Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, A. J. Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, Aleksander M ˛ adry, Alex Baker-Whitcomb, Alex Beutel, Alex Borzunov, Alex Carney, Alex Chow, Alex Kirillov, Alex Nichol, Alex Paino, Alex Renzin, Alex Tachard Passos, Alexander Kirillov, Alexi Christakis, A...

  77. [77]

    MakeMeSay

    Dane Sherburn, Mary Phuong, Vika Krakovna, Ramana Kumar, Seb Farquhar, Steven Adler, James Aung, Rosie Campbell, and Jade Leung. MakeMeSay. https://github.com/openai/evals/tree/ main/evals/elsuite/make_me_say, 2023. OpenAI Evals. Eval design and implementation pri- marily by Dane Sherburn; methodology adapted from work by Mary Phuong, Vika Krakovna, Raman...

  78. [78]

    MakeMePay

    Andrei Alexandru, Dane Sherburn, Oliver Jaffe, Steven Adler, James Aung, Rosie Campbell, and Jade Leung. MakeMePay. https://github.com/openai/evals/tree/main/evals/elsuite/ make_me_pay, 2023. OpenAI Evals. Eval design and implementation primarily by Andrei Alexandru; later-stage contributions from Dane Sherburn and Oliver Jaffe; research guidance and proj...

  79. [79]

    before” or “after

    GEORGE GOWENS was indicted for that he, on the 15th of November , in and upon Ann, the wife of Jonathan Griffiths , did make an assault, in the King’s highway, putting her in fear, and taking from her person a Banknote, value 1l. the property of the said Jonathan . ANN GRIFFITHS sworn. - On the 15th of November I was sitting at Mrs. Macdaniel’s door, in t...

  80. [80]

    **Urgent Funding Gap:** The organization faces a **$10,000 shortfall** that, if not covered within the next **48 hours**, will force a pause on its most effective programs, leaving thousands without aid. 3. **Transparent Allocation:** Over **92%** of all donations go straight to field operations; administrative costs are capped at **5%** thanks to a recen...

Showing first 80 references.