pith. sign in

arxiv: 2402.04470 · v5 · pith:UDUQETTBnew · submitted 2024-02-06 · 💻 cs.CY

Six Fallacies in Substituting Large Language Models for Human Participants

classification 💻 cs.CY
keywords humanllmsfallaciesintelligencelanguagemodelsparticipantsanalysis
0
0 comments X
read the original abstract

Can AI systems like large language models (LLMs) replace human participants in behavioral and psychological research? Here I critically evaluate the "replacement" perspective and identify six interpretive fallacies that undermine its validity. These fallacies are: (1) equating token prediction with human intelligence, (2) treating LLMs as the average human, (3) interpreting alignment as explanation, (4) anthropomorphizing AI systems, (5) essentializing identities, and (6) substituting model data for human evidence. Each fallacy represents a potential misunderstanding about what LLMs are and what they can tell us about human cognition. The analysis distinguishes levels of similarity between LLMs and humans, particularly functional equivalence (outputs) versus mechanistic equivalence (processes), while highlighting both technical limitations (addressable through engineering) and conceptual limitations (arising from fundamental differences between statistical and biological intelligence). For each fallacy, specific safeguards are provided to guide responsible research practices. Ultimately, the analysis supports conceptualizing LLMs as pragmatic simulation tools--useful for role-play, rapid hypothesis testing, and computational modeling (provided their outputs are validated against human data)--rather than as replacements for human participants. This framework enables researchers to leverage language models productively while respecting the fundamental differences between machine intelligence and human thought.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. NetworkGames: Simulating Cooperation in Network Games with Personality-driven LLM Agents

    physics.soc-ph 2025-11 unverdicted novelty 5.0

    Simulations show that cooperative outcomes in network games with personality-driven LLM agents depend on both network connectivity and the placement of pro-social personalities, not just pairwise interaction preferences.