pith. sign in

arxiv: 2606.11058 · v1 · pith:HTPVSTJRnew · submitted 2026-06-09 · ⚛️ physics.soc-ph

The social consequences of AI delegation

Pith reviewed 2026-06-27 10:49 UTC · model grok-4.3

classification ⚛️ physics.soc-ph
keywords AI delegationlarge language modelssocial actorssocial normscollective dynamicshuman-AI interactiondecision makingbehavioral research
0
0 comments X

The pith

Humans are beginning to delegate their own deliberation to LLMs, turning these systems into functional social actors that shape decisions, norms, and collective dynamics.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shifts focus from using LLMs as stand-ins for human subjects in experiments to the opposite direction: people turning to LLMs for guidance in health, law, finance, education, and personal decisions instead of or alongside human experts or their own judgment. This practice, even when evidence of its prevalence is still uneven, creates systems whose outputs can influence individual choices and wider social patterns. The central proposal is therefore a dedicated research program that studies LLMs as consequential social actors in a functional sense. A reader would care because such delegation could change how norms form, how groups reach conclusions, and how accountability works when AI participates in deliberation.

Core claim

The more consequential question is not simply whether researchers should use LLMs as human surrogates, but whether - and under what conditions - humans are beginning to use LLMs as surrogates for their own deliberation. Across domains including health, law, finance, education, and personal guidance, increasing numbers of people consult generative AI systems before, alongside, or instead of human experts, peers, or independent judgment. Although evidence for actual delegation remains uneven, this uncertainty makes the phenomenon an urgent social-scientific object of study. We argue for a research programme that treats LLMs as consequential social actors in a functional sense: systems whose ou

What carries the argument

LLMs treated as consequential social actors in a functional sense, where the mechanism is humans consulting these systems before, alongside, or instead of human experts or independent judgment, allowing AI outputs to influence decisions and norms.

If this is right

  • Research attention must move beyond laboratory substitution of LLMs for humans to field studies of real-world delegation in health, law, finance, education, and personal guidance.
  • Social norms around expertise and decision-making are expected to shift as AI outputs become routine inputs before or instead of human consultation.
  • Collective dynamics such as opinion formation, consensus building, and group behavior may change when many individuals draw on the same LLM sources.
  • The uneven nature of current evidence itself becomes a reason to prioritize systematic observation rather than wait for clearer patterns to emerge.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Frameworks developed for studying algorithmic influence on social media could be adapted to track how LLM outputs propagate through repeated human consultations.
  • Questions of responsibility for downstream effects, such as biased or erroneous advice, may require new legal or ethical categories that treat the AI output as an intermediate social actor.
  • Experimental designs could compare decision quality and diversity when groups deliberate with versus without access to LLM surrogates.
  • The same delegation pattern might appear in other AI systems beyond language models, suggesting the research program could generalize to other generative tools.

Load-bearing premise

The use of LLMs as substitutes for human deliberation is happening at a scale large enough to produce measurable effects on social norms and collective dynamics, even though current evidence for that scale is uneven.

What would settle it

Large-scale surveys or usage logs across health, law, finance, and education showing that most people still rely on human experts or independent judgment rather than LLM outputs for consequential decisions.

Figures

Figures reproduced from arXiv: 2606.11058 by Henrique Ferraz de Arruda, Yamir Moreno.

Figure 1
Figure 1. Figure 1: Conceptual overview of the social consequences of AI delegation. The Perspective argues [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
read the original abstract

A substantial body of recent work has debated whether large language models (LLMs) can serve as substitutes for human participants in behavioural research. This debate, however, captures only one direction of a rapidly changing relationship. The more consequential question is not simply whether researchers should use LLMs as human surrogates, but whether - and under what conditions - humans are beginning to use LLMs as surrogates for their own deliberation. Across domains including health, law, finance, education, and personal guidance, increasing numbers of people consult generative AI systems before, alongside, or instead of human experts, peers, or independent judgment. Although evidence for actual delegation remains uneven, this uncertainty makes the phenomenon an urgent social-scientific object of study. We argue for a research programme that treats LLMs as consequential social actors in a functional sense: systems whose outputs shape human decisions, social norms, and collective dynamics.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The manuscript argues that the ongoing debate over using LLMs as substitutes for human participants in behavioral research addresses only one direction of the human-AI relationship. The more consequential direction, it claims, is humans delegating their own deliberation to LLMs across domains such as health, law, finance, education, and personal guidance. Although evidence for such delegation remains uneven, the authors treat this uncertainty as motivation for a dedicated research program that treats LLMs as consequential social actors in a functional sense—systems whose outputs shape human decisions, social norms, and collective dynamics.

Significance. If the described delegation phenomenon occurs at meaningful scale, the proposed research program could open productive lines of inquiry at the intersection of social science and AI studies by shifting attention from LLMs as research tools to LLMs as influences on real-world social processes. The manuscript's honesty about the current data gap is a strength, as it frames the call for study around an acknowledged uncertainty rather than overstated claims. As a position statement without new empirical results or formal models, its value lies in its potential to stimulate targeted investigations rather than in any tested proposition.

minor comments (2)
  1. The abstract and framing would benefit from one or two concrete examples of observed or hypothesized delegation (e.g., specific use cases in health or finance) to illustrate the functional social-actor claim without requiring new data.
  2. A short section outlining example research questions or methodological approaches for the proposed program would make the call more actionable for readers while remaining consistent with the position-piece format.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their constructive and positive assessment of the manuscript. We appreciate the recognition that the paper's value lies in framing a call for study around an acknowledged uncertainty rather than overstated claims, and that this honesty is a strength. The referee's summary accurately captures the manuscript's central argument regarding the shift from LLMs as research tools to LLMs as influences on real-world social processes.

Circularity Check

0 steps flagged

No circularity: position paper with no derivations or fitted inputs

full rationale

The paper is a conceptual position statement advocating a research program on LLMs as social actors. It contains no equations, parameters, derivations, or empirical fits. The abstract explicitly flags that 'evidence for actual delegation remains uneven' and uses this uncertainty to motivate study rather than asserting a quantified claim that could reduce to its own inputs. No self-citation chains, self-definitional loops, or renamed known results appear in the provided text. The central claim is functional and argumentative, remaining self-contained without internal reduction to fitted quantities or prior author work.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The document contains no quantitative derivations, fitted parameters, or postulated entities; it is a perspective statement whose central move is definitional (treating LLMs as social actors) rather than evidentiary.

pith-pipeline@v0.9.1-grok · 5674 in / 1088 out tokens · 18650 ms · 2026-06-27T10:49:20.426347+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

40 extracted references · 2 canonical work pages

  1. [1]

    I., and Kalai, A

    Aher, G., Arriaga, R. I., and Kalai, A. T. (2023). Using large language models to simulate multiple humans and replicate human subject studies. In Proceedings of the 40th International Conference on Machine Learning, pages 337--371

  2. [2]

    R., Shah, J

    Anderson, B. R., Shah, J. H., and Kreminski, M. (2024). Homogenization effects of large language models on human creative ideation. In Proceedings of the 16th Conference on Creativity & Cognition (C&C '24), pages 413--425. Association for Computing Machinery

  3. [3]

    P., Busby, E

    Argyle, L. P., Busby, E. C., Fulda, N., Gubler, J. R., Rytting, C., and Wingate, D. (2023). Out of one, many: Using language models to simulate human samples. Political Analysis, 31(3):337--351

  4. [4]

    J., Lin, M

    Ayo-Ajibola, O., Davis, R. J., Lin, M. E., Riddell, J., and Kravitz, R. L. (2024). Characterizing the adoption and experiences of users of artificial intelligence--generated health information in the United States : Cross-sectional questionnaire study. Journal of Medical Internet Research, 26:e55138

  5. [5]

    M., Gebru, T., McMillan-Major, A., and Shmitchell, S

    Bender, E. M., Gebru, T., McMillan-Major, A., and Shmitchell, S. (2021). On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, pages 610--623

  6. [6]

    A., Aditi, E., Altman, R., Arora, S., Artetxe, M., et al

    Bommasani, R., Hudson, D. A., Aditi, E., Altman, R., Arora, S., Artetxe, M., et al. (2021). On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258

  7. [7]

    F., Nussberger, A.-M., Czaplicka, A., Acerbi, A., Griffiths, T

    Brinkmann, L., Baumann, F., Bonnefon, J.-F., Derex, M., M\" u ller, T. F., Nussberger, A.-M., Czaplicka, A., Acerbi, A., Griffiths, T. L., Henrich, J., Leibo, J. Z., McElreath, R., Oudeyer, P.-Y., Stray, J., and Rahwan, I. (2023). Machine culture. Nature Human Behaviour, 7(11):1855--1868

  8. [8]

    Carr, N. (2011). The Shallows: What the Internet Is Doing to Our Brains. W. W. Norton

  9. [9]

    H., Monahan, A

    Choi, J. H., Monahan, A. B., and Schwarcz, D. (2024). Lawyering in the age of artificial intelligence. Minnesota Law Review, 109(1):147--218

  10. [10]

    and Chalmers, D

    Clark, A. and Chalmers, D. (1998). The extended mind. Analysis, 58(1):7--19

  11. [11]

    Carlini, N., Chien, S., Nasr, M., Song, S., Terzis, A., and Tramer, F

    Desai, A. P., Mallya, G. S., Luqman, M., Ravi, T., Kota, N., and Yadav, P. (2024). Opportunities and challenges of generative- AI in finance. In 2024 IEEE International Conference on Big Data (BigData), pages 4913--4920. doi:10.1109/BigData62323.2024.10825658

  12. [12]

    J., Simmons, J

    Dietvorst, B. J., Simmons, J. P., and Massey, C. (2015). Algorithm aversion: People erroneously avoid algorithms after seeing them err. Journal of Experimental Psychology: General, 144(1):114--126

  13. [13]

    Doshi, A. R. and Hauser, O. P. (2024). Generative AI enhances individual creativity but reduces the collective diversity of novel content. Science Advances, 10(28):eadn5290

  14. [14]

    Durmus, E., Nguyen, K., Liao, T. I., Schiefer, N., Askell, A., Bakhtin, A., Chen, C., Hatfield-Dodds, Z., Hernandez, D., Joseph, N., Lovitt, L., McCandlish, S., Sikder, O., Tamkin, A., Thamkul, J., Kaplan, J., Clark, J., and Ganguli, D. (2024). Towards measuring the representation of subjective global opinions in language models. arXiv preprint arXiv:2306.16388

  15. [15]

    Eisenstein, E. L. (1980). The Printing Press as an Agent of Change. Cambridge University Press

  16. [16]

    Geng, M., Trotta, R., Nanni, F., and Bizer, C. (2025). The impact of large language models in academia: From writing to speaking. In Findings of the Association for Computational Linguistics: ACL 2025, pages 19303--19319

  17. [17]

    a m\" a l\

    H\" a m\" a l\" a inen, P., Tavast, M., and Kunnari, A. (2023). Evaluating large language models in generating synthetic HCI research data: a case study. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, pages 1--19

  18. [18]

    How Gen Z uses gen AI --- and why it worries them

    Harvard Business Review/Gallup/Walton Family Foundation (2026). How Gen Z uses gen AI --- and why it worries them. (https://hbr.org/2026/01/how-gen-z-uses-gen-ai-and-why-it-worries-them) Harvard Business Review, January 2026

  19. [19]

    Hutchins, E. (1995). Cognition in the Wild. MIT Press

  20. [20]

    and Raghavan, M

    Kleinberg, J. and Raghavan, M. (2021). Algorithmic monoculture and social welfare. Proceedings of the National Academy of Sciences, 118(22):e2018340118

  21. [21]

    and Thelwall, M

    Kousha, K. and Thelwall, M. (2026). How much are LLMs changing the language of academic papers after ChatGPT? A multi-database and full text analysis. Scientometrics

  22. [22]

    Laestadius, L., Bishop, A., Gonzalez, M., Illen c \' i k, D., and Campos-Castillo, C. (2024). Too human and not human enough: A grounded theory analysis of mental health harms from intimate human--AI relationships. New Media & Society, 26(10):5923--5941

  23. [23]

    M., Minson, J

    Logg, J. M., Minson, J. A., and Moore, D. A. (2019). Algorithm appreciation: People prefer algorithmic to human judgment. Organizational Behavior and Human Decision Processes, 151:90--103

  24. [24]

    Maples, B., Cerit, M., Vishwanath, A., and Pea, R. (2024). Loneliness and suicide mitigation for students using GPT-3-enabled chatbots. npj Mental Health Research, 3:4

  25. [25]

    Page, S. E. (2007). The Difference: How the Power of Diversity Creates Better Groups, Firms, Schools, and Societies. Princeton University Press

  26. [26]

    and He, H

    Padmakumar, V. and He, H. (2024). Does writing with language models reduce content diversity? In The Twelfth International Conference on Learning Representations (ICLR)

  27. [27]

    and Riley, V

    Parasuraman, R. and Riley, V. (1997). Humans and automation: Use, misuse, disuse, abuse. Human Factors, 39(2):230--253

  28. [28]

    Parsons, T. (1937). The Structure of Social Action. McGraw-Hill

  29. [29]

    Teens, social media and AI chatbots 2025 (https://www.pewresearch.org/internet/2025/12/09/teens-social-media-and-ai-chatbots-2025/)

    Pew Research Center (2025). Teens, social media and AI chatbots 2025 (https://www.pewresearch.org/internet/2025/12/09/teens-social-media-and-ai-chatbots-2025/). Technical report, Pew Research Center, December 2025

  30. [30]

    and Nass, C

    Reeves, B. and Nass, C. (1996). The Media Equation: How People Treat Computers, Television, and New Media Like Real People and Places. Cambridge University Press

  31. [31]

    H., Gallotti, R., and West, R

    Salvi, F., Ribeiro, M. H., Gallotti, R., and West, R. (2025). On the conversational persuasiveness of GPT-4 Nature Human Behaviour, 9:1645--1653

  32. [32]

    Santurkar, S., Durmus, E., Ladd, F., Lee, E., Liang, P., and Hashimoto, T. (2023). Whose opinions do language models reflect? In Proceedings of the 40th International Conference on Machine Learning, pages 29971--30004

  33. [33]

    Seabrooke, T., Schneiders, E., Dowthwaite, L., Krook, J., Leesakul, N., Clos, J., Maior, H., and Fischer, J. (2024). A survey of lay people's willingness to generate legal advice using large language models. In Proceedings of the Second International Symposium on Trustworthy Autonomous Systems (TAS '24). Association for Computing Machinery

  34. [34]

    and Choudhury, A

    Shahsavar, Y. and Choudhury, A. (2023). User intentions to use ChatGPT for self-diagnosis and health-related purposes: Cross-sectional survey study. JMIR Human Factors, 10:e47564

  35. [35]

    Shumailov, I., Shumaylov, Z., Zhao, Y., Gal, Y., Papernot, N., and Anderson, R. (2024). AI models collapse when trained on recursively generated data. Nature, 631:755--759

  36. [36]

    Slocum, S., Parker-Sartori, A., and Hadfield-Menell, D. (2025). Diverse preference learning for capabilities and alignment. In The Thirteenth International Conference on Learning Representations

  37. [37]

    Sparrow, B., Liu, J., and Wegner, D. M. (2011). Google effects on memory: Cognitive consequences of having information at our fingertips. Science, 333(6043):776--778

  38. [38]

    S., Shergill, S

    Teixeira, A. S., Shergill, S. S., Laban, G. (2026) Human–AI interactions reshape the self and our social networks. Nat Mach Intell. https://doi.org/10.1038/s42256-026-01248-2

  39. [39]

    Weidinger, L., Mellor, J., Rauh, M., Griffin, C., Uesato, J., Huang, P.-S., Cheng, M., Glaese, M., Balle, B., Kasirzadeh, A., et al. (2021). Ethical and social risks of harm from language models. arXiv preprint arXiv:2112.04359

  40. [40]

    Yakura, H., Lopez-Lopez, E., Brinkmann, L., Serna, I., Gupta, P., and Rahwan, I. (2024). Empirical evidence of large language model's influence on human spoken communication. arXiv preprint arXiv:2409.01754