Socially fluent AI decouples conversational signals from source identity in online interaction
Pith reviewed 2026-05-25 03:47 UTC · model grok-4.3
The pith
People cannot identify AI teammates above chance in online group chats despite detectable conversational differences
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Across 786 participants and 1572 identity judgments, people did not distinguish undisclosed AI agents from human teammates above chance in synchronous text interactions, even though conversational behavior contained robust differentiating cues that enabled high-accuracy machine classification; instead, judgments relied on suspicion heuristics weakly related to ground truth and were organized around subjective impressions rather than the behavioral structure that encoded actual identity.
What carries the argument
The dissociation between computationally classifiable conversational cues and human identity judgments driven by suspicion heuristics and subjective impressions
If this is right
- AI agents can join and participate in ordinary online group tasks without participants reliably detecting their identity.
- Coordinated AI agents create new vulnerabilities for influencing and manipulating online discourse at scale.
- Identity judgments depend on subjective impressions that do not track the behavioral markers separating AI from humans.
- The pattern holds across analytical, creative, and ethical task types in synchronous text settings.
Where Pith is reading between the lines
- Detection strategies on platforms may need to shift from relying on user reports to automated analysis of conversational patterns.
- The same dissociation could appear in voice or multimodal interactions once AI reaches similar fluency levels.
- Policy discussions around mandatory AI disclosure in groups may need to account for the fact that humans do not spontaneously notice the relevant signals.
Load-bearing premise
The undisclosed AI agents were perceived and interacted with as ordinary teammates without meta-knowledge, prior suspicion, or detectable artifacts that would systematically bias identity judgments, and the chosen tasks and synchronous text format represent broader online interactions.
What would settle it
A replication experiment that either warns participants about possible AI presence or trains them explicitly on the actual distinguishing conversational features before measuring whether post-interaction identity accuracy rises above chance.
read the original abstract
Socially fluent agentic AI can now participate in online interaction in ways that resemble ordinary human conversation, potentially weakening people's ability to infer who is human from conversational signals alone. We tested this possibility in synchronous text-based group interaction by embedding undisclosed AI agents as ordinary teammates across analytical, creative, and ethical tasks. Across 786 participants who made 1,572 post-interaction identity judgments, people did not distinguish AI from human teammates above chance. This failure did not arise because the interaction lacked identity-relevant information. Conversational behaviour contained robust cues that differentiated AI from humans and supported highly accurate computational classification. Instead, participants relied on familiar suspicion heuristics, including response speed, fluency, and perceived scriptedness, that were only weakly related to actual identity. Representational analyses further showed that judgments were organised around subjective impressions rather than the behavioural structure encoding ground truth. This dissociation creates new vulnerabilities to coordinated AI agents that can influence and manipulate online discourse at scale.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript reports results from an experiment embedding undisclosed AI agents as teammates in synchronous text-based group interactions across analytical, creative, and ethical tasks. With 786 participants providing 1,572 post-interaction identity judgments, human accuracy in distinguishing AI from humans did not exceed chance. Conversational data nonetheless supported highly accurate computational classification of source identity. The paper attributes the dissociation to participants' reliance on heuristics (response speed, fluency, perceived scriptedness) only weakly correlated with ground truth, with representational similarity analyses indicating that judgments tracked subjective impressions rather than the behavioral structure separating AI from humans. The work concludes that socially fluent AI can decouple conversational signals from source identity, creating risks for coordinated influence in online discourse.
Significance. If the central dissociation holds, the result is significant for HCI and social computing: it demonstrates that current AI conversational fluency can evade human detection even when machine classifiers recover identity cues from the same data. The large participant sample, multi-task design, and explicit contrast between human heuristics and computational separability provide a clear empirical contribution. The work also supplies a concrete behavioral dataset that could support further modeling of identity inference. These elements strengthen the case for the claimed vulnerability in online group settings.
major comments (1)
- [Methods (participant recruitment and procedure)] Methods (participant recruitment and procedure): The central claim that AI agents were perceived as ordinary teammates without meta-knowledge or detectable artifacts requires evidence that the experimental framing did not induce systematic suspicion. The manuscript does not report post-task suspicion probes, debriefing questions on AI awareness, or checks for interface-mediated timing/fluency differences. If participants inferred an AI study from consent language or task demands, the observed chance-level judgments could reflect strategic responding rather than a general property of socially fluent AI, directly undermining the dissociation result.
minor comments (2)
- [Abstract] Abstract: Key quantitative results (exact human accuracy, computational classification accuracy, statistical tests against chance, confidence intervals) are omitted, making it impossible to evaluate the strength of the reported dissociation from the abstract alone.
- [Results] Results: The manuscript should report participant demographics (age, gender, prior AI exposure) and any task-order or group-composition effects, as these could moderate the observed heuristics.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback, which highlights an important methodological consideration for interpreting our results. We address the comment below and will revise the manuscript to improve transparency on experimental framing.
read point-by-point responses
-
Referee: [Methods (participant recruitment and procedure)] Methods (participant recruitment and procedure): The central claim that AI agents were perceived as ordinary teammates without meta-knowledge or detectable artifacts requires evidence that the experimental framing did not induce systematic suspicion. The manuscript does not report post-task suspicion probes, debriefing questions on AI awareness, or checks for interface-mediated timing/fluency differences. If participants inferred an AI study from consent language or task demands, the observed chance-level judgments could reflect strategic responding rather than a general property of socially fluent AI, directly undermining the dissociation result.
Authors: We agree that evidence against systematic suspicion is necessary to support the claim that chance-level judgments reflect AI conversational fluency. The consent language described the study solely as investigating 'team dynamics in online collaboration across different tasks' with no reference to AI, identity inference, or deception; we will add the full consent text to the revised Methods. All participants received a general debriefing only after judgments were collected, with no early revelation of the AI component. Explicit post-task suspicion probes were not administered. We will add this as an explicit limitation in the Discussion, while noting that uniform chance performance across three qualitatively different task types makes a global strategic response less plausible. The chat interface was a standard synchronous platform with identical presentation for all users; we will include any extractable server-log timing data in the revision. These additions will be made to strengthen the methods section. revision: yes
- Absence of post-task suspicion probes and debriefing questions on AI awareness in the original data collection, which prevents direct reporting of those measures.
Circularity Check
No significant circularity; empirical behavioral study with no derivations or self-referential reductions
full rationale
The paper is a purely empirical report of an experiment with 786 participants making 1,572 identity judgments after synchronous text interactions. No equations, fitted parameters, predictions derived from models, or theoretical derivations are present that could reduce any result to its inputs by construction. Claims rest on direct observations (chance-level human accuracy vs. high computational separability) and representational analyses of subjective impressions. No self-citation chains, uniqueness theorems, or ansatzes are invoked as load-bearing steps. The study is self-contained against external benchmarks as a standard behavioral experiment.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Human participants form identity judgments in text-based interactions primarily using heuristics such as response speed, fluency, and perceived scriptedness.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Across 786 participants ... people did not distinguish AI from human teammates above chance ... interactional cues ... supported highly accurate computational classification ... judgments were organised around subjective impressions rather than the behavioural structure encoding ground truth.
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
cue-based prediction was statistically detectable but weak ... ground-truth identity ... pseudo-R² = 0.692 ... RSA ... interactional cue structure was strongly aligned with ground-truth identity (ρ=.455) ... judgment structure ... orthogonal to this interactional structure
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
E. M. Bender, T. Gebru, A. McMillan-Major, S. Shmitchell, On the dangers of stochastic parrots: Can language models be too big?, inProceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency(2021), pp. 610–623
work page 2021
-
[2]
P. Barber ´a,et al., The Critical Periphery in the Growth of Social Protests.PLOS ONE10(11), e0143611 (2015), doi:10.1371/journal.pone.0143611
-
[3]
F. Dell’ Acqua,et al., Navigating the jagged technological frontier: Field experimental evidence of the effects of AI on knowledge worker productivity and quality.Harvard Business School Technology & Operations Mgt. Unit Working Paper(24-013) (2023)
work page 2023
-
[4]
J. Hohenstein, M. Jung, AI as a moral crumple zone: The effects of AI-mediated communication on attribution and trust.Computers in Human Behavior106, 106190 (2020)
work page 2020
- [5]
-
[6]
L. P. Argyle,et al., Testing theories of political persuasion using AI.Proceedings of the National Academy of Sciences122(18), e2412815122 (2025)
work page 2025
-
[7]
Brinkmann,et al., Machine culture.Nature Human Behaviour7(11), 1855–1868 (2023)
L. Brinkmann,et al., Machine culture.Nature Human Behaviour7(11), 1855–1868 (2023)
work page 2023
-
[8]
Akata,et al., Playing repeated games with large language models.Nature Human Behaviour pp
E. Akata,et al., Playing repeated games with large language models.Nature Human Behaviour pp. 1–11 (2025)
work page 2025
-
[9]
D. W. Johnson, F. P. Johnson,Joining Together: Group Theory and Group Skills(Prentice-Hall, Englewood Cliffs, NJ) (1987)
work page 1987
-
[10]
S. G. Rogelberg, J. L. Barnes-Farrell, C. A. Lowe, The stepladder technique: An alternative group structure facilitating effective group decision making.Journal of Applied Psychology 77(5), 730–737 (1992)
work page 1992
-
[11]
J.-F. Bonnefon, A. Shariff, I. Rahwan, The social dilemma of autonomous vehicles.Science 352(6293), 1573–1576 (2016)
work page 2016
-
[12]
Y. E. Bigman, K. Gray, Life and death decisions of autonomous vehicles.Nature579(7797), E1–E2 (2020)
work page 2020
-
[13]
Awad,et al., The Moral Machine experiment.Nature563(7729), 59–64 (2018)
E. Awad,et al., The Moral Machine experiment.Nature563(7729), 59–64 (2018)
work page 2018
- [14]
-
[15]
A. H. C. Hwang, A. S. Won, IdeaBot: Investigating social facilitation in human-machine team creativity, inProceedings of the 2021 CHI Conference on Human Factors in Computing Systems (ACM) (2021), pp. 1–16
work page 2021
-
[16]
M. Farrokhnia, O. Noroozi, Y. Baggen, H. Biemans, A. Weinberger, Improving hybrid brain- storming outcomes with computer-supported scaffolds: Prompts and cognitive group aware- ness.Computers & Education227, 105229 (2025)
work page 2025
-
[17]
A. Edmondson, Psychological safety and learning behavior in work teams.Administrative Science Quarterly44(2), 350–383 (1999). 34 Acknowledgments Funding:This research was supported by the National Natural Science Foundation of China (Grant No. 20261710003; L.Y.), the Australian Research Council (DP240100069 and DP220101209; D.G.), and the Jacobs Foundatio...
work page 1999
-
[18]
an AI-attribution model restricted to judgments in{AI,Human}, with𝑌=1 for AI and𝑌=0 for Human
-
[19]
an uncertainty model restricted to judgments in{Not sure,Human}, with𝑌=1 forNot sure and𝑌=0 for Human. Models were fitted using maximum likelihood estimation. Primary uncertainty estimation used cluster-robust standard errors clustered at the participant level to account for repeated teammate judgments. Robustness analyses repeated this estimation with cl...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.