Socially Fluent, Socially Awkward: Artificial Intelligence Relational Talk Backfires in Commercial Interactions
Pith reviewed 2026-05-10 16:13 UTC · model grok-4.3
The pith
AI relational talk in commercial interactions reduces customer satisfaction by violating expectations and creating awkwardness, though goal-relevant talk weakens the harm.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Across four experiments, the authors find a negative main effect of AI relational talk on satisfaction, mediated by expectancy violation and perceived interaction awkwardness. Goal-relevant relational talk attenuates this effect. The work challenges the assumption that increased social fluency will improve satisfaction and highlights the complexity of integrating social features into AI systems, showing that perceived awkwardness in AI-led commercial interactions can elicit negative responses even without real social repercussions.
What carries the argument
Relational talk, the informal and non-obligatory social communication embedded in transactional exchanges, which when used by AI triggers expectancy violation and perceived interaction awkwardness that lower satisfaction.
If this is right
- Customer satisfaction drops when AI uses relational talk in commercial settings.
- The satisfaction drop is explained by expectancy violation and perceived interaction awkwardness.
- Making relational talk goal-relevant reduces the negative impact on satisfaction.
- Perceived awkwardness functions as a barrier to effective human-AI interaction even without actual social consequences.
- Increased social fluency in AI does not necessarily raise satisfaction in commercial contexts.
Where Pith is reading between the lines
- AI design for transactions may need to prioritize task focus over casual social elements unless those elements tie directly to the user's goal.
- The pattern could appear in non-commercial AI settings such as education or healthcare where user expectations about AI behavior also differ from human norms.
- Longitudinal studies with repeated real-world AI use could test whether familiarity reduces the initial awkwardness effect.
Load-bearing premise
The experimental manipulations of relational talk in simulated AI commercial interactions accurately reflect real-world consumer responses and that the observed effects generalize beyond the study settings.
What would settle it
An experiment in which real customers interact with a deployed commercial AI system that includes relational talk and then report increased satisfaction or no rise in awkwardness would falsify the central claim.
read the original abstract
Advancements in Artificial Intelligence (AI) technologies' social fluency are being integrated into commercial interactions. As tools such as OpenAI's assistant are integrated into platforms such as Shopify, Klarna, and Visa, understanding consumer responses to AI social features become essential. One such feature is relational talk, an informal and non-obligatory social communication embedded in transactional exchanges. Across four experiments, we find: 1) a negative main effect of AI relational talk on satisfaction, mediated by expectancy violation and perceived interaction awkwardness, and 2) goal-relevant relational talk to attenuate this effect. This paper extends the literature by challenging the assumption that increased social fluency will improve satisfaction, and highlights the complexity of integrating social features into AI systems. It also identifies awkwardness as a key emotional response and barrier to effective human-AI interaction, showing that even in the absence of real social repercussions, perceived awkwardness in AI-led commercial interactions can elicit negative responses.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript examines consumer responses to 'relational talk' (informal, non-obligatory social communication) by AI agents in commercial interactions. Across four experiments, it reports a negative main effect of such talk on satisfaction, mediated by expectancy violation and perceived interaction awkwardness, and shows that making the relational talk goal-relevant attenuates the negative effect. The work challenges the assumption that greater social fluency in AI improves satisfaction and identifies awkwardness as a key barrier in human-AI commercial exchanges.
Significance. If the empirical results hold after fuller reporting and validation, the paper offers a meaningful contribution to HCI and consumer behavior research by providing evidence that social features in AI can backfire in transactional settings. It extends prior work on human-AI interaction by documenting a mediation pathway through expectancy violation and awkwardness, and by identifying a practical boundary condition (goal relevance). This has implications for AI design in platforms such as customer service or e-commerce assistants, where adding social elements is often assumed to be beneficial.
major comments (2)
- [Abstract and Methods] Abstract and implied Methods sections: The abstract states that four experiments support the negative main effect and mediation, but the manuscript does not report sample sizes, participant demographics, exact manipulation wording or stimuli, control conditions, manipulation checks, full statistical results (means, SDs, p-values, effect sizes, or mediation model details), or power analyses. These omissions are load-bearing because the central claims rest entirely on the experimental evidence; without them the findings cannot be evaluated for robustness or replicability.
- [Discussion/Limitations] Discussion or Limitations: The experiments appear to rely on simulated, vignette-based, or scripted interactions. The manuscript should explicitly test or discuss whether the observed effects on awkwardness and satisfaction persist in field settings with real stakes, repeated interactions, or actual commercial consequences, as demand characteristics or the artificial context could inflate perceived awkwardness independently of the relational talk manipulation.
minor comments (1)
- [Abstract] The abstract would benefit from a brief statement of key sample characteristics or effect sizes to give readers immediate context for the strength of the reported effects.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback, which has helped us strengthen the transparency and scope of our manuscript. We address each major comment below and have made corresponding revisions to improve the reporting of experimental details and the discussion of limitations.
read point-by-point responses
-
Referee: [Abstract and Methods] Abstract and implied Methods sections: The abstract states that four experiments support the negative main effect and mediation, but the manuscript does not report sample sizes, participant demographics, exact manipulation wording or stimuli, control conditions, manipulation checks, full statistical results (means, SDs, p-values, effect sizes, or mediation model details), or power analyses. These omissions are load-bearing because the central claims rest entirely on the experimental evidence; without them the findings cannot be evaluated for robustness or replicability.
Authors: We agree that comprehensive reporting is critical for assessing robustness and replicability. The original manuscript prioritized conceptual flow in the main text, with some methodological details placed in appendices or supplementary materials. In the revised version, we have expanded the Methods and Results sections to include all requested information: exact sample sizes per experiment, participant demographics, verbatim manipulation wordings and stimuli, control condition descriptions, manipulation check results, full statistical outputs (means, SDs, p-values, effect sizes), detailed mediation analyses with model specifications and confidence intervals, and power analyses for each study. These additions allow readers to fully evaluate the evidence without altering the core findings. revision: yes
-
Referee: [Discussion/Limitations] Discussion or Limitations: The experiments appear to rely on simulated, vignette-based, or scripted interactions. The manuscript should explicitly test or discuss whether the observed effects on awkwardness and satisfaction persist in field settings with real stakes, repeated interactions, or actual commercial consequences, as demand characteristics or the artificial context could inflate perceived awkwardness independently of the relational talk manipulation.
Authors: We acknowledge that the four experiments used controlled vignette-based and scripted scenarios, which is a standard approach in initial HCI and consumer research to isolate mechanisms like expectancy violation and awkwardness. We have revised the Discussion and Limitations sections to explicitly address this point, noting the potential influence of demand characteristics and artificial context. We discuss that while real-stakes or repeated commercial interactions may moderate effect sizes, the identified pathways (expectancy violation leading to awkwardness) are rooted in social norms and likely generalize. We outline specific directions for future field studies with actual commercial consequences and repeated interactions to test boundary conditions, without claiming the current results fully substitute for such data. revision: yes
Circularity Check
No significant circularity: empirical experimental study with no derivation chain
full rationale
The paper is an empirical study reporting results from four experiments on consumer responses to AI relational talk. It presents no mathematical derivations, equations, fitted parameters, or first-principles claims that could reduce to inputs by construction. Claims rest on observed main effects, mediation pathways, and moderation from experimental data, with no self-definitional loops, self-citation load-bearing for core premises, or renaming of known results as novel derivations. The derivation chain is absent, making the work self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Experimental manipulations of relational talk in AI responses validly represent social features in commercial AI interactions.
- domain assumption Consumer satisfaction and awkwardness can be reliably measured via self-report in simulated commercial scenarios.
Reference graph
Works this paper leans on
-
[1]
Ameen, N., Tarhini, A., Reppel, A., & Anand, A. (2021). Customer experiences in the age of artificial intelligence. Computers in Human Behavior, 114, 106548. https://doi.org/10.1016/j.chb.2020.106548 Babel, F., Kraus, J., Miller, L., Kraus, M., Wagner, N., Minker, W., & Baumann, M. (2021). Small talk with a robot? The impact of dialog content, talk initia...
-
[2]
https://doi.org/10.1007/s11747-019-00696-0 Félix-Brasdefer, J. C. (2015). The language of service encounters: a pragmatic-discursive approach. Cambridge University Press. https://doi.org/10.1017/CBO9781139565431 Goffman, E. (2017). Interaction Ritual: essays in face-to-face behavior. Routledge. https://doi.org/10.4324/9780203788387 Gremler, D. D., & Gwinn...
-
[3]
I would be happy with the interaction I got from this AI marketing representative I would be satisfied with my decision to interact with this AI marketing representative I would think I did the right thing when I interacted with this AI marketing representative Expectancy violation (1 - “Strongly disagree”, 7 - “Strongly agree”) (Yang & Aggarwal, 2019; Ka...
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.