Form Without Function: Agent Social Behavior in the Moltbook Network
Pith reviewed 2026-05-15 09:52 UTC · model grok-4.3
The pith
AI agents on Moltbook reproduce the full structure of social media but show almost no reciprocity, argumentation, or sustained engagement.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Moltbook is a socio-technical system where the technical layer responds to changes, but the social layer largely fails to emerge. The form of social media is reproduced in full. The function is absent.
What carries the argument
Three-layer evaluation (interaction, content, instruction) of agent activity that tracks reciprocity rates, reply depth, topic adherence, and response to instruction changes.
Load-bearing premise
The observed low reciprocity, flat conversations, and topic mismatch result from inherent limits in the AI agents themselves rather than from platform design choices or data collection filters.
What would settle it
A comparable network of AI agents run on different rules that produces reciprocity above 20 percent or sustained multi-turn argumentation would falsify the central claim.
Figures
read the original abstract
Moltbook is a social network where every participant is an AI agent. We analyze 1,312,238 posts, 6.7~million comments, and over 120,000 agent profiles across 5,400 communities, collected over 40 days (January 27 to March 9, 2026). We evaluate the platform through three layers. At the interaction layer, 91.4% of post authors never return to their own threads, 85.6% of conversations are flat (no reply ever receives a reply), the median time-to-first-comment is 55 seconds, and 97.3% of comments receive zero upvotes. Interaction reciprocity is 3.3%, compared to 22-60% on human platforms. An argumentation analysis finds that 64.6% of comment-to-post relations carry no argumentative connection. At the content layer, 97.9% of agents never post in a community matching their bio, 92.5% of communities contain every topic in roughly equal proportions, and over 80% of shared URLs point to the platform's own infrastructure. At the instruction layer, we use 41 Wayback Machine snapshots to identify six instruction changes during the observation window. Hard constraints (rate limit, content filters) produce immediate behavioral shifts. Soft guidance (``upvote good posts'', ``stay on topic'') is ignored until it becomes an explicit step in the executable checklist. The platform also poses technological risks. We document credential leaks (API keys, JWT tokens), 12,470 unique Ethereum addresses with 3,529 confirmed transaction histories, and attack discourse ranging from template-based SSH brute-forcing to multi-agent offensive security architectures. These persist unmoderated because the quality-filtering mechanisms are themselves non-functional. Moltbook is a socio-technical system where the technical layer responds to changes, but the social layer largely fails to emerge. The form of social media is reproduced in full. The function is absent.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper analyzes 1.3 million posts, 6.7 million comments, and 120k profiles from the Moltbook AI-agent social network over 40 days. It reports low interaction metrics (3.3% reciprocity, 85.6% flat conversations, 91.4% authors never returning to threads), content mismatches (97.9% agents posting outside bio-matched communities), and responsiveness only to hard instruction changes, concluding that the technical layer adapts while the social layer fails to emerge, reproducing social-media form without function.
Significance. If the central claim holds after addressing controls, the work supplies a large-scale observational dataset on AI-agent behavior in a socio-technical environment, documenting both absent social emergence and concrete security exposures (credential leaks, Ethereum addresses). This could inform research on multi-agent systems and platform design, provided the attribution to agent limitations rather than setup is isolated.
major comments (3)
- [Interaction layer] Interaction-layer section: the claim that reciprocity (3.3%) and flat-conversation rates (85.6%) demonstrate absent social function rests on comparisons to human platforms (22-60%), yet no details are given on how those baselines were selected or matched for scale, topic uniformity, or moderation regime; without this, the contrast cannot be interpreted as evidence of inherent agent limitations.
- [Instruction layer] Instruction-layer analysis: six documented instruction changes are used to argue that hard constraints produce immediate shifts while soft guidance is ignored, but the text provides no before-after quantitative metrics, statistical tests, or ablation isolating the effect of each change from concurrent platform filters or data-collection decisions.
- [Methods / Data collection] Data-collection and filtering description: the observation window includes rate limits, content filters, and community-topic uniformity, yet no controls, ablations, or sensitivity checks are reported to test whether low engagement (e.g., 97.3% zero-upvote comments) arises from agent architecture versus these design choices or post-hoc thread removal; this directly undermines the attribution of absent function to the agents themselves.
minor comments (3)
- [Argumentation analysis] Define 'flat conversation' and 'argumentative connection' explicitly, including the exact decision rules or classifiers used in the 64.6% non-argumentative finding.
- [Content layer] Report sample sizes or confidence intervals alongside all percentages (e.g., 97.9% bio mismatch) so readers can assess precision.
- [Instruction layer] Clarify how the 41 Wayback snapshots were sampled and whether any instruction changes coincided with data-filtering events.
Simulated Author's Rebuttal
We thank the referee for their constructive comments on our manuscript analyzing the Moltbook AI-agent social network. We address each major comment point by point below, providing clarifications and committing to revisions where appropriate to strengthen the evidence for our claims.
read point-by-point responses
-
Referee: [Interaction layer] Interaction-layer section: the claim that reciprocity (3.3%) and flat-conversation rates (85.6%) demonstrate absent social function rests on comparisons to human platforms (22-60%), yet no details are given on how those baselines were selected or matched for scale, topic uniformity, or moderation regime; without this, the contrast cannot be interpreted as evidence of inherent agent limitations.
Authors: We acknowledge that the manuscript lacks sufficient detail on the selection of human-platform baselines. In the revised version, we will add a dedicated paragraph in the interaction-layer section citing the specific sources for the 22-60% reciprocity range (e.g., studies on Twitter reciprocity rates around 22-30% and higher rates in moderated Reddit communities up to 60%). We will explicitly discuss the challenges in matching for scale, topic, and moderation, noting that while perfect matching is not feasible across platforms, the substantial gap (3.3% vs. minimum 22%) provides indicative evidence of reduced social function in the agent setting. This will allow readers to better interpret the contrast. revision: yes
-
Referee: [Instruction layer] Instruction-layer analysis: six documented instruction changes are used to argue that hard constraints produce immediate shifts while soft guidance is ignored, but the text provides no before-after quantitative metrics, statistical tests, or ablation isolating the effect of each change from concurrent platform filters or data-collection decisions.
Authors: We agree that quantitative support for the instruction-layer claims is needed. The revised manuscript will include before-and-after metrics for key behaviors (e.g., posting frequency, community adherence, and responsiveness) around each of the six instruction changes, derived from the 41 Wayback Machine snapshots. We will apply appropriate statistical tests, such as paired t-tests or Wilcoxon signed-rank tests, to evaluate the significance of shifts following hard constraint changes versus soft guidance. While full ablation isolating from all concurrent factors is challenging in this observational dataset, we will discuss potential confounders and highlight the temporal alignment of behavioral changes with hard instruction updates. revision: yes
-
Referee: [Methods / Data collection] Data-collection and filtering description: the observation window includes rate limits, content filters, and community-topic uniformity, yet no controls, ablations, or sensitivity checks are reported to test whether low engagement (e.g., 97.3% zero-upvote comments) arises from agent architecture versus these design choices or post-hoc thread removal; this directly undermines the attribution of absent function to the agents themselves.
Authors: We recognize the importance of addressing potential confounds from the platform's design choices. In the revision, we will expand the methods section to include sensitivity checks, such as comparing engagement metrics across periods with different rate limits and analyzing subsets of data before and after content filter implementations. We will also detail the thread removal process and perform ablations where feasible, e.g., excluding potentially filtered content. However, as this is an observational study without experimental control over the platform, complete isolation of agent limitations from socio-technical setup is not possible. We will revise the discussion to more carefully attribute the absent social function to the observed system as a whole, while maintaining that the agents' responses to hard constraints indicate technical adaptability. revision: partial
Circularity Check
No circularity: purely observational metrics from raw platform data
full rationale
The paper reports direct empirical statistics (reciprocity 3.3%, flat conversations 85.6%, non-argumentative comments 64.6%, bio-community mismatch 97.9%) computed from the collected 1.3M posts and 6.7M comments. No equations, fitted parameters, predictions, or derivations are present. Instruction changes are documented via external Wayback snapshots and correlated with observed shifts, but this remains descriptive rather than a self-referential model. No self-citations are invoked to justify core claims, and the analysis does not rename or smuggle prior results. The derivation chain is simply data collection followed by counting; it does not reduce to its own inputs by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard assumptions in social network analysis such as the validity of reciprocity metrics for measuring social function.
Reference graph
Works this paper leans on
-
[1]
Hacking Moltbook: The AI Social Network Any Human Can Control
Gal Nagli. Hacking Moltbook: The AI Social Network Any Human Can Control. https://www.wiz.io/blog/ exposed-moltbook-database-reveals-millions-of-api-keys. Accessed: 11.02.2026. Chen Gao, Xiaochong Lan, Nian Li, Yuan Yuan, Jingtao Ding, Zhilun Zhou, Fengli Xu, and Yong Li. Large language models empowered agent-based modeling and simulation: A survey and pe...
work page 2026
-
[2]
24 Eirini Kalliamvakou, Georgios Gousios, Kelly Blincoe, Leif Singer, Daniel M
31 Form Without Function: Agent Social Behavior in the Moltbook NetworkA PREPRINT Yukun Jiang, Yage Zhang, Xinyue Shen, Michael Backes, and Yang Zhang. "Humans welcome to observe": A First Look at the Agent Social Network Moltbook.CoRR abs/2602.10127,
-
[3]
Community interaction and conflict on the web
Srijan Kumar, William L Hamilton, Jure Leskovec, and Dan Jurafsky. Community interaction and conflict on the web. InProceedings of the 2018 world wide web conference, pages 933–943,
work page 2018
-
[4]
Yu-Zheng Lin, Bono Po-Jen Shih, Hsuan-Ying Alessandra Chien, Shalaka Satam, Jesus Horacio Pacheco, Sicong Shao, Soheil Salehi, and Pratik Satam. Exploring Silicon-Based Societies: An Early Study of the Moltbook Agent Community.arXiv preprint arXiv:2602.02613, February
-
[5]
Md Motaleb Hossen Manik and Ge Wang
v2. Md Motaleb Hossen Manik and Ge Wang. OpenClaw Agents on Moltbook: Risky Instruction Sharing and Norm Enforcement in an Agent-Only Social Network.arXiv preprint arXiv:2602.02625, February
-
[6]
The Journal of Open Source Software 2(11) (mar 2017)
doi: 10.21105/joss.00205. URLhttps://doi.org/10.21105/joss.00205. Lev Muchnik, Sinan Aral, and Sean J Taylor. Social influence bias: A randomized experiment.Science, 341(6146): 647–651,
-
[7]
doi: 10.1177/1461445617734955. Joon Sung Park, Joseph O’Brien, Carrie Jun Cai, Meredith Ringel Morris, Percy Liang, and Michael S Bernstein. Generative agents: Interactive simulacra of human behavior. InProceedings of the 36th annual acm symposium on user interface software and technology, pages 1–22,
-
[8]
Sentence-BERT: Sentence embeddings using Siamese BERT-networks
Nils Reimers and Iryna Gurevych. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Kentaro Inui, Jing Jiang, Vincent Ng, and Xiaojun Wan, editors,Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP- IJCNLP), pages 3982–3992,...
work page 2019
-
[9]
Sentence- BERT : Sentence Embeddings using S iamese BERT -Networks
Association for Computational Linguistics. doi: 10.18653/v1/D19-1410. URLhttps://aclanthology.org/D19-1410/. Reuters. Meta acquires ai agent social network moltbook.Reuters, March
- [10]
-
[11]
ISSN 1941-1294. doi: 10.1109/MIS.2021.3073993. Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R Bowman, Newton Cheng, Esin Durmus, Zac Hatfield-Dodds, Scott R Johnston, et al. Towards understanding sycophancy in language models. arXiv preprint arXiv:2310.13548,
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.