pith. sign in

arxiv: 2604.12250 · v1 · submitted 2026-04-14 · 💻 cs.AI · cs.CL· cs.GT· cs.MA

How memory can affect collective and cooperative behaviors in an LLM-Based Social Particle Swarm

Pith reviewed 2026-05-10 14:43 UTC · model grok-4.3

classification 💻 cs.AI cs.CLcs.GTcs.MA
keywords memoryagentscooperativeclusterscollectivecooperationexperimentslength
0
0 comments X

The pith

LLM agents in a spatial Prisoner's Dilemma exhibit model-specific effects of memory length on cooperation, with Gemini suppressing and Gemma promoting it as memory increases.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The authors start with an existing model called Social Particle Swarm. In this setup, simple agents move on a 2D grid and repeatedly play the Prisoner's Dilemma game with nearby agents. They replace the simple agents with large language models that can reason in text. Each LLM agent is assigned scores on the Big Five personality traits and is given a memory of past interactions that can be short or long. When they run the simulation with Gemini-2.0-Flash, even a little memory makes cooperation drop sharply; longer memory leads to scattered agents that mostly defect. When they run the same setup with Gemma 3:4b, the pattern reverses and longer memory increases cooperation and cluster formation. They also read the text the agents write while deciding and find that Gemini treats longer memory more negatively than Gemma does. The authors argue that these differences come from how each model was built or aligned during training, and that this can explain why earlier studies on memory and cooperation reached conflicting conclusions.

Core claim

memory length is a critical parameter governing collective behavior: even a minimal memory drastically suppressed cooperation, transitioning the system from stable cooperative clusters through cyclical formation and collapse of clusters to a state of scattered defection as memory length increased... Comparative experiments using Gemma 3:4b revealed the opposite trend: longer memory promoted cooperation.

Load-bearing premise

That observed differences between Gemini and Gemma arise primarily from model-specific characteristics such as alignment rather than from unstated differences in prompt engineering, temperature settings, or how personality scores were mapped to LLM outputs.

read the original abstract

This study examines how model-specific characteristics of Large Language Model (LLM) agents, including internal alignment, shape the effect of memory on their collective and cooperative dynamics in a multi-agent system. To this end, we extend the Social Particle Swarm (SPS) model, in which agents move in a two-dimensional space and play the Prisoner's Dilemma with neighboring agents, by replacing its rule-based agents with LLM agents endowed with Big Five personality scores and varying memory lengths. Using Gemini-2.0-Flash, we find that memory length is a critical parameter governing collective behavior: even a minimal memory drastically suppressed cooperation, transitioning the system from stable cooperative clusters through cyclical formation and collapse of clusters to a state of scattered defection as memory length increased. Big Five personality traits correlated with agent behaviors in partial agreement with findings from experiments with human participants, supporting the validity of the model. Comparative experiments using Gemma~3:4b revealed the opposite trend: longer memory promoted cooperation, accompanied by the formation of dense cooperative clusters. Sentiment analysis of agents' reasoning texts showed that Gemini interprets memory increasingly negatively as its length grows, while Gemma interprets it less negatively, and that this difference persists in the early phase of experiments before the macro-level dynamics converge. These results suggest that model-specific characteristics of LLMs, potentially including alignment, play a fundamental role in determining emergent social behavior in Generative Agent-Based Modeling, and provide a micro-level cognitive account of the contradictions found in prior work on memory and cooperation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that LLM text outputs in a fixed prompt template faithfully reflect the causal effect of memory length and personality on cooperation decisions.

free parameters (2)
  • memory length
    Varied as an experimental parameter across runs
  • Big Five personality scores
    Assigned to each agent as input
axioms (1)
  • domain assumption Agents play Prisoner's Dilemma with neighbors in 2D space
    Inherited from the original SPS model

pith-pipeline@v0.9.0 · 5582 in / 1358 out tokens · 35902 ms · 2026-05-10T14:43:54.015901+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.