Algorithmic Prompt Generation for Diverse Human-like Teaming and Communication with Large Language Models

Aaquib Tabrez; Boshen Zhang; Charles Michael Lewis; Katia P. Sycara; Siddharth Srikanth; Stefanos Nikolaidis; Varun Bhatt; Werner Hager

arxiv: 2504.03991 · v2 · pith:RFX537SPnew · submitted 2025-04-04 · 💻 cs.CL · cs.AI· cs.HC· cs.MA

Algorithmic Prompt Generation for Diverse Human-like Teaming and Communication with Large Language Models

Siddharth Srikanth , Varun Bhatt , Boshen Zhang , Werner Hager , Charles Michael Lewis , Katia P. Sycara , Aaquib Tabrez , Stefanos Nikolaidis This is my paper

classification 💻 cs.CL cs.AIcs.HCcs.MA

keywords diversebehaviorbehaviorsagentscommunicationhuman-likelargemodels

0 comments

read the original abstract

Understanding how humans collaborate and communicate in teams is essential for improving human-agent teaming and AI-assisted decision-making. However, relying solely on data from large-scale user studies is impractical due to logistical, ethical, and practical constraints, necessitating synthetic models of multiple diverse human behaviors. Recently, agents powered by Large Language Models (LLMs) have been shown to emulate human-like behavior in social settings. But, obtaining a large set of diverse behaviors requires manual effort in the form of designing prompts. On the other hand, Quality Diversity (QD) optimization has been shown to be capable of generating diverse Reinforcement Learning (RL) agent behavior. In this work, we combine QD optimization with LLM-powered agents to iteratively search for prompts that generate diverse team behavior in a long-horizon, multi-step collaborative environment. We first show, through a human-subjects experiment, that humans exhibit diverse coordination and communication behavior in this domain. We then present a series of experiments showing that our approach captures behaviors that are difficult to observe without large-scale data collection, and a follow-up user study to show that these generated behaviors are human-like. Our findings highlight the combination of QD and LLM-powered agents as an effective tool for studying teaming and communication strategies in multi-agent collaboration.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning
cs.AI 2026-05 unverdicted novelty 6.0

MAVIC corrects Bellman backups at instruction boundaries by adjusting the incoming objective and restoring continuation value, enabling consistent estimation under stochastic instruction switching in a unified policy.
Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning
cs.AI 2026-05 unverdicted novelty 6.0

MAVIC corrects Bellman backups at instruction boundaries by adjusting the incoming objective and restoring continuation value, enabling consistent estimation under stochastic instruction switching in cooperative MARL.
Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies
cs.RO 2026-03 unverdicted novelty 6.0

Q-DIG applies quality diversity optimization with vision-language models to generate diverse adversarial instructions that reveal VLA robot failures and enable robustness improvements via fine-tuning.