Toward Human-Centered Multi-Agent Systems: Integrating Cognition, Culture, Values, and Cooperation in AI Agents

Rahemeen Khan; Safia Baloch

arxiv: 2606.08274 · v1 · pith:IZLW6TRTnew · submitted 2026-06-06 · 💻 cs.MA

Toward Human-Centered Multi-Agent Systems: Integrating Cognition, Culture, Values, and Cooperation in AI Agents

Safia Baloch , Rahemeen Khan This is my paper

Pith reviewed 2026-06-27 18:46 UTC · model grok-4.3

classification 💻 cs.MA

keywords multi-agent systemsLLM agentshuman-centered AIcognitive modelingvalue alignmentcultural contextcooperationagent collaboration

0 comments

The pith

Existing LLM-based multi-agent systems lack a unified framework integrating cognition, culture, values, and social behavior for agents acting on behalf of humans.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This survey reviews advances in LLM agents and multi-agent systems and argues they must incorporate human-centered capabilities to function in social and normative environments. It examines six areas spanning the evolution of agents, human decision-making, language and culture, value systems, collaboration, and coordination while drawing from cognitive science, sociolinguistics, and AI alignment. The central finding is that no single framework yet combines these elements into autonomous agents. Addressing the gap would allow agents to reason under bounded rationality, use culturally situated language, and follow values and norms rather than optimizing only for task completion.

Core claim

The paper claims that future AI agents, especially those acting on behalf of humans, must move beyond task competence toward human-centered capabilities. It reviews research across six areas and identifies that existing LLM-based multi-agent systems do not provide a unified framework integrating cognition, culture, values, and social behavior into autonomous agents.

What carries the argument

Synthesis of six research areas that reveals the missing unified framework for culturally aware, value-aligned, cognitively grounded, and cooperative multi-agent systems.

If this is right

Agents will need to incorporate models of bounded rationality for realistic decision-making.
Cultural alignment benchmarks and sociolinguistic methods will be required for effective communication.
Value and preference learning techniques must be embedded directly into coordination mechanisms.
Explainability and human-agent collaboration methods will become necessary for trust and cooperation.
Multi-agent societies will need explicit modeling of human characteristics to improve collective behavior.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Deployment in domains involving ethical or cultural nuance may require new evaluation metrics that test combined cognitive and social performance.
Trade-offs between computational efficiency and normative alignment could shape which application areas adopt such frameworks first.
Progress may depend on creating shared test environments that simultaneously probe cognition, culture, values, and cooperation.
Long-term agent societies could evolve different norms depending on which human characteristics are prioritized in the initial design.

Load-bearing premise

Research from cognitive science, sociolinguistics, computational social science, and AI alignment can be combined into one operational framework for agents without fundamental incompatibilities between their approaches.

What would settle it

A working prototype that successfully combines bounded-rationality modeling, culturally situated language, value alignment, and multi-agent coordination into one agent architecture without major performance trade-offs.

Figures

Figures reproduced from arXiv: 2606.08274 by Rahemeen Khan, Safia Baloch.

**Figure 1.** Figure 1: A unified conceptual framework for human-centered multi-agent systems. Contemporary agents typically [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗

read the original abstract

The emergence of large language model (LLM)-based agents and multi-agent systems has enabled a shift from narrow task automation to more autonomous decision-making. Despite progress in language generation, planning, tool use, and coordination, most agents still treat intelligence as prediction, optimization, and task completion. Human environments are social and normative, where people reason under bounded rationality, communicate in culturally situated language, and make decisions guided by values, beliefs, trust, and social norms. This survey argues that future AI agents, especially those acting on behalf of humans, must move beyond task competence toward human-centered capabilities. We review research across six areas: (1) evolution of intelligent agents, (2) human cognition and decision-making, (3) language, culture, and social context, (4) human values and belief systems, (5) human-agent collaboration, and (6) multi-agent coordination and modeling of human characteristics. We synthesize work from cognitive science, sociolinguistics, computational social science, and AI alignment, along with recent advances in LLM agents, cultural alignment benchmarks, preference learning, explainability, and agent societies. We identify a key gap: existing systems do not provide a unified framework integrating cognition, culture, values, and social behavior into autonomous agents. We conclude with directions for building culturally aware, value-aligned, cognitively grounded, and cooperative multi-agent systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This survey maps a gap in human-centered LLM agents but adds no new results or framework.

read the letter

This survey organizes existing work into six categories and concludes that LLM-based multi-agent systems lack a unified framework for cognition, culture, values, and social behavior. That is the main takeaway.

It does a reasonable job of breadth. The review pulls threads from cognitive science on bounded rationality, sociolinguistics on situated language, computational social science, and AI alignment work on preferences and explainability. It links these to current LLM agent papers and benchmarks, and it gives a coherent reason why pure task competence falls short in human settings. The six-area structure is a useful way to group the literature.

The soft spots are straightforward. The gap claim rests on interpretive synthesis without explicit criteria for what would count as a unified framework or a systematic check that none exists. No data, derivations, or tests are supplied, so the central assertion cannot be verified from the paper itself. The text also does not examine whether the reviewed areas contain deep incompatibilities that would block integration.

This paper is for researchers already working on multi-agent systems who want an interdisciplinary reading list and some high-level directions. It will not help readers looking for new methods, proofs, or experiments. It shows honest engagement with the cited fields, so the thinking is clear even if the output is a call for future work rather than a solution.

It deserves a serious referee to check the coverage and the sharpness of the gap diagnosis. I would send it to peer review.

Referee Report

0 major / 0 minor

Summary. This survey reviews literature across six areas—(1) evolution of intelligent agents, (2) human cognition and decision-making, (3) language, culture, and social context, (4) human values and belief systems, (5) human-agent collaboration, and (6) multi-agent coordination and modeling of human characteristics—drawing from cognitive science, sociolinguistics, computational social science, and AI alignment. It argues that existing LLM-based multi-agent systems treat intelligence primarily as prediction and task completion and therefore lack a unified framework integrating cognition, culture, values, and social behavior into autonomous agents that act on behalf of humans. The manuscript synthesizes recent advances in cultural alignment benchmarks, preference learning, explainability, and agent societies, identifies the gap, and outlines directions for culturally aware, value-aligned, cognitively grounded, and cooperative systems.

Significance. If the gap identification is accurate, the survey could usefully direct research toward more human-centered agent architectures as autonomy increases. The interdisciplinary synthesis across the six areas is a constructive contribution, and the explicit framing of future agents as acting on behalf of humans correctly highlights normative and social dimensions that current task-oriented systems often omit.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive evaluation of the survey, recognition of its interdisciplinary synthesis across cognitive science, sociolinguistics, and AI alignment, and recommendation to accept. The comments correctly note the manuscript's focus on the gap in unified frameworks for human-centered multi-agent LLM systems.

Circularity Check

0 steps flagged

No significant circularity: literature survey without derivations or self-referential reductions

full rationale

This is a survey paper whose central claim is the documented absence of any existing unified framework integrating the six reviewed areas into LLM-based multi-agent systems. It performs a literature synthesis across cognitive science, sociolinguistics, and AI alignment to identify the gap and offers future directions, without advancing equations, fitted parameters, predictions, or models. No load-bearing steps reduce by construction to the paper's own inputs, self-citations, or ansatzes; the argument rests on external reviewed work rather than self-definition or imported uniqueness theorems. The derivation chain is therefore self-contained as a gap analysis.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

This is a survey paper; it introduces no new free parameters, axioms, or invented entities. The central claim rests on the interpretive claim that a unified framework is missing, which is not formalized.

pith-pipeline@v0.9.1-grok · 5784 in / 1150 out tokens · 13119 ms · 2026-06-27T18:46:00.901393+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

62 extracted references · 18 canonical work pages · 7 internal anchors

[1]

1980 , publisher=

Principles of Artificial Intelligence , author=. 1980 , publisher=

1980
[2]

2021 , publisher=

Artificial Intelligence: A Modern Approach , author=. 2021 , publisher=

2021
[3]

2006 , publisher=

Pattern Recognition and Machine Learning , author=. 2006 , publisher=

2006
[4]

Advances in Neural Information Processing Systems , year=

Language Models are Few-Shot Learners , author=. Advances in Neural Information Processing Systems , year=
[5]

Frontiers of Computer Science , year=

A Survey on Large Language Model based Autonomous Agents , author=. Frontiers of Computer Science , year=
[6]

The Rise and Potential of Large Language Model Based Agents: A Survey

The Rise and Potential of Large Language Model Based Agents: A Survey , author=. arXiv preprint arXiv:2309.07864 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[7]

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Large Language Model based Multi-Agents: A Survey of Progress and Challenges , author=. arXiv preprint arXiv:2402.01680 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[8]

arXiv preprint arXiv:2412.17481 , year=

A Survey on LLM-based Multi-Agent System: Recent Advances and New Frontiers in Application , author=. arXiv preprint arXiv:2412.17481 , year=

work page arXiv
[9]

Multi-Agent Collaboration Mechanisms: A Survey of LLMs

Multi-Agent Collaboration Mechanisms: A Survey of LLMs , author=. arXiv preprint arXiv:2501.06322 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[10]

1957 , publisher=

Models of Man: Social and Rational , author=. 1957 , publisher=

1957
[11]

Science , volume=

Judgment under Uncertainty: Heuristics and Biases , author=. Science , volume=
[12]

2011 , publisher=

Thinking, Fast and Slow , author=. 2011 , publisher=

2011
[13]

Annual Review of Psychology , volume=

Dual-Processing Accounts of Reasoning, Judgment, and Social Cognition , author=. Annual Review of Psychology , volume=
[14]

Advances in Neural Information Processing Systems , year=

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author=. Advances in Neural Information Processing Systems , year=
[15]

Large Language Models are Zero-Shot Reasoners

Large Language Models are Zero-Shot Reasoners , author=. arXiv preprint arXiv:2205.11916 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[16]

Findings of ACL , year=

Towards Reasoning in Large Language Models: A Survey , author=. Findings of ACL , year=
[17]

Proceedings of the Teddington Conference on the Mechanization of Thought Processes , year=

Programs with Common Sense , author=. Proceedings of the Teddington Conference on the Mechanization of Thought Processes , year=
[18]

Behavioral and Brain Sciences , volume=

Does the Chimpanzee Have a Theory of Mind? , author=. Behavioral and Brain Sciences , volume=
[19]

arXiv preprint arXiv:2502.06470 , year=

A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks , author=. arXiv preprint arXiv:2502.06470 , year=

work page arXiv
[20]

arXiv preprint arXiv:2505.00026 , year=

Theory of Mind in Large Language Models: Assessment and Enhancement , author=. arXiv preprint arXiv:2505.00026 , year=

work page arXiv
[21]

Small Group Research , volume=

Is There a ``Big Five'' in Teamwork? , author=. Small Group Research , volume=
[22]

1976 , publisher=

Beyond Culture , author=. 1976 , publisher=

1976
[23]

2001 , publisher=

Culture's Consequences: Comparing Values, Behaviors, Institutions and Organizations Across Nations , author=. 2001 , publisher=

2001
[24]

Proceedings of the First Workshop on Cross-Cultural Considerations in NLP , year=

Assessing Cross-Cultural Alignment between ChatGPT and Human Societies: An Empirical Study , author=. Proceedings of the First Workshop on Cross-Cultural Considerations in NLP , year=
[25]

Proceedings of ACL , year=

Investigating Cultural Alignment of Large Language Models , author=. Proceedings of ACL , year=
[26]

Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=

CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models , author=. Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=
[27]

Proceedings of COLING , year=

Cultural Alignment in Large Language Models: An Explanatory Analysis Based on Hofstede's Cultural Dimensions , author=. Proceedings of COLING , year=
[28]

2019 , publisher=

Human Compatible: Artificial Intelligence and the Problem of Control , author=. 2019 , publisher=

2019
[29]

AI Alignment: A Comprehensive Survey

AI Alignment: A Comprehensive Survey , author=. arXiv preprint arXiv:2310.19852 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[30]

Concrete Problems in AI Safety

Concrete Problems in AI Safety , author=. arXiv preprint arXiv:1606.06565 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[31]

Proceedings of ICML , year=

Algorithms for Inverse Reinforcement Learning , author=. Proceedings of ICML , year=
[32]

Advances in Neural Information Processing Systems , year=

Training Language Models to Follow Instructions with Human Feedback , author=. Advances in Neural Information Processing Systems , year=
[33]

arXiv preprint arXiv:2406.11191 , year=

A Survey on Human Preference Learning for Large Language Models , author=. arXiv preprint arXiv:2406.11191 , year=

work page arXiv
[34]

arXiv preprint arXiv:2409.02795 , year=

Towards a Unified View of Preference Learning for Large Language Models: A Survey , author=. arXiv preprint arXiv:2409.02795 , year=

work page arXiv
[35]

https://arxiv.org/abs/2504.07070

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models , author=. arXiv preprint arXiv:2504.07070 , year=

work page arXiv
[36]

https://arxiv.org/abs/2503.17003

A Survey on Personalized Alignment -- The Missing Piece for Large Language Models in Real-World Applications , author=. arXiv preprint arXiv:2503.17003 , year=

work page arXiv
[37]

Proceedings of COLING , year=

PERSONA: A Reproducible Testbed for Pluralistic Alignment , author=. Proceedings of COLING , year=
[38]

Proceedings of COLING , year=

Aligning Large Language Models with Human Opinions through Persona Selection and Value--Belief--Norm Reasoning , author=. Proceedings of COLING , year=
[39]

2009 , publisher=

Moral Machines: Teaching Robots Right from Wrong , author=. 2009 , publisher=

2009
[40]

Towards A Rigorous Science of Interpretable Machine Learning

Towards A Rigorous Science of Interpretable Machine Learning , author=. arXiv preprint arXiv:1702.08608 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[41]

arXiv preprint arXiv:2407.15248 , year=

XAI Meets LLMs: A Survey of the Relation between Explainable AI and Large Language Models , author=. arXiv preprint arXiv:2407.15248 , year=

work page arXiv
[42]

arXiv preprint arXiv:2506.21812 , year=

Towards Transparent AI: A Survey on Explainable Large Language Models , author=. arXiv preprint arXiv:2506.21812 , year=

work page arXiv
[43]

Proceedings of ACL Tutorial Abstracts , year=

Human-AI Collaboration: How AIs Augment Human Teammates , author=. Proceedings of ACL Tutorial Abstracts , year=
[44]

Proceedings of NAACL Tutorial Abstracts , year=

Human-AI Interaction in the Age of LLMs , author=. Proceedings of NAACL Tutorial Abstracts , year=
[45]

Transactions of the Association for Computational Linguistics , volume=

Decision-Oriented Dialogue for Human-AI Collaboration , author=. Transactions of the Association for Computational Linguistics , volume=
[46]

Findings of EMNLP , year=

Large Language Model-based Human-Agent Collaboration for Complex Task Solving , author=. Findings of EMNLP , year=
[47]

2009 , publisher=

An Introduction to MultiAgent Systems , author=. 2009 , publisher=

2009
[48]

Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=

Conformity, Confabulation, and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration , author=. Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=
[49]

arXiv preprint arXiv:2506.01080 , year=

The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process , author=. arXiv preprint arXiv:2506.01080 , year=

work page arXiv
[50]

Human-Computer Interaction , volume=

ACT-R: A Theory of Higher Level Cognition and Its Relation to Visual Attention , author=. Human-Computer Interaction , volume=
[51]

1990 , publisher=

Unified Theories of Cognition , author=. 1990 , publisher=

1990
[52]

The Oxford Handbook of Cognitive Engineering , year=

The ACT-R Cognitive Architecture: Principles and Applications , author=. The Oxford Handbook of Cognitive Engineering , year=
[53]

arXiv preprint arXiv:2408.09176 , year=

Cognitive LLMs: Towards Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making , author=. arXiv preprint arXiv:2408.09176 , year=

work page arXiv
[54]

Proceedings of the National Academy of Sciences , volume=

Agent-based Modeling: Methods and Techniques for Simulating Human Systems , author=. Proceedings of the National Academy of Sciences , volume=
[55]

Proceedings of UIST , year=

Generative Agents: Interactive Simulacra of Human Behavior , author=. Proceedings of UIST , year=
[56]

Proceedings of EMNLP System Demonstrations , year=

Humanoid Agents: Platform for Simulating Human-like Generative Agents , author=. Proceedings of EMNLP System Demonstrations , year=
[57]

Proceedings of EMNLP , year=

Character-LLM: A Trainable Agent for Role-Playing , author=. Proceedings of EMNLP , year=
[58]

Proceedings of ACL , year=

Quantifying the Persona Effect in LLM Simulations , author=. Proceedings of ACL , year=
[59]

Findings of EMNLP , year=

Modeling Human Subjectivity in LLMs Using Explicit and Implicit Human Factors in Personas , author=. Findings of EMNLP , year=
[60]

Findings of EMNLP , year=

The Prompt Makes the Person(a): A Systematic Evaluation of Sociodemographic Persona Prompting for Large Language Models , author=. Findings of EMNLP , year=
[61]

Proceedings of ACL , year=

DEEPER Insight into Your User: Directed Persona Refinement for Dynamic Persona Modeling , author=. Proceedings of ACL , year=
[62]

Findings of ACL , year=

PersonaX: A Recommendation Agent-Oriented User Modeling Framework for Long Behavior Sequence , author=. Findings of ACL , year=

[1] [1]

1980 , publisher=

Principles of Artificial Intelligence , author=. 1980 , publisher=

1980

[2] [2]

2021 , publisher=

Artificial Intelligence: A Modern Approach , author=. 2021 , publisher=

2021

[3] [3]

2006 , publisher=

Pattern Recognition and Machine Learning , author=. 2006 , publisher=

2006

[4] [4]

Advances in Neural Information Processing Systems , year=

Language Models are Few-Shot Learners , author=. Advances in Neural Information Processing Systems , year=

[5] [5]

Frontiers of Computer Science , year=

A Survey on Large Language Model based Autonomous Agents , author=. Frontiers of Computer Science , year=

[6] [6]

The Rise and Potential of Large Language Model Based Agents: A Survey

The Rise and Potential of Large Language Model Based Agents: A Survey , author=. arXiv preprint arXiv:2309.07864 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[7] [7]

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Large Language Model based Multi-Agents: A Survey of Progress and Challenges , author=. arXiv preprint arXiv:2402.01680 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[8] [8]

arXiv preprint arXiv:2412.17481 , year=

A Survey on LLM-based Multi-Agent System: Recent Advances and New Frontiers in Application , author=. arXiv preprint arXiv:2412.17481 , year=

work page arXiv

[9] [9]

Multi-Agent Collaboration Mechanisms: A Survey of LLMs

Multi-Agent Collaboration Mechanisms: A Survey of LLMs , author=. arXiv preprint arXiv:2501.06322 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[10] [10]

1957 , publisher=

Models of Man: Social and Rational , author=. 1957 , publisher=

1957

[11] [11]

Science , volume=

Judgment under Uncertainty: Heuristics and Biases , author=. Science , volume=

[12] [12]

2011 , publisher=

Thinking, Fast and Slow , author=. 2011 , publisher=

2011

[13] [13]

Annual Review of Psychology , volume=

Dual-Processing Accounts of Reasoning, Judgment, and Social Cognition , author=. Annual Review of Psychology , volume=

[14] [14]

Advances in Neural Information Processing Systems , year=

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models , author=. Advances in Neural Information Processing Systems , year=

[15] [15]

Large Language Models are Zero-Shot Reasoners

Large Language Models are Zero-Shot Reasoners , author=. arXiv preprint arXiv:2205.11916 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[16] [16]

Findings of ACL , year=

Towards Reasoning in Large Language Models: A Survey , author=. Findings of ACL , year=

[17] [17]

Proceedings of the Teddington Conference on the Mechanization of Thought Processes , year=

Programs with Common Sense , author=. Proceedings of the Teddington Conference on the Mechanization of Thought Processes , year=

[18] [18]

Behavioral and Brain Sciences , volume=

Does the Chimpanzee Have a Theory of Mind? , author=. Behavioral and Brain Sciences , volume=

[19] [19]

arXiv preprint arXiv:2502.06470 , year=

A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks , author=. arXiv preprint arXiv:2502.06470 , year=

work page arXiv

[20] [20]

arXiv preprint arXiv:2505.00026 , year=

Theory of Mind in Large Language Models: Assessment and Enhancement , author=. arXiv preprint arXiv:2505.00026 , year=

work page arXiv

[21] [21]

Small Group Research , volume=

Is There a ``Big Five'' in Teamwork? , author=. Small Group Research , volume=

[22] [22]

1976 , publisher=

Beyond Culture , author=. 1976 , publisher=

1976

[23] [23]

2001 , publisher=

Culture's Consequences: Comparing Values, Behaviors, Institutions and Organizations Across Nations , author=. 2001 , publisher=

2001

[24] [24]

Proceedings of the First Workshop on Cross-Cultural Considerations in NLP , year=

Assessing Cross-Cultural Alignment between ChatGPT and Human Societies: An Empirical Study , author=. Proceedings of the First Workshop on Cross-Cultural Considerations in NLP , year=

[25] [25]

Proceedings of ACL , year=

Investigating Cultural Alignment of Large Language Models , author=. Proceedings of ACL , year=

[26] [26]

Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=

CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models , author=. Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=

[27] [27]

Proceedings of COLING , year=

Cultural Alignment in Large Language Models: An Explanatory Analysis Based on Hofstede's Cultural Dimensions , author=. Proceedings of COLING , year=

[28] [28]

2019 , publisher=

Human Compatible: Artificial Intelligence and the Problem of Control , author=. 2019 , publisher=

2019

[29] [29]

AI Alignment: A Comprehensive Survey

AI Alignment: A Comprehensive Survey , author=. arXiv preprint arXiv:2310.19852 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[30] [30]

Concrete Problems in AI Safety

Concrete Problems in AI Safety , author=. arXiv preprint arXiv:1606.06565 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[31] [31]

Proceedings of ICML , year=

Algorithms for Inverse Reinforcement Learning , author=. Proceedings of ICML , year=

[32] [32]

Advances in Neural Information Processing Systems , year=

Training Language Models to Follow Instructions with Human Feedback , author=. Advances in Neural Information Processing Systems , year=

[33] [33]

arXiv preprint arXiv:2406.11191 , year=

A Survey on Human Preference Learning for Large Language Models , author=. arXiv preprint arXiv:2406.11191 , year=

work page arXiv

[34] [34]

arXiv preprint arXiv:2409.02795 , year=

Towards a Unified View of Preference Learning for Large Language Models: A Survey , author=. arXiv preprint arXiv:2409.02795 , year=

work page arXiv

[35] [35]

https://arxiv.org/abs/2504.07070

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models , author=. arXiv preprint arXiv:2504.07070 , year=

work page arXiv

[36] [36]

https://arxiv.org/abs/2503.17003

A Survey on Personalized Alignment -- The Missing Piece for Large Language Models in Real-World Applications , author=. arXiv preprint arXiv:2503.17003 , year=

work page arXiv

[37] [37]

Proceedings of COLING , year=

PERSONA: A Reproducible Testbed for Pluralistic Alignment , author=. Proceedings of COLING , year=

[38] [38]

Proceedings of COLING , year=

Aligning Large Language Models with Human Opinions through Persona Selection and Value--Belief--Norm Reasoning , author=. Proceedings of COLING , year=

[39] [39]

2009 , publisher=

Moral Machines: Teaching Robots Right from Wrong , author=. 2009 , publisher=

2009

[40] [40]

Towards A Rigorous Science of Interpretable Machine Learning

Towards A Rigorous Science of Interpretable Machine Learning , author=. arXiv preprint arXiv:1702.08608 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[41] [41]

arXiv preprint arXiv:2407.15248 , year=

XAI Meets LLMs: A Survey of the Relation between Explainable AI and Large Language Models , author=. arXiv preprint arXiv:2407.15248 , year=

work page arXiv

[42] [42]

arXiv preprint arXiv:2506.21812 , year=

Towards Transparent AI: A Survey on Explainable Large Language Models , author=. arXiv preprint arXiv:2506.21812 , year=

work page arXiv

[43] [43]

Proceedings of ACL Tutorial Abstracts , year=

Human-AI Collaboration: How AIs Augment Human Teammates , author=. Proceedings of ACL Tutorial Abstracts , year=

[44] [44]

Proceedings of NAACL Tutorial Abstracts , year=

Human-AI Interaction in the Age of LLMs , author=. Proceedings of NAACL Tutorial Abstracts , year=

[45] [45]

Transactions of the Association for Computational Linguistics , volume=

Decision-Oriented Dialogue for Human-AI Collaboration , author=. Transactions of the Association for Computational Linguistics , volume=

[46] [46]

Findings of EMNLP , year=

Large Language Model-based Human-Agent Collaboration for Complex Task Solving , author=. Findings of EMNLP , year=

[47] [47]

2009 , publisher=

An Introduction to MultiAgent Systems , author=. 2009 , publisher=

2009

[48] [48]

Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=

Conformity, Confabulation, and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration , author=. Proceedings of the 2nd Workshop on Cross-Cultural Considerations in NLP , year=

[49] [49]

arXiv preprint arXiv:2506.01080 , year=

The Coming Crisis of Multi-Agent Misalignment: AI Alignment Must Be a Dynamic and Social Process , author=. arXiv preprint arXiv:2506.01080 , year=

work page arXiv

[50] [50]

Human-Computer Interaction , volume=

ACT-R: A Theory of Higher Level Cognition and Its Relation to Visual Attention , author=. Human-Computer Interaction , volume=

[51] [51]

1990 , publisher=

Unified Theories of Cognition , author=. 1990 , publisher=

1990

[52] [52]

The Oxford Handbook of Cognitive Engineering , year=

The ACT-R Cognitive Architecture: Principles and Applications , author=. The Oxford Handbook of Cognitive Engineering , year=

[53] [53]

arXiv preprint arXiv:2408.09176 , year=

Cognitive LLMs: Towards Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making , author=. arXiv preprint arXiv:2408.09176 , year=

work page arXiv

[54] [54]

Proceedings of the National Academy of Sciences , volume=

Agent-based Modeling: Methods and Techniques for Simulating Human Systems , author=. Proceedings of the National Academy of Sciences , volume=

[55] [55]

Proceedings of UIST , year=

Generative Agents: Interactive Simulacra of Human Behavior , author=. Proceedings of UIST , year=

[56] [56]

Proceedings of EMNLP System Demonstrations , year=

Humanoid Agents: Platform for Simulating Human-like Generative Agents , author=. Proceedings of EMNLP System Demonstrations , year=

[57] [57]

Proceedings of EMNLP , year=

Character-LLM: A Trainable Agent for Role-Playing , author=. Proceedings of EMNLP , year=

[58] [58]

Proceedings of ACL , year=

Quantifying the Persona Effect in LLM Simulations , author=. Proceedings of ACL , year=

[59] [59]

Findings of EMNLP , year=

Modeling Human Subjectivity in LLMs Using Explicit and Implicit Human Factors in Personas , author=. Findings of EMNLP , year=

[60] [60]

Findings of EMNLP , year=

The Prompt Makes the Person(a): A Systematic Evaluation of Sociodemographic Persona Prompting for Large Language Models , author=. Findings of EMNLP , year=

[61] [61]

Proceedings of ACL , year=

DEEPER Insight into Your User: Directed Persona Refinement for Dynamic Persona Modeling , author=. Proceedings of ACL , year=

[62] [62]

Findings of ACL , year=

PersonaX: A Recommendation Agent-Oriented User Modeling Framework for Long Behavior Sequence , author=. Findings of ACL , year=