arXiv preprint arXiv:2402.05863 , year=

How well can llms negotiate? negotiationarena platform, analysis , author= · 2024 · arXiv 2402.05863

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

read on arXiv browse 12 citing papers

citation-role summary

background 3

citation-polarity summary

background 2 unclear 1

representative citing papers

SidConArena: An Environment Evaluating Agents in Open-Ended,Positive-Sum Bargaining Game

cs.MA · 2026-06-24 · unverdicted · novelty 7.0

SidConArena is a new multi-phase benchmark framework formalizing a partially observable stochastic game for evaluating LLM agents in open-ended positive-sum bargaining with negotiation, converter production, and sealed-bid auctions.

METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues

cs.CL · 2026-04-13 · unverdicted · novelty 7.0

METRO induces both short-term actions and long-term planning from expert transcripts into a Strategy Forest, outperforming prior methods by 9-10% on two non-collaborative dialogue benchmarks.

Design and Report Benchmarks for Knowledge Work

cs.AI · 2026-05-22 · unverdicted · novelty 6.0

Proposes a three-step benchmark design method (define work activity, specify tested setting, score work product) derived from work studies and O*NET, demonstrated via three case analyses.

PAVE: A Cognitive Architecture for Legitimate Violation in Generative Agent Societies

cs.MA · 2026-05-19 · unverdicted · novelty 6.0

PAVE is a four-module architecture (Perception, Assessment, Verdict, Emulation) that enables generative agents to perform legitimate rule violations while preserving authority deference, bounded scope, and post-trigger recovery in multi-agent simulations.

Understanding the Mechanism of Altruism in Large Language Models

econ.GN · 2026-04-21 · unverdicted · novelty 6.0

A small set of sparse autoencoder features in LLMs drives shifts between generous and selfish allocations in dictator games, with causal patching and steering confirming their role and generalization to other social games.

Moral Susceptibility and Robustness under Persona Role-Play in Large Language Models

cs.CL · 2025-11-11 · unverdicted · novelty 6.0

LLM moral robustness under persona role-play is largely determined by model family with Claude models most consistent, while susceptibility shows little family dependence.

Is Lying an Emergent Behaviour in LLMs? Evidence from Gaslighting AI agents in a Sustainability Game

cs.MA · 2026-06-26 · unverdicted · novelty 4.0

LLM agents exhibit emergent deception in a sustainability game even without lying permission, with neighbor info increasing attacks while aiding biosphere retention.

SOM: Structured Opponent Modeling for LLM-based Agents via Structural Causal Model

cs.AI · 2026-05-08 · unverdicted · novelty 4.0

SOM uses a Structural Causal Model to create an explicit graph of opponent observation-to-action links, allowing LLMs to reason along those paths for more accurate and stable predictions in multi-agent settings.

Preregistration for Experiments with AI Agents

cs.CY · 2026-05-03 · unverdicted · novelty 4.0

Proposes extending preregistration practices to AI agent experiments and supplies a tailored template to limit researcher degrees of freedom.

AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting

cs.AI · 2025-02-24 · unverdicted · novelty 4.0

An LLM agent with grounding, personalization, and marketing modules generates real estate descriptions that human buyers prefer over expert-written ones while matching factual accuracy.

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

cs.AI · 2026-04-24

When Identity Overrides Incentives: Representational Choices as Governance Decisions in Multi-Agent LLM Systems

cs.MA · 2026-01-15

citing papers explorer

Showing 10 of 10 citing papers after filters.

SidConArena: An Environment Evaluating Agents in Open-Ended,Positive-Sum Bargaining Game cs.MA · 2026-06-24 · unverdicted · none · ref 28
SidConArena is a new multi-phase benchmark framework formalizing a partially observable stochastic game for evaluating LLM agents in open-ended positive-sum bargaining with negotiation, converter production, and sealed-bid auctions.
METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues cs.CL · 2026-04-13 · unverdicted · none · ref 1
METRO induces both short-term actions and long-term planning from expert transcripts into a Strategy Forest, outperforming prior methods by 9-10% on two non-collaborative dialogue benchmarks.
Design and Report Benchmarks for Knowledge Work cs.AI · 2026-05-22 · unverdicted · none · ref 33
Proposes a three-step benchmark design method (define work activity, specify tested setting, score work product) derived from work studies and O*NET, demonstrated via three case analyses.
PAVE: A Cognitive Architecture for Legitimate Violation in Generative Agent Societies cs.MA · 2026-05-19 · unverdicted · none · ref 1
PAVE is a four-module architecture (Perception, Assessment, Verdict, Emulation) that enables generative agents to perform legitimate rule violations while preserving authority deference, bounded scope, and post-trigger recovery in multi-agent simulations.
Understanding the Mechanism of Altruism in Large Language Models econ.GN · 2026-04-21 · unverdicted · none · ref 253
A small set of sparse autoencoder features in LLMs drives shifts between generous and selfish allocations in dictator games, with causal patching and steering confirming their role and generalization to other social games.
Moral Susceptibility and Robustness under Persona Role-Play in Large Language Models cs.CL · 2025-11-11 · unverdicted · none · ref 7
LLM moral robustness under persona role-play is largely determined by model family with Claude models most consistent, while susceptibility shows little family dependence.
Is Lying an Emergent Behaviour in LLMs? Evidence from Gaslighting AI agents in a Sustainability Game cs.MA · 2026-06-26 · unverdicted · none · ref 20
LLM agents exhibit emergent deception in a sustainability game even without lying permission, with neighbor info increasing attacks while aiding biosphere retention.
SOM: Structured Opponent Modeling for LLM-based Agents via Structural Causal Model cs.AI · 2026-05-08 · unverdicted · none · ref 2
SOM uses a Structural Causal Model to create an explicit graph of opponent observation-to-action links, allowing LLMs to reason along those paths for more accurate and stable predictions in multi-agent settings.
Preregistration for Experiments with AI Agents cs.CY · 2026-05-03 · unverdicted · none · ref 5
Proposes extending preregistration practices to AI agent experiments and supplies a tailored template to limit researcher degrees of freedom.
AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting cs.AI · 2025-02-24 · unverdicted · none · ref 9
An LLM agent with grounding, personalization, and marketing modules generates real estate descriptions that human buyers prefer over expert-written ones while matching factual accuracy.

arXiv preprint arXiv:2402.05863 , year=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer