LLM -Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang · 2024 · DOI 10.18653/v1/2024.emnlp-main.7

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Beyond Single Plots: A Benchmark for Question Answering on Multi-Charts

cs.CL · 2026-04-23 · unverdicted · novelty 7.0

PolyChartQA is a new mid-scale dataset for multi-chart question answering that reveals a 27.4% accuracy drop for multimodal models on human-authored questions compared to AI-generated ones, plus a modest gain from a proposed prompting method.

Who Plays Which Role When? Communication Role Dynamics for Peer Recognition and Team Performance Prediction

cs.CY · 2026-06-26 · unverdicted · novelty 4.0

A theory-grounded taxonomy of eight communication roles enables scalable annotation via LLMs and outperforms baselines when predicting peer recognition in student teams and performance improvement on a public deliberation dataset.

Prompt Governance? On Governing Technologies Governed by Natural Language

cs.CY · 2026-04-29 · unverdicted · novelty 4.0

Literature on system prompts for AI shows fragmented and contradictory claims that complicate policy efforts to use them as reliable governance mechanisms.

citing papers explorer

Showing 3 of 3 citing papers.

Beyond Single Plots: A Benchmark for Question Answering on Multi-Charts cs.CL · 2026-04-23 · unverdicted · none · ref 52
PolyChartQA is a new mid-scale dataset for multi-chart question answering that reveals a 27.4% accuracy drop for multimodal models on human-authored questions compared to AI-generated ones, plus a modest gain from a proposed prompting method.
Who Plays Which Role When? Communication Role Dynamics for Peer Recognition and Team Performance Prediction cs.CY · 2026-06-26 · unverdicted · none · ref 37
A theory-grounded taxonomy of eight communication roles enables scalable annotation via LLMs and outperforms baselines when predicting peer recognition in student teams and performance improvement on a public deliberation dataset.
Prompt Governance? On Governing Technologies Governed by Natural Language cs.CY · 2026-04-29 · unverdicted · none · ref 177
Literature on system prompts for AI shows fragmented and contradictory claims that complicate policy efforts to use them as reliable governance mechanisms.

LLM -Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

fields

years

verdicts

representative citing papers

citing papers explorer