Towards the Scalable Evaluation of Cooperativeness in Language Models, March 2023a
Mixed citation behavior. Most common role is background (67%).
citing papers explorer
- Does My Chatbot Have an Agenda? Understanding Human and AI Agency in Human-Human-like Chatbot Interaction
  Agency in sustained human-AI chatbot talks emerges as co-constructed turn-by-turn through boundary-setting and intention-steering, organized in a new 3-by-4 framework of actors and actions.
- The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems
  The 2025 AI Agent Index catalogs technical and safety details for 30 deployed AI agents and finds low developer transparency on safety, evaluations, and societal impacts.
- Progressive Autonomy as Preference Learning: A Formalization of Trust Calibration for Agentic Tool Use
  Trust calibration in agentic tool use is cast as preferential Bayesian optimization over a latent human risk-tolerance function observed through binary approve/deny feedback with a probit likelihood.
- Developing an AI Concept Envisioning Toolkit to Support Reflective Juxtaposition of Values and Harms
  A new toolkit with cards and maps enables AI designers to juxtapose values and harms in early concept stages, shown valuable in designer surveys and interviews.
- How Designers Envision Value-Oriented AI Design Concepts with Generative AI
  Designers using generative AI for concept envisioning engage in reciprocal reflection-in-action that surfaces multi-level value tensions and prioritizes harm recognition over positive value articulation.
- Engaged AI Governance: Addressing the Last Mile Challenge Through Internal Expert Collaboration
  Insider action research in an AI startup identifies three patterns of how practitioners view regulatory requirements and proposes internal expert collaboration as a way to turn external governance rules into shared, practical ownership.
- The Consensus Trap: Dissecting Subjectivity and the "Ground Truth" Illusion in Data Annotation
  A literature review concludes that pursuing consensus in data annotation creates biased AI by dismissing subjective disagreements and enforcing geographic hegemony, and proposes mapping diversity instead.
- Beyond Static Responses: Multi-Agent LLM Systems as a New Paradigm for Social Science Research
  The paper maps LLM agent architectures onto a six-level continuum and argues that higher levels can enable simulation of emergent social phenomena while requiring attention to reproducibility and ethical issues.