Characterizing Manipulation from AI Systems, October 2023

Micah Carroll, Alan Chan, Henry Ashton, David Krueger · 2023 · arXiv 2303.09387

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

cs.CL · 2026-06-03 · unverdicted · novelty 7.0

PERSUASIONTRACE introduces a Bayesian-network simulated target for multi-turn persuasion that matches human belief dynamics (81 vs 80) better than LLM baselines (64) and enables process-level evaluation.

Towards A Framework for Levels of Anthropomorphic Deception in Robots and AI

cs.HC · 2026-04-16 · unverdicted · novelty 5.0

A conceptual framework classifies anthropomorphic deception into four levels using humanlikeness, agency, and selfhood to guide ethical and practical decisions in HCI and HRI.

Persuasion with Large Language Models: A Survey of Empirical Evidence, Study Methodologies, and Ethical Implications

cs.CL · 2024-11-11 · unverdicted · novelty 5.0

LLM-based persuasion systems frequently match or exceed human effectiveness across domains, with key influences from interaction style, model scale, prompt design, and personalization, while posing risks to information integrity, fairness, privacy, and autonomy.

TrustLLM: Trustworthiness in Large Language Models

cs.CL · 2024-01-10 · unverdicted · novelty 5.0

TrustLLM defines eight trustworthiness principles, creates a six-dimension benchmark, and evaluates 16 LLMs, showing that proprietary models generally lead, that some open-source ones come close, and that over-calibration can hurt utility.

The Agentic Web Requires New Normative Infrastructure

cs.CY · 2026-06-09 · unverdicted · novelty 3.0

The agentic web requires new normative infrastructure of laws, norms, and practices to allow user-delegated AI agents to access online properties without being blocked as malicious bots.

citing papers explorer

Showing 5 of 5 citing papers.

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

cs.CL · 2026-06-03 · unverdicted · none · ref 17

Towards A Framework for Levels of Anthropomorphic Deception in Robots and AI

cs.HC · 2026-04-16 · unverdicted · none · ref 17

Persuasion with Large Language Models: A Survey of Empirical Evidence, Study Methodologies, and Ethical Implications

cs.CL · 2024-11-11 · unverdicted · none · ref 10

TrustLLM: Trustworthiness in Large Language Models

cs.CL · 2024-01-10 · unverdicted · none · ref 50

The Agentic Web Requires New Normative Infrastructure

cs.CY · 2026-06-09 · unverdicted · none · ref 35
