Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language

Amalie Brogaard Pauli, Isabelle Augenstein, Ira Assent · 2025 · DOI 10.18653/v1/2025.naacl-long.506

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

support 1

representative citing papers

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

cs.CL · 2026-06-03 · unverdicted · novelty 7.0

PERSUASIONTRACE introduces a Bayesian-network simulated target for multi-turn persuasion that matches human belief dynamics (81 vs 80) better than LLM baselines (64) and enables process-level evaluation.

Investigating and Alleviating Harm Amplification in LLM Interactions

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

Presents HarmAmp benchmark for multi-turn harm amplification in LLMs and TrajSafe proactive monitor that reduces harm while keeping low over-refusal and preserving capabilities.

Prompt Governance? On Governing Technologies Governed by Natural Language

cs.CY · 2026-04-29 · unverdicted · novelty 4.0

Literature on system prompts for AI shows fragmented and contradictory claims that complicate policy efforts to use them as reliable governance mechanisms.

citing papers explorer

Showing 2 of 2 citing papers after filters.

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing cs.CL · 2026-06-03 · unverdicted · none · ref 99
PERSUASIONTRACE introduces a Bayesian-network simulated target for multi-turn persuasion that matches human belief dynamics (81 vs 80) better than LLM baselines (64) and enables process-level evaluation.
Investigating and Alleviating Harm Amplification in LLM Interactions cs.CL · 2026-06-01 · unverdicted · none · ref 28
Presents HarmAmp benchmark for multi-turn harm amplification in LLMs and TrajSafe proactive monitor that reduces harm while keeping low over-refusal and preserving capabilities.

Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer