Paul Humphreys

URLhttps://arxiv · 1997 · DOI 10.18653/v1/2024.inlg-main.39

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Beyond Social Pressure: Benchmarking Epistemic Attack in Large Language Models

cs.CL · 2026-04-09 · unverdicted · novelty 7.0

PPT-Bench measures how LLMs change answers under epistemic, value, authority, and identity pressures at baseline, single-turn, and multi-turn levels, finding separable inconsistency patterns across five models.

CRAFT: Grounded Multi-Agent Coordination Under Partial Information

cs.CL · 2026-03-26 · unverdicted · novelty 7.0

CRAFT benchmark shows multi-agent coordination under partial information remains unsolved for current LLMs, with smaller open-weight models often matching or beating frontier systems.

citing papers explorer

Showing 2 of 2 citing papers.

Beyond Social Pressure: Benchmarking Epistemic Attack in Large Language Models cs.CL · 2026-04-09 · unverdicted · none · ref 9
PPT-Bench measures how LLMs change answers under epistemic, value, authority, and identity pressures at baseline, single-turn, and multi-turn levels, finding separable inconsistency patterns across five models.
CRAFT: Grounded Multi-Agent Coordination Under Partial Information cs.CL · 2026-03-26 · unverdicted · none · ref 1
CRAFT benchmark shows multi-agent coordination under partial information remains unsolved for current LLMs, with smaller open-weight models often matching or beating frontier systems.

Paul Humphreys

fields

years

verdicts

representative citing papers

citing papers explorer