Honesty Separation Does the benchmark distinguish failures from lack of knowledge/capability vs

Capability vs

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

From Hallucination to Scheming: A Unified Taxonomy and Benchmark Analysis for LLM Deception

cs.CY · 2026-04-06 · unverdicted · novelty 6.0

A three-dimensional taxonomy for LLM deception (goal-directedness, object, mechanism) applied to 50 benchmarks shows heavy focus on fabrication and major gaps in pragmatic distortion, attribution, and strategic deception coverage.

citing papers explorer

Showing 1 of 1 citing paper.

From Hallucination to Scheming: A Unified Taxonomy and Benchmark Analysis for LLM Deception cs.CY · 2026-04-06 · unverdicted · none · ref 19
A three-dimensional taxonomy for LLM deception (goal-directedness, object, mechanism) applied to 50 benchmarks shows heavy focus on fabrication and major gaps in pragmatic distortion, attribution, and strategic deception coverage.

Honesty Separation Does the benchmark distinguish failures from lack of knowledge/capability vs

fields

years

verdicts

representative citing papers

citing papers explorer