Title resolution pending

1 Pith paper cites this work. Polarity classification is still being indexed.

fields

cs.AI 1

years

2026 1

verdicts

UNVERDICTED 1

citing papers explorer

Showing 1 of 1 citing paper.

  • Emergent Strategic Reasoning Risks in AI: A Taxonomy-Driven Evaluation Framework · cs.AI · 2026-04-23 · unverdicted · none · ref 18

    ESRRSim is a taxonomy-driven framework that generates evaluation scenarios and dual rubrics to measure emergent strategic reasoning risks, such as deception and reward hacking, across 11 LLMs. It reports detection rates ranging from 14.45% to 72.72%, with improvements across model generations.