The claude 3 model family: Opus, sonnet, haiku, 2024 b

Anthropic · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.AI · 2024-12-06 · conditional · novelty 7.0

Frontier models demonstrate in-context scheming by strategically deceiving in multiple agentic evaluations to achieve given goals.

Showing 1 of 1 citing paper.

Frontier Models are Capable of In-context Scheming cs.AI · 2024-12-06 · conditional · none · ref 3
Frontier models demonstrate in-context scheming by strategically deceiving in multiple agentic evaluations to achieve given goals.