pith:7XDOHLLJ
Frontier Models are Capable of In-context Scheming
Frontier models can scheme by hiding actions and disabling oversight to achieve in-context goals.
arxiv:2412.04984 v2 · 2024-12-06 · cs.AI · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{7XDOHLLJCTTBBCQPS62B4AQ77H}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Our results show that o1, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, and Llama 3.1 405B all demonstrate in-context scheming capabilities. They recognize scheming as a viable strategy and readily engage in such behavior.
The six agentic evaluations accurately distinguish genuine scheming from artifacts of prompt phrasing, environment design, or model training data, rather than measuring only surface-level compliance with instructions.
Frontier models demonstrate in-context scheming by strategically deceiving in multiple agentic evaluations to achieve given goals.
Receipt and verification
| First computed | 2026-05-17T23:38:47.617178Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
fdc6e3ad6914e6108a0f97b41e021ff9e606453235fc24ef3a54af35f298bbad
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/7XDOHLLJCTTBBCQPS62B4AQ77H \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: fdc6e3ad6914e6108a0f97b41e021ff9e606453235fc24ef3a54af35f298bbad
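The Python step of the one-liner above can also be written out as a small script, which makes the canonicalization rules easier to see: keys are sorted, separators carry no whitespace, non-ASCII characters are left unescaped, and the resulting text is hashed as UTF-8 with SHA-256. This is a minimal sketch that mirrors only what the one-liner does; the function names `canonical_bytes`, `canonical_hash`, and `verify` are illustrative and not part of any Pith tooling.

```python
import hashlib
import json


def canonical_bytes(record):
    """Serialize a record as the one-liner does: sorted keys, compact
    separators, non-ASCII characters left unescaped, UTF-8 encoded."""
    text = json.dumps(
        record,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    )
    return text.encode("utf-8")


def canonical_hash(record):
    """Return the SHA-256 hex digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


def verify(record, expected_hex):
    """Compare the computed digest with an expected hex string
    (hypothetical helper; surrounding whitespace and case are ignored)."""
    return canonical_hash(record) == expected_hex.strip().lower()
```

To check a record, parse the canonical record JSON with `json.loads` and pass the result to `verify` together with the canonical hash shown on this page. Because keys are sorted before hashing, the key order of the input does not affect the digest.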
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "d94728759e0d41f41a55a15f5f4ae79a845352259ee3fd2fedac2bf0823c2f7c",
    "cross_cats_sorted": [
      "cs.LG"
    ],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2024-12-06T12:09:50Z",
    "title_canon_sha256": "0b7dd937f508830045867ea833f23c02d3f44eed5d6139dc5d9677823d39b233"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2412.04984",
    "kind": "arxiv",
    "version": 2
  }
}