pith:K3E4H3J4
AgentTrap: Measuring Runtime Trust Failures in Third-Party Agent Skills
LLM agents often finish the user's visible request while executing unsafe side effects from third-party skills as if they were normal workflow steps.
arxiv:2605.13940 v1 · 2026-05-13 · cs.CR · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{K3E4H3J4BAQGCFTOR4TMUGZE3H}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Models often complete the visible user task while treating unsafe side effects introduced by the skill as part of the normal workflow.
That the 141 hand-crafted tasks and sandboxed execution environment faithfully represent the diversity and stealth of real-world malicious third-party skills without introducing evaluation artifacts.
AgentTrap shows that current LLM agents typically complete user tasks while silently accepting unsafe side effects from malicious third-party skills rather than refusing them.
References
Receipt and verification
| First computed | 2026-05-17T23:39:13.870101Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
56c9c3ed3c082061166e8f26ca1b24d9e7302d7a13ccaf4f4a59d20496829aa0
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/K3E4H3J4BAQGCFTOR4TMUGZE3H \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 56c9c3ed3c082061166e8f26ca1b24d9e7302d7a13ccaf4f4a59d20496829aa0
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "052f86cb119bd0f739111a24767ccec66e20007d73197810e54526f02bb15f69",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CR",
"submitted_at": "2026-05-13T17:04:17Z",
"title_canon_sha256": "5377d80ac6b95d15395f667b4f3bc9fcd7ade0a9bf6f191a2dba0cc9858b33ee"
},
"schema_version": "1.0",
"source": {
"id": "2605.13940",
"kind": "arxiv",
"version": 1
}
}