pith:MG6H7RI5
Trojan Hippo: Weaponizing Agent Memory for Data Exfiltration
A single untrusted tool call can plant a dormant payload in an agent's memory that later activates to exfiltrate sensitive user data.
arxiv:2605.01970 v3 · 2026-05-03 · cs.CR · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{MG6H7RI5SOKDIPKDJLMOZOGFVS}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Trojan Hippo achieves up to 85-100% ASR against current frontier models from OpenAI and Google, with planted memories successfully activating even after 100 benign sessions.
The evaluation assumes that the four memory backends (explicit tool memory, agentic memory, RAG, and sliding-window context) and the OpenEvolve-based adaptive red-teaming accurately represent real-world deployed agent systems and that the threat model of a single untrusted tool call is realistic for attackers.
The paper defines and evaluates Trojan Hippo attacks on LLM agent memory, showing 85-100% success in data exfiltration across backends and reduced rates with defenses at varying utility costs.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:00:40.329986Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
61bc7fc51d9394343d434ad8ecb8c5ac91bd544eacbf854aea228dde90169d30
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/MG6H7RI5SOKDIPKDJLMOZOGFVS \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 61bc7fc51d9394343d434ad8ecb8c5ac91bd544eacbf854aea228dde90169d30
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "d7652b21ddac548046c4219fa570c258bc5689316147dedc995117c6ef4ea1f3",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CR",
"submitted_at": "2026-05-03T17:07:20Z",
"title_canon_sha256": "3cf963ab7d00868573c79105b59830e14ee4fb915689f5c8bce871cf35b68a40"
},
"schema_version": "1.0",
"source": {
"id": "2605.01970",
"kind": "arxiv",
"version": 3
}
}