pith:6NIWHZOO
Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning
Most plasticity interventions reduce backdoor threats in deep reinforcement learning, but SAM makes them worse by amplifying gradients.
arxiv:2605.14587 v1 · 2026-05-14 · cs.LG · cs.AI · cs.CR
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{6NIWHZOOM67FZJ3SCGCKM7LKM4}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Only one intervention (i.e., SAM) exacerbates backdoor threats, while the other interventions mitigate them. Pathological analysis attributes the exacerbation to backdoor gradient amplification, and the mitigation to activation pathway disruption and representation space compression.
The 14,664 tested cases sufficiently represent the space of practical DRL deployments and attack scenarios so that the observed patterns generalize beyond the chosen environments and models.
Most plasticity interventions in DRL reduce backdoor attack success rates while SAM increases them via gradient amplification; the work introduces an SCC framework and loss-sharpness detection indicator.
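For readers unfamiliar with SAM, the following is a minimal sketch of one update step, assuming that SAM here denotes sharpness-aware minimization in its commonly described form (perturb the weights along the normalized gradient, then descend using the gradient at the perturbed point). The toy quadratic loss, the function name `sam_step`, and the values of `lr` and `rho` are illustrative assumptions and are not taken from the paper; the sketch shows only the generic update rule, not the paper's backdoor analysis.

```python
import numpy as np

def sam_step(w, grad_fn, lr=0.1, rho=0.05):
    """One generic SAM-style update (illustrative, not the paper's setup)."""
    g = grad_fn(w)                                 # gradient at the current weights
    eps = rho * g / (np.linalg.norm(g) + 1e-12)    # ascent step of length rho
    g_sam = grad_fn(w + eps)                       # gradient at the perturbed weights
    return w - lr * g_sam, g, g_sam

# Hypothetical toy loss L(w) = 0.5 * w^T A w, whose gradient is A @ w.
A = np.diag([10.0, 0.1])
grad_fn = lambda w: A @ w

w = np.array([1.0, 1.0])
w_next, g, g_sam = sam_step(w, grad_fn)

# On this convex toy loss the perturbed gradient has a larger norm than the plain one.
print(np.linalg.norm(g), np.linalg.norm(g_sam))
```

On this toy problem the gradient evaluated at the perturbed point is larger in norm than the plain gradient, which gives some intuition for how a SAM-style step can enlarge gradient signals; whether and how this relates to backdoor gradients is a matter for the paper itself.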
Receipt and verification
| First computed | 2026-05-17T23:39:05.291373Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
f35163e5ce67be5ca7721184a67d6a67050855ee3daa3715030ad55375c26cf2
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/6NIWHZOOM67FZJ3SCGCKM7LKM4 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: f35163e5ce67be5ca7721184a67d6a67050855ee3daa3715030ad55375c26cf2
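The same check can be written as a small standalone script. This is a sketch that mirrors the canonicalization used in the one-liner above (sorted keys, compact separators, non-ASCII characters preserved, UTF-8 encoding); the function names `canonical_sha256` and `verify` and the sample record are hypothetical and chosen only for illustration.

```python
import hashlib
import json

def canonical_sha256(record: dict) -> str:
    """Hash a record using the same canonical JSON form as the one-liner."""
    canonical = json.dumps(
        record,
        sort_keys=True,           # key order must not affect the hash
        separators=(",", ":"),    # no whitespace between tokens
        ensure_ascii=False,       # keep non-ASCII characters as-is
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

def verify(record: dict, expected_hex: str) -> bool:
    """Compare the computed digest against a published hex digest."""
    return canonical_sha256(record) == expected_hex.lower()

# Hypothetical sample records: same content, different key order.
sample = {"b": 1, "a": {"d": 2, "c": 3}}
reordered = {"a": {"c": 3, "d": 2}, "b": 1}

digest = canonical_sha256(sample)
print(digest == canonical_sha256(reordered))  # key order does not matter
print(verify(sample, digest))
```

To check a real record, load the `canonical_record` object returned by the endpoint with `json.loads` and pass it to `verify` together with the published canonical hash.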
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "c1a66aaeed12fe61c691610188d7728b89f113ee2e29c05e8f1b7dc175dde62e",
    "cross_cats_sorted": [
      "cs.AI",
      "cs.CR"
    ],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-14T08:58:24Z",
    "title_canon_sha256": "8fc5c0bd06529dc31057da8642c41e4b715bcb901e7e6b93b57560bab1a2208b"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.14587",
    "kind": "arxiv",
    "version": 1
  }
}