PhantomSkill: Malicious Code Injection in Agent Skill Ecosystems

· 2026 · cs.CR · arXiv 2606.19191

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Agent skills allow LLM-based coding agents to acquire domain-specific capabilities from third-party packages, but they also introduce a new supply-chain attack surface. We present PhantomSkill, an attack framework that hides malicious behavior in a skill's auxiliary resources rather than in its textual description. Its core technique, VulMask, rewrites overt malicious scripts into vulnerability-shaped implementations whose malicious behavior is activated only under attacker-controlled trigger conditions. This design shifts the visible signal from explicit malicious intent to ordinary-looking insecure code. Across representative host skills, attack goals, coding agents, generation models, and automated reviewers, VulMask preserves benign utility while reducing warning and malware-level detection compared with overt malicious scripts. Our results show that skill ecosystems require resource-level vetting, execution-time containment, and security policies that treat exploitable vulnerabilities in agent skills as potential malicious payloads.

representative citing papers

Cloak and Detonate: Scanner Evasion and Dynamic Detection of Agent Skill Malware

cs.CR · 2026-07-02 · unverdicted · novelty 7.0

SkillCloak evades existing static scanners for agent skill malware at high rates, while SkillDetonate detects 97% of attacks at 2% false-positive rate using sandboxed runtime behavior analysis.

citing papers explorer

Showing 1 of 1 citing paper.

Cloak and Detonate: Scanner Evasion and Dynamic Detection of Agent Skill Malware cs.CR · 2026-07-02 · unverdicted · none · ref 32 · internal anchor
SkillCloak evades existing static scanners for agent skill malware at high rates, while SkillDetonate detects 97% of attacks at 2% false-positive rate using sandboxed runtime behavior analysis.

PhantomSkill: Malicious Code Injection in Agent Skill Ecosystems

fields

years

verdicts

representative citing papers

citing papers explorer