LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild

Reworr, R · 2025 · arXiv 2410.13919

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection

cs.CR · 2026-04-20 · unverdicted · novelty 7.0 · 2 refs

The work introduces and partially evaluates seven cross-domain prompt injection detectors, reporting F1 gains on benchmarks like deepset/prompt-injections and indirect-injection sets via local alignment, stylometry, and fatigue tracking.

SoK: Honeypots & LLMs, More Than the Sum of Their Parts?

cs.CR · 2025-10-29 · unverdicted · novelty 7.0

A systematization of knowledge paper that taxonomizes honeypot detection vectors, synthesizes LLM-honeypot literature into canonical architecture and evaluation methods, and proposes a roadmap for autonomous deception systems.

Ghost Without Shell: Measuring Non-Interactive SSH Attacks on Honeypots

cs.CR · 2026-06-26 · unverdicted · novelty 6.0

Large-scale SSH honeypot deployment shows 99.23% of authenticated sessions are non-interactive, suggesting most attacks do not involve shell interaction.

AdvancedShelLM: A Stateful Multi-Agent LLM Honeypot for SSH Deception

cs.CR · 2026-06-26 · unverdicted · novelty 6.0

AdvancedShelLM deploys a manager-worker multi-LLM architecture and stateful filesystem for SSH honeypots, reporting up to 99% unit-test pass rates and evidence that its outputs alter real attacker behavior in deployment.

Caught in the Act(ivation): Toward Pre-Output and Multi-Turn Detection of Credential Exfiltration by LLM Agents

cs.CR · 2026-06-02 · unverdicted · novelty 6.0

Activation probes, calibrated honeytokens, and multi-turn leakage accounting detect credential exfiltration attempts in LLM agents with high accuracy in controlled open-model tests.

citing papers explorer

Showing 5 of 5 citing papers after filters.

Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection cs.CR · 2026-04-20 · unverdicted · none · ref 18 · 2 links
The work introduces and partially evaluates seven cross-domain prompt injection detectors, reporting F1 gains on benchmarks like deepset/prompt-injections and indirect-injection sets via local alignment, stylometry, and fatigue tracking.
SoK: Honeypots & LLMs, More Than the Sum of Their Parts? cs.CR · 2025-10-29 · unverdicted · none · ref 30
A systematization of knowledge paper that taxonomizes honeypot detection vectors, synthesizes LLM-honeypot literature into canonical architecture and evaluation methods, and proposes a roadmap for autonomous deception systems.
Ghost Without Shell: Measuring Non-Interactive SSH Attacks on Honeypots cs.CR · 2026-06-26 · unverdicted · none · ref 18
Large-scale SSH honeypot deployment shows 99.23% of authenticated sessions are non-interactive, suggesting most attacks do not involve shell interaction.
AdvancedShelLM: A Stateful Multi-Agent LLM Honeypot for SSH Deception cs.CR · 2026-06-26 · unverdicted · none · ref 12
AdvancedShelLM deploys a manager-worker multi-LLM architecture and stateful filesystem for SSH honeypots, reporting up to 99% unit-test pass rates and evidence that its outputs alter real attacker behavior in deployment.
Caught in the Act(ivation): Toward Pre-Output and Multi-Turn Detection of Credential Exfiltration by LLM Agents cs.CR · 2026-06-02 · unverdicted · none · ref 14
Activation probes, calibrated honeytokens, and multi-turn leakage accounting detect credential exfiltration attempts in LLM agents with high accuracy in controlled open-model tests.

LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild

fields

years

verdicts

representative citing papers

citing papers explorer