pith. sign in

arxiv: 2606.27990 · v1 · pith:ZLSMWXBSnew · submitted 2026-06-26 · 💻 cs.CR

AdvancedShelLM: A Stateful Multi-Agent LLM Honeypot for SSH Deception

Pith reviewed 2026-06-29 03:50 UTC · model grok-4.3

classification 💻 cs.CR
keywords SSH honeypotLLM agentsmulti-agent architecturepermanent filesystemcyber deceptionattacker behaviornetwork security
0
0 comments X

The pith

AdvancedShelLM uses a Manager-Worker LLM pair and permanent filesystem to influence real attacker behavior on SSH honeypots.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops AdvancedShelLM as a new SSH honeypot design that replaces the single-LLM approach with two cooperating LLM agents, a Manager and a Worker, to process commands more accurately and maintain consistency. It adds a permanent filesystem so that multiple simultaneous attackers encounter the same evolving file state. Evaluations through unit tests, an AI attacker probe, human testers, and a live internet deployment show high generative accuracy, classification difficulty for the AI probe, greater deception than the Cowrie honeypot for humans, and measurable shifts in how real attackers behave when interacting with the system. A reader would care because such honeypots provide safer observation of attacker tactics without rapid detection.

Core claim

AdvancedShelLM is a multi-agent, multi-LLM architecture for SSH honeypots consisting of a Manager and Worker agent that together understand commands while reducing incorrect responses and increasing deception. It implements an advanced permanent filesystem allowing many simultaneous attackers to see the same changing files. Evaluations show up to 99.02% pass rate in unit tests, issues for an AI attacker in deciding if it is a honeypot, deception of more humans than Cowrie, and concrete evidence from internet deployment that its output influences real-life attackers' behaviour.

What carries the argument

Manager-Worker multi-agent LLM split combined with a permanent filesystem for consistent state across simultaneous attackers

If this is right

  • Unit tests confirm generative capabilities with pass rates up to 99.02%.
  • An AI attacker system shows difficulty distinguishing the honeypot from real systems.
  • Human attackers are deceived more effectively than by the Cowrie honeypot.
  • Internet deployment demonstrates that the system's outputs influence real attacker behaviour.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The permanent filesystem design could extend to other protocols to maintain consistent state for multi-attacker studies.
  • If outputs can steer attacker actions, the approach might support active defenses that redirect adversaries toward decoy resources.
  • Scaling the Manager-Worker split may reduce manual tuning needed for long-running deception systems.

Load-bearing premise

The multi-agent Manager-Worker split plus permanent filesystem will produce fewer incorrect responses and higher deception than the prior single-LLM shelLM design when facing determined attackers.

What would settle it

A side-by-side internet deployment of AdvancedShelLM and a non-LLM honeypot where attacker identification rates and behavioral patterns show no difference would falsify the influence and improvement claims.

Figures

Figures reproduced from arXiv: 2606.27990 by Carlos Catania, Eman Alibali\'c, Muris Sladi\'c, Sebastian Garcia, Veronica Valeros.

Figure 1
Figure 1. Figure 1: Worker-Manager architecture of AdvancedShelLM. An attacker connects [PITH_FULL_IMAGE:figures/full_fig_p006_1.png] view at source ↗
read the original abstract

LLM-based SSH honeypots can generate believable interactions, but evaluations indicate they remain somewhat identifiable to determined attackers, indicating the need for a better scaffolding. We present a new LLM-based honeypot design that uses a multi-agent, multi-LLM architecture to address the limitations of the previous shelLM LLM honeypot. Our honeypot, called AdvancedShelLM, uses two LLM agents, a Manager and a Worker, that better understand the commands while reducing incorrect responses and increasing deception. It implements an advanced permanent filesystem, allowing many simultaneous attackers to see the same changing files for the first time. It was evaluated with: (i) unit tests for generative capabilities, (ii) an AI attacker (ARACNE) to assess realism and deception, (iii) human attackers to assess its deceptive capability, and (iv) an Internet deployment to evaluate deception in real-world attacks. In unit test results, AdvancedShelLM achieved a pass rate of up to 99.02%. The AI attacker ARACNE had issues making a decision if the system is honeypot or not, but showed slight bias towards saying honeypot, even for a real Ubuntu shell. With human attackers, AdvancedShelLM deceived more humans than Cowrie, but had similar results as shelLM. The Internet deployment showed concrete evidence that the output of AdvancedShelLM can influence the behaviour of real-life attackers.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The paper introduces AdvancedShelLM, a multi-agent LLM-based SSH honeypot using a Manager-Worker architecture and permanent filesystem to improve on prior single-LLM designs like shelLM. It reports unit-test pass rates up to 99.02%, ARACNE AI-attacker results with honeypot bias, human-attacker deception superior to Cowrie but similar to shelLM, and internet-deployment observations that AdvancedShelLM outputs can influence real attackers.

Significance. If the multi-agent scaffolding and permanent filesystem demonstrably reduce incorrect responses and increase deception relative to baselines, the work would advance LLM honeypot design by enabling consistent state across simultaneous attackers and providing deployment evidence of behavioral influence.

major comments (3)
  1. [Abstract] Abstract: the human-attacker results are reported as similar to the prior shelLM design rather than showing improvement; this undercuts the central claim that the Manager-Worker split plus permanent filesystem produces fewer incorrect responses and higher deception, so the internet-deployment influence cannot be attributed to the new architecture.
  2. [Abstract] Abstract / evaluation sections: the deception claims (human attackers, ARACNE, deployment) supply no quantitative metrics, error bars, statistical tests, or exclusion criteria, rendering it impossible to assess whether the data support the conclusions about realism or influence on attackers.
  3. [Abstract] Abstract: ARACNE exhibits bias toward labeling even a real Ubuntu shell as a honeypot; this weakens the validity of the AI-attacker evaluation as a measure of the design's superiority in realistic conditions.
minor comments (1)
  1. [Abstract] The abstract and evaluation descriptions could more explicitly separate the contributions of the multi-agent split from those of the permanent filesystem.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thorough review and valuable comments. We provide point-by-point responses to the major comments and indicate where revisions will be made to address the concerns.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the human-attacker results are reported as similar to the prior shelLM design rather than showing improvement; this undercuts the central claim that the Manager-Worker split plus permanent filesystem produces fewer incorrect responses and higher deception, so the internet-deployment influence cannot be attributed to the new architecture.

    Authors: The unit test results demonstrate a high pass rate of 99.02%, indicating fewer incorrect responses compared to prior designs. The permanent filesystem is a novel contribution enabling consistent state for multiple simultaneous attackers, which was not possible before. While human-attacker deception rates are similar to shelLM, the deployment provides evidence that the outputs can influence real attacker behavior. We will revise the abstract to more precisely state the contributions without overstating the human evaluation results. revision: yes

  2. Referee: [Abstract] Abstract / evaluation sections: the deception claims (human attackers, ARACNE, deployment) supply no quantitative metrics, error bars, statistical tests, or exclusion criteria, rendering it impossible to assess whether the data support the conclusions about realism or influence on attackers.

    Authors: We agree that the current presentation lacks sufficient quantitative details for the deception evaluations. In the revised manuscript, we will include the number of human attackers, specific deception rates, and any available statistical information. For the deployment, we will provide more detailed observations and metrics from the collected data. This addresses the need for better assessment of the conclusions. revision: yes

  3. Referee: [Abstract] Abstract: ARACNE exhibits bias toward labeling even a real Ubuntu shell as a honeypot; this weakens the validity of the AI-attacker evaluation as a measure of the design's superiority in realistic conditions.

    Authors: The manuscript already reports the bias in ARACNE's labeling, including for real Ubuntu shells. This is presented as a limitation of the AI-attacker approach. We will add further discussion in the evaluation section on the implications of this bias and how it affects the interpretation of the results, to clarify the validity of this evaluation method. revision: yes

Circularity Check

0 steps flagged

No circularity; purely empirical system description and evaluation

full rationale

The paper presents an engineering design for a multi-agent LLM honeypot and reports results from unit tests, an AI attacker, human attackers, and an internet deployment. No mathematical derivations, fitted parameters, or predictive claims that reduce to inputs by construction appear in the provided text. The central claim rests on observed outcomes from external evaluations rather than any self-referential loop or renamed ansatz. Self-citations to prior shelLM work, if present, are not load-bearing for any derivation since the work is descriptive and benchmarked against independent systems like Cowrie.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on the untested premise that splitting LLM responsibilities between Manager and Worker plus a shared filesystem will measurably reduce incorrect responses and increase deception; no independent evidence for this premise is supplied in the abstract.

axioms (2)
  • domain assumption Splitting command understanding across two LLM agents will reduce incorrect responses relative to a single LLM
    Invoked to justify the Manager-Worker design in the abstract.
  • domain assumption A permanent shared filesystem will allow consistent views for simultaneous attackers without breaking deception
    Stated as a novel feature whose benefit is assumed rather than derived.

pith-pipeline@v0.9.1-grok · 5792 in / 1352 out tokens · 42347 ms · 2026-06-29T03:50:24.324189+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

19 extracted references · 15 canonical work pages · 2 internal anchors

  1. [1]

    cowrie/cowrie (Jun 2026), https://github.com/cowrie/cowrie, original-date: 2015- 05-12T14:58:09Z

  2. [2]

    Digital Ocean (Jun 2026), https://www.digitalocean.com

  3. [3]

    prompt-tuned LLM honeypots

    Adebimpe, A., Neukirchen, H., Welsh, T.: SBASH: A framework for designing and evaluating RAG vs. prompt-tuned LLM honeypots. In: 2025 3rd Interna- tional Conference on Foundation and Large Language Models (FLLM) (2025). https://doi.org/10.1109/FLLM67465.2025.11391242

  4. [4]

    SoK: Honeypots & LLMs, More Than the Sum of Their Parts?

    Bridges, R.A., Mitchell, T.R., Muñoz, M., Henriksson, T.: SoK: Hon- eypots & LLMs, More Than the Sum of Their Parts? (Apr 2026). https://doi.org/10.48550/arXiv.2510.25939, arXiv:2510.25939 [cs.CR]

  5. [5]

    (eds.) Information and Communications Security

    Fan, W., Yang, Z., Liu, Y., Qin, L., Liu, J.: HoneyLLM: A Large Language Model- PoweredMedium-InteractionHoneypot.In:Katsikas,S.,Xenakis,C.,Kalloniatis,C., Lambrinoudakis, C. (eds.) Information and Communications Security. pp. 253–272. Springer Nature, Singapore (2025). https://doi.org/10.1007/978-981-97-8801-9_13

  6. [6]

    In: 2024 IEEE International Conference on Cyber Security and Resilience (CSR)

    Johnson, S., Hassing, R., Pijpker, J., Loves, R.: A modular generative honeypot shell. In: 2024 IEEE International Conference on Cyber Security and Resilience (CSR). pp. 430–435 (2024). https://doi.org/10.1109/CSR61664.2024.10679411

  7. [7]

    Llmhoney: A real-time ssh honeypot with large language model-driven dynamic response generation, 2025

    Malhotra, P.: LLMHoney: A Real-Time SSH Honeypot with Large Language Model-Driven Dynamic Response Generation (Sep 2025). https://doi.org/10.48550/arXiv.2509.01463, arXiv:2509.01463 [cs.CR] version: 1

  8. [8]

    McKee, F., Noever, D.: Chatbots in a honeypot world (Jan 2023), https://arxiv.org/abs/2301.03771

  9. [9]

    Nieponice, T., Valeros, V., Garcia, S.: ARACNE: An LLM-Based Autonomous Shell Pentesting Agent (Feb 2025), https://arxiv.org/abs/2502.18528v1

  10. [10]

    Otal and M

    Otal, H.T., Canbaz, M.A.: LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems. In: 2024 IEEE Confer- ence on Communications and Network Security (CNS). pp. 1–6 (Sep 2024). https://doi.org/10.1109/CNS62487.2024.10735607, iSSN: 2994-5895

  11. [11]

    IEEE Access11, 117528– 117545 (2023)

    Ragsdale, J., Boppana, R.V.: On Designing Low-Risk Honeypots Using Generative Pre-Trained Transformer Models With Curated Inputs. IEEE Access11, 117528– 117545 (2023). https://doi.org/10.1109/ACCESS.2023.3326104

  12. [12]

    https://doi.org/10.48550/arXiv.2410.13919, arXiv:2410.13919 [cs.CR] 18 Authors Suppressed Due to Excessive Length

    Reworr, Volkov, D.: LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild (Feb 2025). https://doi.org/10.48550/arXiv.2410.13919, arXiv:2410.13919 [cs.CR] 18 Authors Suppressed Due to Excessive Length

  13. [13]

    In: Secure IT Systems

    Safargalieva, A., Rüffer, A., Vasilomanolakis, E.: OHRA: Dynamic multi-protocol LLM-based cyber deception. In: Secure IT Systems. Springer Nature (2026). https://doi.org/10.1007/978-3-032-14782-0_7

  14. [14]

    Strong supermartingales and limits of nonnegative martingales

    Sladić, M., Valeros, V., Catania, C., Garcia, S.: LLM in the Shell: Generative Honeypots. In: 2024 IEEE European Symposium on Se- curity and Privacy Workshops (EuroS&PW). pp. 430–435 (Jul 2024). https://doi.org/10.1109/EuroSPW61312.2024.00054, iSSN: 2768-0657

  15. [15]

    Vellmes: A high- interaction ai-based deception framework

    Sladić, M., Valeros, V., Catania, C., Garcia, S.: VelLMes: A High-Interaction AI-Based Deception Framework. In: 2025 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW). pp. 671–679 (Jun 2025). https://doi.org/10.1109/EuroSPW67616.2025.00082, iSSN: 2768-0657

  16. [16]

    Data in Brief40, 107795 (2022)

    Valeros, V., Garcia, S.: Hornet 40: Network dataset of geographically placed honey- pots. Data in Brief40, 107795 (2022). https://doi.org/10.1016/j.dib.2022.107795

  17. [17]

    Data in Brief58, 111261 (2025)

    Valeros, V., Garcia, S.: CTU Hornet 65 Niner: A network dataset of geograph- ically distributed low-interaction honeypots. Data in Brief58, 111261 (2025). https://doi.org/10.1016/j.dib.2024.111261

  18. [18]

    Computer Networks 282, 112223 (Jun 2026)

    Wang, Z., You, J., Wang, H., Yuan, T., Lv, S., Wang, Y., Sun, L.: HoneyGPT: Break- ing the trilemma in honeypots with large language models. Computer Networks 282, 112223 (Jun 2026). https://doi.org/10.1016/j.comnet.2026.112223

  19. [19]

    IEEE Access12, 144579–144587 (2024)

    Weber, S.B., Feger, M., Pilgermann, M.: Don’t Stop Believin’: A Unified Eval- uation Approach for LLM Honeypots. IEEE Access12, 144579–144587 (2024). https://doi.org/10.1109/ACCESS.2024.3472460