Vellmes: A high-interaction AI-based deception framework

Muris Sladić, Veronica Valeros, Carlos Catania, Sebastian Garcia · 2025 · arXiv 7616.2025

5 Pith papers cite this work.

representative citing papers

Mind your key: An Empirical Study of LLM API Credential Leakage in iOS Apps

cs.SE · 2026-06-10 · unverdicted · novelty 8.0

Empirical analysis of 444 iOS apps using dynamic traffic interception found 282 leaking LLM API keys across ten providers, with only 28% remediation after three months.

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

cs.CR · 2026-05-28 · unverdicted · novelty 7.0

Honeyval evaluates LLM HTTP honeypots with AI attackers and shows they produce longer interactions, lower detection rates, and cost advantages over rule-based baselines.

Ghost Without Shell: Measuring Non-Interactive SSH Attacks on Honeypots

cs.CR · 2026-06-26 · unverdicted · novelty 6.0

Large-scale SSH honeypot deployment shows 99.23% of authenticated sessions are non-interactive, suggesting most attacks do not involve shell interaction.

AdvancedShelLM: A Stateful Multi-Agent LLM Honeypot for SSH Deception

cs.CR · 2026-06-26 · unverdicted · novelty 6.0

AdvancedShelLM deploys a manager-worker multi-LLM architecture and stateful filesystem for SSH honeypots, reporting up to 99% unit-test pass rates and evidence that its outputs alter real attacker behavior in deployment.

Probabilistic Agents in Deterministic Audits: Evaluating Multi-Agent Systems for Automated Audits Based on the German IT-Grundschutz

cs.CR · 2026-06-24 · conditional · novelty 5.0

A multi-agent system with hybrid RAG and two new enforcement mechanisms shows strong results on semantic extraction phases of IT-Grundschutz but weak results on logical reasoning phases when evaluated against a BSI case study.

citing papers explorer

Mind your key: An Empirical Study of LLM API Credential Leakage in iOS Apps
cs.SE · 2026-06-10 · unverdicted · none · ref 17

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots
cs.CR · 2026-05-28 · unverdicted · none · ref 22

Ghost Without Shell: Measuring Non-Interactive SSH Attacks on Honeypots
cs.CR · 2026-06-26 · unverdicted · none · ref 20

AdvancedShelLM: A Stateful Multi-Agent LLM Honeypot for SSH Deception
cs.CR · 2026-06-26 · unverdicted · none · ref 15

Probabilistic Agents in Deterministic Audits: Evaluating Multi-Agent Systems for Automated Audits Based on the German IT-Grundschutz
cs.CR · 2026-06-24 · conditional · none · ref 14
