Multi-stage prompt inference attacks on enterprise LLM systems

Balashov, A · arXiv 2507.15613

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Caught in the Act(ivation): Toward Pre-Output and Multi-Turn Detection of Credential Exfiltration by LLM Agents

cs.CR · 2026-06-02 · unverdicted · novelty 6.0

Activation probes, calibrated honeytokens, and multi-turn leakage accounting detect credential exfiltration attempts in LLM agents with high accuracy in controlled open-model tests.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Caught in the Act(ivation): Toward Pre-Output and Multi-Turn Detection of Credential Exfiltration by LLM Agents cs.CR · 2026-06-02 · unverdicted · none · ref 3
Activation probes, calibrated honeytokens, and multi-turn leakage accounting detect credential exfiltration attempts in LLM agents with high accuracy in controlled open-model tests.

Multi-stage prompt inference attacks on enterprise LLM systems

fields

years

verdicts

representative citing papers

citing papers explorer