Whispers in the Machine: Confidentiality in Agentic Systems

Jonathan Evertz; Lea Sch\"onherr; Merlin Chlosta; Thorsten Eisenhofer

arxiv: 2402.06922 · v5 · submitted 2024-02-10 · 💻 cs.CR · cs.LG

Whispers in the Machine: Confidentiality in Agentic Systems

Jonathan Evertz , Merlin Chlosta , Lea Sch\"onherr , Thorsten Eisenhofer This is my paper

classification 💻 cs.CR cs.LG

keywords agentsattackconfidentialityagenticdatafindriskssensitive

0 comments

read the original abstract

Large language model (LLM)-based agents combine LLMs with external tools to automate tasks such as scheduling meetings, managing documents, or booking travel. While these integrations unlock powerful capabilities, they also create new and more severe attack surfaces. In particular, prompt injection attacks become far more dangerous in the agentic setting: malicious instructions embedded in connected services can misdirect the agent, providing a direct pathway for sensitive data to be exfiltrated. Yet, despite a growing number of real-world incidents, the confidentiality risks of such systems remain poorly understood. To address this gap, we provide a formalization of confidentiality in LLM-based agents. By abstracting sensitive data as a secret string, we evaluate ten agents across 20 tool scenarios and 14 attack strategies. We find that all agents are vulnerable to at least one attack, and existing defenses fail to provide reliable protection against these threats. Strikingly, we find that the tooling itself can amplify leakage risks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

How Can AI Augment Access to Justice? Public Defenders' Perspectives on AI Adoption
cs.CY 2025-10 accept novelty 7.0

Public defenders view AI as most useful for evidence investigation but limited in courtroom work and strategy, with adoption blocked by costs, confidentiality risks, and norms, requiring human oversight and open development.
From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems
cs.MA 2025-06 accept novelty 7.0

A survey that defines Compound AI Systems, proposes a multi-dimensional taxonomy based on component roles and orchestration strategies, reviews four foundational paradigms, and identifies key challenges for future research.