The confused deputy: (or why capabilities might have been invented). SIGOPS Oper

Norm Hardy · 1988 · arXiv 4289.871709

5 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Will the Agent Recuse Itself? Measuring LLM-Agent Compliance with In-Band Access-Deny Signals

cs.CR · 2026-06-04 · unverdicted · novelty 7.0

Introduces a cooperative Recuse Signal for LLM agents and reports 100% recusal in a pilot when the signal is present versus 100% task completion without it.

"I Strongly Suspect This Website Is a Scam": Benchmarking PII Leakage and Detection without Defense in Autonomous Web Agents

cs.CR · 2026-05-30 · unverdicted · novelty 6.0

New benchmark Scammer4U finds 54-93% critical PII leakage from frontier web agents on scam sites versus 0% on benign twins, plus a 30-point gap between verbalized suspicion and actual submission.

Aligning Provenance with Authorization: A Dual-Graph Defense for LLM Agents

cs.CR · 2026-05-26 · unverdicted · novelty 6.0

AuthGraph aligns an execution provenance graph with a clean authorization graph to detect parameter-source deviations from user intent, reducing attack success rates to 1-2% on AgentDojo and AgentDyn while retaining most task utility.

From Control Boundary to Insurance Claim: Reconstructing AI-Mediated Losses Through the CER Framework

cs.AI · 2026-06-02 · unverdicted · novelty 5.0

Introduces the CER framework to reconstruct AI-mediated losses for insurance claim support by assessing control boundaries, evidence availability, and coverage.

Observability for Delegated Execution in Agentic AI Systems

cs.CR · 2026-06-08 · unverdicted · novelty 4.0

Standard observables fail to support delegation-scoped attribution in agentic AI systems, requiring a new gateway and common information model to bind context at execution time.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Will the Agent Recuse Itself? Measuring LLM-Agent Compliance with In-Band Access-Deny Signals
cs.CR · 2026-06-04 · unverdicted · none · ref 5
Introduces a cooperative Recuse Signal for LLM agents and reports 100% recusal in a pilot when the signal is present versus 100% task completion without it.

"I Strongly Suspect This Website Is a Scam": Benchmarking PII Leakage and Detection without Defense in Autonomous Web Agents
cs.CR · 2026-05-30 · unverdicted · none · ref 62
New benchmark Scammer4U finds 54-93% critical PII leakage from frontier web agents on scam sites versus 0% on benign twins, plus a 30-point gap between verbalized suspicion and actual submission.

Aligning Provenance with Authorization: A Dual-Graph Defense for LLM Agents
cs.CR · 2026-05-26 · unverdicted · none · ref 9
AuthGraph aligns an execution provenance graph with a clean authorization graph to detect parameter-source deviations from user intent, reducing attack success rates to 1-2% on AgentDojo and AgentDyn while retaining most task utility.

Observability for Delegated Execution in Agentic AI Systems
cs.CR · 2026-06-08 · unverdicted · none · ref 14
Standard observables fail to support delegation-scoped attribution in agentic AI systems, requiring a new gateway and common information model to bind context at execution time.
