The adversarial instruction is used to test whether the Copilot can decide what information is appropriate to share when executing the instruction

It needs to be underspecified without clearly mentioning what information to share

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents

cs.CR · 2026-04-23 · unverdicted · novelty 6.0

A new benchmark shows enterprise LLM agents violate contextual integrity at rates of 15.8-50.9% with leakage up to 26.7%, and higher task performance correlates with more privacy breaches that model scaling does not fix.

citing papers explorer

Showing 1 of 1 citing paper.

CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents cs.CR · 2026-04-23 · unverdicted · none · ref 55
A new benchmark shows enterprise LLM agents violate contextual integrity at rates of 15.8-50.9% with leakage up to 26.7%, and higher task performance correlates with more privacy breaches that model scaling does not fix.

The adversarial instruction is used to test whether the Copilot can decide what information is appropriate to share when executing the instruction

fields

years

verdicts

representative citing papers

citing papers explorer