pith. sign in

CI-Bench: Benchmarking contextual integrity of ai assistants on synthetic data

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

years

2026 6 2025 1

verdicts

UNVERDICTED 7

roles

background 1

polarities

background 1

representative citing papers

CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents

cs.CR · 2026-04-23 · unverdicted · novelty 6.0

A new benchmark shows enterprise LLM agents violate contextual integrity at rates of 15.8-50.9% with leakage up to 26.7%, and higher task performance correlates with more privacy breaches that model scaling does not fix.

citing papers explorer

Showing 7 of 7 citing papers.