Mpci-bench: A benchmark for multimodal pairwise contextual integrity evaluation of language model agents

MPCI-Bench: A Benchmark for Multimodal Pairwise Contextual Integrity Evaluation of Language Model Agents , author= · 2026 · arXiv 2601.08235

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

PrivacyPeek: Auditing What LLM-Based Agents Acquire, Not Just What They Say

cs.CR · 2026-05-29 · unverdicted · novelty 7.0

PrivacyPeek is a benchmark with 1,182 cases across 7 acquisition behaviors and 16 domains that evaluates acquisition-stage privacy leakage in LLM agents, finding it widespread with limited prompt mitigation.

Symbolic Guardrails for Domain-Specific Agents: Stronger Safety and Security Guarantees Without Sacrificing Utility

cs.SE · 2026-04-16 · unverdicted · novelty 5.0

Symbolic guardrails enforce 74% of specified safety policies in agent benchmarks and boost safety without hurting utility.

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

cs.LG · 2026-05-18 · unverdicted · novelty 4.0

SELFCI uses complementary self-distillation with two reverse KL divergences to align LLMs to contextual integrity while preserving utility, outperforming RL baselines like GRPO in agentic settings.

citing papers explorer

Showing 3 of 3 citing papers after filters.

PrivacyPeek: Auditing What LLM-Based Agents Acquire, Not Just What They Say cs.CR · 2026-05-29 · unverdicted · none · ref 36
PrivacyPeek is a benchmark with 1,182 cases across 7 acquisition behaviors and 16 domains that evaluates acquisition-stage privacy leakage in LLM agents, finding it widespread with limited prompt mitigation.
Symbolic Guardrails for Domain-Specific Agents: Stronger Safety and Security Guarantees Without Sacrificing Utility cs.SE · 2026-04-16 · unverdicted · none · ref 70
Symbolic guardrails enforce 74% of specified safety policies in agent benchmarks and boost safety without hurting utility.
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs cs.LG · 2026-05-18 · unverdicted · none · ref 45
SELFCI uses complementary self-distillation with two reverse KL divergences to align LLMs to contextual integrity while preserving utility, outperforming RL baselines like GRPO in agentic settings.

Mpci-bench: A benchmark for multimodal pairwise contextual integrity evaluation of language model agents

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer