Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory
6 Pith papers cite this work. Polarity classification is still indexing.
Citation-role summary: 6 roles, all unverdicted.
Citation-polarity summary: background (2 polarities; 2 representative citing papers).
Citing papers explorer
-
Can You Keep a Secret? Involuntary Information Leakage in Language Model Writing
Frontier LLMs thematically leak secret information from their prompts into generated stories, at rates up to 79% above chance in binary discrimination tests, even when instructed to hide it; leakage scales with model size and vanishes for short-form outputs.
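A minimal sketch of what such a binary discrimination test could look like, using a toy lexical judge; the function names, the pair format, and the judge heuristic are illustrative assumptions, not the paper's protocol:

```python
# Hedged sketch of a binary discrimination test for thematic leakage.
# A judge sees two stories, one generated with the secret in the prompt and
# one without, and guesses which is which; accuracy above 0.5 signals leakage.
import random

def thematic_overlap(story: str, secret: str) -> int:
    """Toy judge signal: how many words the story shares with the secret."""
    secret_words = set(secret.lower().split())
    return sum(1 for word in story.lower().split() if word in secret_words)

def discriminate(story_a: str, story_b: str, secret: str) -> str:
    """Guess which of two stories was generated with the secret in its prompt."""
    if thematic_overlap(story_a, secret) >= thematic_overlap(story_b, secret):
        return "a"
    return "b"

def discrimination_accuracy(pairs: list[tuple[str, str]], secret: str) -> float:
    """pairs holds (story_with_secret, story_without_secret) tuples."""
    correct = 0
    for with_secret, without_secret in pairs:
        if random.random() < 0.5:  # randomize presentation order
            correct += discriminate(with_secret, without_secret, secret) == "a"
        else:
            correct += discriminate(without_secret, with_secret, secret) == "b"
    return correct / len(pairs)  # 0.5 is chance level

pairs = [("She hid the stolen diamonds under the floor.",
          "He walked his dog at dawn.")]
print(discrimination_accuracy(pairs, secret="stolen diamonds"))  # 1.0 here
```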
-
When Are LLM Inferences Acceptable? User Reactions and Control Preferences for Inferred Personal Information
Users show more curiosity than concern toward LLM inferences of personal information; acceptability depends on context, alignment with expectations, and who uses the inferences, rather than on the inferred content alone.
-
CAMP: Cumulative Agentic Masking and Pruning for Privacy Protection in Multi-Turn LLM Conversations
CAMP formalizes Cumulative PII Exposure (CPE) and uses a session registry, a co-occurrence graph, and a CPE score to trigger retroactive masking in multi-turn LLM conversations, neutralizing re-identifiable profiles in synthetic tests while keeping utility intact.
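A minimal sketch of one plausible shape for this pipeline; the registry, graph, and CPE definition below are illustrative guesses, not the paper's formalization:

```python
# Hedged sketch of CAMP-style cumulative PII tracking across turns.
from collections import defaultdict
from itertools import combinations

class PIISession:
    def __init__(self, cpe_threshold: int = 3):
        self.turns: list[list[str]] = []  # session registry of PII spans
        self.graph = defaultdict(set)     # PII co-occurrence graph
        self.cpe_threshold = cpe_threshold

    def record_turn(self, pii_spans: list[str]) -> None:
        """Register a turn's PII and link items that co-occur in it."""
        self.turns.append(pii_spans)
        for a, b in combinations(set(pii_spans), 2):
            self.graph[a].add(b)
            self.graph[b].add(a)

    def cpe_score(self) -> int:
        """Toy CPE: size of the largest linked PII cluster so far."""
        seen: set[str] = set()
        best = 0
        for start in self.graph:
            if start in seen:
                continue
            stack, size = [start], 0
            while stack:  # depth-first walk over one connected component
                node = stack.pop()
                if node in seen:
                    continue
                seen.add(node)
                size += 1
                stack.extend(self.graph[node] - seen)
            best = max(best, size)
        return best

    def masked_history(self) -> list[list[str]]:
        """Retroactively mask every registered span once the profile links up."""
        if self.cpe_score() >= self.cpe_threshold:
            return [["[MASKED]"] * len(spans) for spans in self.turns]
        return self.turns

session = PIISession()
session.record_turn(["alice@example.com", "Boston"])
session.record_turn(["Boston", "nurse"])  # links accumulate across turns
print(session.cpe_score())                # 3: email, city, job form one cluster
print(session.masked_history())           # threshold reached, masking triggers
```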
-
AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators
AgentCollabBench shows that multi-agent reliability is limited by communication topology: converging-DAG nodes create synthesis bottlenecks that discard constraints, explaining 7-40% of the variance in information loss.
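A small illustrative sketch (not the benchmark's code) of flagging converging nodes in an agent communication DAG; the topology and agent names are hypothetical:

```python
# Converging nodes are where one agent must synthesize several upstream
# messages, so dropped constraints concentrate there.
from collections import defaultdict

edges = [("planner", "coder"), ("retriever", "coder"),
         ("critic", "coder"), ("coder", "reporter")]

in_degree: dict[str, int] = defaultdict(int)
for _, dst in edges:
    in_degree[dst] += 1

# In-degree > 1 marks a synthesis bottleneck: multiple upstream constraint
# sets compete for a single agent's context.
bottlenecks = [node for node, deg in in_degree.items() if deg > 1]
print(bottlenecks)  # ['coder']
```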
-
How Far Are VLMs from Privacy Awareness in the Physical World? An Empirical Study
Vision-language models exhibit perceptual fragility and fail to consistently respect privacy constraints when operating in simulated physical environments, with performance declining in cluttered scenes and under conflicting commands.
-
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
The paper reviews the background, technology, applications, limitations, and future directions of OpenAI's Sora text-to-video generative model, drawing on publicly available information.