Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.
Hugging Face dataset
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 4verdicts
UNVERDICTED 4roles
background 1polarities
background 1representative citing papers
ADAM extracts data from LLM agent memory with up to 100% attack success rate by estimating data distribution and selecting queries via entropy guidance.
CanaryRAG detects RAG extraction attacks in real time by embedding canary tokens and checking dual target-oracle paths for integrity violations, achieving lower chunk recovery rates with negligible overhead.
ALDEN boosts private data extraction rates from RAG systems by combining active learning for query diversification with dynamic estimation of the underlying knowledge-base topic distribution.
citing papers explorer
-
A Case Study on the Impact of Anonymization Along the RAG Pipeline
Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.
-
ADAM: A Systematic Data Extraction Attack on Agent Memory via Adaptive Querying
ADAM extracts data from LLM agent memory with up to 100% attack success rate by estimating data distribution and selecting queries via entropy guidance.
-
Detecting RAG Extraction Attack via Dual-Path Runtime Integrity Game
CanaryRAG detects RAG extraction attacks in real time by embedding canary tokens and checking dual target-oracle paths for integrity violations, achieving lower chunk recovery rates with negligible overhead.
-
ALDEN: Boosting Private Data Extraction from Retrieval-Augmented Generation Systems via Active Learning and Distribution Estimation
ALDEN boosts private data extraction rates from RAG systems by combining active learning for query diversification with dynamic estimation of the underlying knowledge-base topic distribution.