Utility-Privacy Tradeoff in Databases: An Information-theoretic Approach

H. Vincent Poor; Lalitha Sankar; S. Raj Rajagopalan

arxiv: 1102.3751 · v4 · pith:5JZBGOPQnew · submitted 2011-02-18 · 💻 cs.IT · math.IT

Utility-Privacy Tradeoff in Databases: An Information-theoretic Approach

Lalitha Sankar , S. Raj Rajagopalan , H. Vincent Poor This is my paper

classification 💻 cs.IT math.IT

keywords dataprivacyanalyticalencodingframeworkinformationinformation-theoreticproblem

0 comments

read the original abstract

Ensuring the usefulness of electronic data sources while providing necessary privacy guarantees is an important unsolved problem. This problem drives the need for an analytical framework that can quantify the safety of personally identifiable information (privacy) while still providing a quantifable benefit (utility) to multiple legitimate information consumers. This paper presents an information-theoretic framework that promises an analytical model guaranteeing tight bounds of how much utility is possible for a given level of privacy and vice-versa. Specific contributions include: i) stochastic data models for both categorical and numerical data; ii) utility-privacy tradeoff regions and the encoding (sanization) schemes achieving them for both classes and their practical relevance; and iii) modeling of prior knowledge at the user and/or data source and optimal encoding schemes for both cases.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Cross-Flow Correlations Survive Synthesis: Measuring Source-Level Privacy Leakage in Synthetic Network Traces
cs.CR 2025-08 conditional novelty 8.0

Synthetic network generators preserve cross-flow correlations enabling source-level membership inference, shown via the TraceBleed attack across five datasets and six generators.