From the Information Bottleneck to the Privacy Funnel

Ali Makhdoumi; Muriel Medard; Nadia Fawaz; Salman Salamatian

arxiv: 1402.1774 · v5 · pith:ADZGQMABnew · submitted 2014-02-07 · 💻 cs.IT · math.IT

From the Information Bottleneck to the Privacy Funnel

Ali Makhdoumi , Salman Salamatian , Nadia Fawaz , Muriel Medard This is my paper

classification 💻 cs.IT math.IT

keywords dataprivacyinformationdisclosedunderlog-lossmetricprivate

0 comments

read the original abstract

We focus on the privacy-utility trade-off encountered by users who wish to disclose some information to an analyst, that is correlated with their private data, in the hope of receiving some utility. We rely on a general privacy statistical inference framework, under which data is transformed before it is disclosed, according to a probabilistic privacy mapping. We show that when the log-loss is introduced in this framework in both the privacy metric and the distortion metric, the privacy leakage and the utility constraint can be reduced to the mutual information between private data and disclosed data, and between non-private data and disclosed data respectively. We justify the relevance and generality of the privacy metric under the log-loss by proving that the inference threat under any bounded cost function can be upper-bounded by an explicit function of the mutual information between private data and disclosed data. We then show that the privacy-utility tradeoff under the log-loss can be cast as the non-convex Privacy Funnel optimization, and we leverage its connection to the Information Bottleneck, to provide a greedy algorithm that is locally optimal. We evaluate its performance on the US census dataset.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

On What We Can Learn from Low-Resolution Data
cs.LG 2026-05 unverdicted novelty 6.0

Low-resolution data improves high-resolution model performance when high-resolution samples are limited, via KL-divergence bounds and experiments on vision transformers and CNNs.