Privacy and Statistical Risk: Formalisms and Minimax Bounds

John C. Duchi; Rina Foygel Barber

arxiv: 1412.4451 · v1 · pith:EKLOMYMMnew · submitted 2014-12-15 · 🧮 math.ST · cs.IT· math.IT· stat.TH

Privacy and Statistical Risk: Formalisms and Minimax Bounds

Rina Foygel Barber , John C. Duchi This is my paper

classification 🧮 math.ST cs.ITmath.ITstat.TH

keywords privacyestimationdefinitionsboundsdifferentdisclosureriskstatistical

0 comments

read the original abstract

We explore and compare a variety of definitions for privacy and disclosure limitation in statistical estimation and data analysis, including (approximate) differential privacy, testing-based definitions of privacy, and posterior guarantees on disclosure risk. We give equivalence results between the definitions, shedding light on the relationships between different formalisms for privacy. We also take an inferential perspective, where---building off of these definitions---we provide minimax risk bounds for several estimation problems, including mean estimation, estimation of the support of a distribution, and nonparametric density estimation. These bounds highlight the statistical consequences of different definitions of privacy and provide a second lens for evaluating the advantages and disadvantages of different techniques for disclosure limitation.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

General Lower Bounds for Differentially Private Federated Learning with Arbitrary Public-Transcript Interactions
cs.LG 2026-05 unverdicted novelty 8.0

Derives a federated van Trees lower bound under total clientwise sample-level zCDP for parameter estimation with squared l2 loss in federated learning protocols with arbitrary public-transcript interactions.
Optimal Rates for Pure $\varepsilon$-Differentially Private Stochastic Convex Optimization with Heavy Tails
cs.LG 2026-04 unverdicted novelty 8.0

The minimax optimal excess-risk rate for pure ε-DP heavy-tailed SCO is characterized up to logarithmic factors, with a polynomial-time algorithm based on Lipschitz extensions of the empirical loss and a nearly matchin...
Robust Statistical Estimators with Bounded Empirical Sensitivity
math.ST 2026-05 conditional novelty 7.0

Defines empirical sensitivity and proves Ω(η + √(η d/n)) lower bound (tight up to logs) for any Gaussian mean estimator achieving optimal O(√(d/n)) ℓ₂ error.
High-Dimensional Private Linear Regression with Optimal Rates
stat.ML 2025-05 accept novelty 7.0

DP-GD achieves minimax optimal non-asymptotic risk O(γ + γ²/ρ²) for well-conditioned high-dimensional data and power-law scaling for ill-conditioned power-law spectra, with the exponent depending on the privacy parameter ρ.