
arxiv: 2305.09903 · v1 · pith:6ASAKQZPnew · submitted 2023-05-17 · 💻 cs.LG · cs.CR · cs.IT · math.IT · math.OC

Privacy Loss of Noisy Stochastic Gradient Descent Might Converge Even for Non-Convex Losses

keywords: loss, privacy, dp-sgd, algorithm, analyze, bounds, convexity, findings

The Noisy-SGD algorithm is widely used for privately training machine learning models. Traditional privacy analyses of this algorithm assume that the internal state is publicly revealed, resulting in privacy loss bounds that increase indefinitely with the number of iterations. However, recent findings have shown that if the internal state remains hidden, then the privacy loss might remain bounded. Nevertheless, this remarkable result heavily relies on the assumption of (strong) convexity of the loss function. It remains an important open problem to further relax this condition while proving similar convergent upper bounds on the privacy loss. In this work, we address this problem for DP-SGD, a popular variant of Noisy-SGD that incorporates gradient clipping to limit the impact of individual samples on the training process. Our findings demonstrate that the privacy loss of projected DP-SGD converges exponentially fast, without requiring convexity or smoothness assumptions on the loss function. In addition, we analyze the privacy loss of regularized (unprojected) DP-SGD. To obtain these results, we directly analyze the hockey-stick divergence between coupled stochastic processes by relying on non-linear data processing inequalities.
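To make the objects in the abstract concrete, the following is a minimal illustrative sketch of projected DP-SGD with per-sample gradient clipping, together with the hockey-stick divergence for discrete distributions. It is not the paper's implementation. The function names, the projection onto an L2 ball, the noise scaling `sigma * clip_norm`, and sampling without replacement are all assumptions chosen for illustration. Only the final iterate is returned, which mirrors the hidden-state setting in which intermediate iterates are not released.

```python
import numpy as np


def clip(g, c):
    """Rescale g so that its L2 norm is at most c (gradient clipping)."""
    norm = np.linalg.norm(g)
    return g if norm <= c else g * (c / norm)


def project_ball(w, radius):
    """Project w onto the L2 ball of the given radius (assumed constraint set)."""
    norm = np.linalg.norm(w)
    return w if norm <= radius else w * (radius / norm)


def projected_dp_sgd(w0, data, grad_fn, steps, lr, clip_norm, sigma,
                     radius, batch_size, rng):
    """Run projected DP-SGD and return only the last iterate.

    grad_fn(w, x) returns the per-sample gradient; the loss need not be
    convex or smooth. All hyperparameter names here are hypothetical.
    """
    w = project_ball(np.asarray(w0, dtype=float), radius)
    n = len(data)
    for _ in range(steps):
        # Sample a mini-batch without replacement (an assumed sampling scheme).
        idx = rng.choice(n, size=batch_size, replace=False)
        g = np.zeros_like(w)
        for i in idx:
            g += clip(grad_fn(w, data[i]), clip_norm)
        # Gaussian noise calibrated to the clipping norm (assumed scaling).
        noise = rng.normal(0.0, sigma * clip_norm, size=w.shape)
        w = project_ball(w - lr * (g + noise) / batch_size, radius)
    return w


def hockey_stick(p, q, gamma):
    """Hockey-stick divergence E_gamma(P || Q) = sum_x max(p(x) - gamma*q(x), 0)
    for discrete distributions given as probability vectors."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(np.maximum(p - gamma * q, 0.0).sum())
```

A mechanism is (ε, δ)-differentially private exactly when the hockey-stick divergence of order γ = e^ε between its output distributions on any two neighboring datasets is at most δ, which is why the abstract frames the analysis in terms of this divergence. In the sketch, `hockey_stick` only handles finite discrete distributions; the continuous processes studied in the paper require an analytical treatment.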



Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Privacy Amplification in Differentially Private Zeroth-Order Optimization with Hidden States

    cs.LG 2025-05 unverdicted novelty 8.0

    Introduces hybrid noise and novel coupling analysis to achieve the first convergent hidden-state DP bound for zeroth-order optimization.

  2. Local and Global Contraction Principles for MCMC Mixing

    cs.IT 2026-06 unverdicted novelty 7.0

    Introduces global and local contraction coefficients under E_γ-divergence to derive explicit mixing-time bounds for projected Langevin Monte Carlo and independent Metropolis-Hastings, including heavy-tailed cases.