pith. sign in

arxiv: 2605.26222 · v1 · pith:BFKO6DMDnew · submitted 2026-05-25 · 💻 cs.LG · stat.ML

From Privacy to Generalization: Linear Max-Information Bounds for DP-SGD

classification 💻 cs.LG stat.ML
keywords dp-sgdgeneralizationbounddifferentiallylinearmax-informationprivacyprivate
0
0 comments X
read the original abstract

Understanding the relationship between generalization and privacy remains a central challenge in modern machine learning theory, particularly for deep networks trained by variants of differentially private stochastic gradient descent (DP-SGD). In this work we make progress on this persistent open problem by proving a finite-sample bound on the approximate max-information of DP-SGD that exhibits scaling properties comparable with (Dwork et al, 2015)'s classic result for $\epsilon$-differentially private algorithms, namely at most linear in the dataset size. From our result we obtain a general-purpose PAC-Bayes generalization bound in which the necessary prior distribution can be learned by DP-SGD, as well as a generalization bound for DP-SGD-trained models themselves, with a complexity term that is fully explicit and controlled by the optimization hyperparameters.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.