From Privacy to Generalization: Linear Max-Information Bounds for DP-SGD

Christoph H. Lampert; Hossein Zakerinia

arXiv: 2605.26222 · v1 · pith:BFKO6DMDnew · submitted 2026-05-25 · cs.LG · stat.ML


keywords: dp-sgd, generalization, bound, differentially, linear, max-information, privacy, private


Understanding the relationship between generalization and privacy remains a central challenge in modern machine learning theory, particularly for deep networks trained by variants of differentially private stochastic gradient descent (DP-SGD). In this work we make progress on this persistent open problem by proving a finite-sample bound on the approximate max-information of DP-SGD whose scaling is comparable to that of the classic result of Dwork et al. (2015) for $\epsilon$-differentially private algorithms, namely at most linear in the dataset size. From our result we obtain a general-purpose PAC-Bayes generalization bound in which the necessary prior distribution can be learned by DP-SGD, as well as a generalization bound for DP-SGD-trained models themselves, with a complexity term that is fully explicit and controlled by the optimization hyperparameters.
