Generalization Error Bounds with Probabilistic Guarantee for SGD in Nonconvex Optimization

Huishuai Zhang; Yingbin Liang; Yi Zhou

arxiv: 1802.06903 · v3 · pith:XOIJSSPFnew · submitted 2018-02-19 · 📊 stat.ML · cs.LG· math.OC

Generalization Error Bounds with Probabilistic Guarantee for SGD in Nonconvex Optimization

Yi Zhou , Yingbin Liang , Huishuai Zhang This is my paper

classification 📊 stat.ML cs.LGmath.OC

keywords generalizationerrorboundsfunctionslossnonconvexguaranteestability

0 comments

read the original abstract

The success of deep learning has led to a rising interest in the generalization property of the stochastic gradient descent (SGD) method, and stability is one popular approach to study it. Existing works based on stability have studied nonconvex loss functions, but only considered the generalization error of the SGD in expectation. In this paper, we establish various generalization error bounds with probabilistic guarantee for the SGD. Specifically, for both general nonconvex loss functions and gradient dominant loss functions, we characterize the on-average stability of the iterates generated by SGD in terms of the on-average variance of the stochastic gradients. Such characterization leads to improved bounds for the generalization error for SGD. We then study the regularized risk minimization problem with strongly convex regularizers, and obtain improved generalization error bounds for proximal SGD. With strongly convex regularizers, we further establish the generalization error bounds for nonconvex loss functions under proximal SGD with high-probability guarantee, i.e., exponential concentration in probability.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
cs.LG 2024-01 unverdicted novelty 6.0

SPIN lets weak LLMs become strong by self-generating training data from previous model versions and training to prefer human-annotated responses over its own outputs, outperforming DPO even with extra GPT-4 data on be...