
arxiv: 1806.05159 · v4 · pith:JJXBUXFNnew · submitted 2018-06-13 · 💻 cs.LG · stat.ML

On Tighter Generalization Bound for Deep Neural Networks: CNNs, ResNets, and Beyond

keywords: networks, neural, generalization, bound, bounds, deep, cnns, family

We establish a margin-based, data-dependent generalization error bound for a general family of deep neural networks in terms of the depth and width, as well as the Jacobian, of the networks. By introducing a new characterization of the Lipschitz properties of the neural network family, we achieve significantly tighter generalization bounds than existing results. Moreover, we show that the generalization bound can be further improved for bounded losses. Beyond general feedforward deep neural networks, our results can be applied to derive new bounds for popular architectures, including convolutional neural networks (CNNs) and residual networks (ResNets). To achieve the same generalization error as prior work, our bounds permit larger parameter spaces for the weight matrices, potentially giving the networks greater expressive power. Numerical evaluation is also provided to support our theory.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Overfitting has a limitation: a model-independent generalization gap bound based on Rényi entropy

    stat.ML 2025-05 unverdicted novelty 6.0

    A model-independent upper bound on the generalization gap is established for histogram-determined algorithms such as ERM; the bound depends solely on the Rényi entropy of the data-generating distribution.