pith. sign in

arxiv: 1603.09260 · v2 · pith:NHLFEFMNnew · submitted 2016-03-30 · 💻 cs.LG · stat.ML

Degrees of Freedom in Deep Neural Networks

classification 💻 cs.LG stat.ML
keywords degreesfreedomnetworksdeepneuralerrorexpectednetwork
0
0 comments X p. Extension
pith:NHLFEFMN Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{NHLFEFMN}

Prints a linked pith:NHLFEFMN badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

In this paper, we explore degrees of freedom in deep sigmoidal neural networks. We show that the degrees of freedom in these models is related to the expected optimism, which is the expected difference between test error and training error. We provide an efficient Monte-Carlo method to estimate the degrees of freedom for multi-class classification methods. We show degrees of freedom are lower than the parameter count in a simple XOR network. We extend these results to neural nets trained on synthetic and real data, and investigate impact of network's architecture and different regularization choices. The degrees of freedom in deep networks are dramatically smaller than the number of parameters, in some real datasets several orders of magnitude. Further, we observe that for fixed number of parameters, deeper networks have less degrees of freedom exhibiting a regularization-by-depth.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Geometric Analysis of Neural Regression Collapse via Intrinsic Dimension

    cs.LG 2025-10 unverdicted novelty 5.0

    Neural regression collapse occurs when last-layer feature intrinsic dimension falls below target intrinsic dimension, creating over-compressed and under-compressed regimes that govern generalization based on data quan...