arxiv: 1603.00988 · v4 · pith:RLSOPAARnew · submitted 2016-03-03 · 💻 cs.LG

Learning Functions: When Is Deep Better Than Shallow

classification 💻 cs.LG
keywords networks, deep, shallow, class, functions, hierarchical, accuracy, algorithms

While the universal approximation property holds for both hierarchical and shallow networks, we prove that deep (hierarchical) networks can approximate the class of compositional functions with the same accuracy as shallow networks but with an exponentially smaller number of training parameters and an exponentially lower VC-dimension. This theorem settles an old conjecture by Bengio on the role of depth in networks. We then define a general class of scalable, shift-invariant algorithms to show a simple and natural set of requirements that justify deep convolutional networks.
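
The abstract does not spell out what a compositional function looks like. As a rough illustration only, the sketch below assumes the binary-tree form often used for this class, in which a function of many variables is built by repeatedly applying a two-variable constituent function to adjacent pairs. The names `compose_binary_tree` and `h` are hypothetical and chosen for this example, and the particular constituent is arbitrary.

```python
from typing import Callable, Sequence


def compose_binary_tree(
    xs: Sequence[float], h: Callable[[float, float], float]
) -> float:
    """Evaluate a binary-tree compositional function on the inputs xs.

    Assumption: the number of inputs is a power of two, and the same
    two-variable constituent h is applied at every node of the tree.
    """
    level = list(xs)
    # Reject empty input and lengths that are not a power of two.
    if not level or len(level) & (len(level) - 1):
        raise ValueError("number of inputs must be a power of two")
    # Combine adjacent pairs level by level until one value remains.
    while len(level) > 1:
        level = [h(level[i], level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]


def h(a: float, b: float) -> float:
    """Hypothetical constituent function, picked only for illustration."""
    return a * b + 1.0


# For four inputs this computes h(h(x1, x2), h(x3, x4)).
value = compose_binary_tree([1.0, 2.0, 3.0, 4.0], h)
print(value)
```

Each node in this structure depends on only two inputs, which is the kind of hierarchical locality a deep network can mirror layer by layer, whereas a shallow network has to treat the function as a generic function of all its variables at once.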

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Fixed-order PCA: Theory for Overestimated Factor Models

    math.ST 2026-05 unverdicted novelty 7.0

    Establishes asymptotic consistency of factor estimates and √T-normality in factor-augmented regressions for fixed R ≥ r using anisotropic local laws from random matrix theory.