On the number of response regions of deep feed forward networks with piece-wise linear activations

· 2013 · cs.LG · arXiv 1312.6098

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

open full Pith review browse 7 citing papers arXiv PDF

abstract

This paper explores the complexity of deep feedforward networks with linear pre-synaptic couplings and rectified linear activations. This is a contribution to the growing body of work contrasting the representational power of deep and shallow network architectures. In particular, we offer a framework for comparing deep and shallow models that belong to the family of piecewise linear functions based on computational geometry. We look at a deep rectifier multi-layer perceptron (MLP) with linear outputs units and compare it with a single layer version of the model. In the asymptotic regime, when the number of inputs stays constant, if the shallow model has $kn$ hidden units and $n_0$ inputs, then the number of linear regions is $O(k^{n_0}n^{n_0})$. For a $k$ layer model with $n$ hidden units on each layer it is $\Omega(\left\lfloor {n}/{n_0}\right\rfloor^{k-1}n^{n_0})$. The number $\left\lfloor{n}/{n_0}\right\rfloor^{k-1}$ grows faster than $k^{n_0}$ when $n$ tends to infinity or when $k$ tends to infinity and $n \geq 2n_0$. Additionally, even when $k$ is small, if we restrict $n$ to be $2n_0$, we can show that a deep model has considerably more linear regions that a shallow one. We consider this as a first step towards understanding the complexity of these models and specifically towards providing suitable mathematical tools for future analysis.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Characterizing the Discrete Geometry of ReLU Networks

cs.LG · 2026-06-05 · unverdicted · novelty 7.0

Proves that the connectivity graph of linear regions in fully-connected ReLU networks has average degree ≤ 2×input dimension and diameter bounded independently of input dimension.

Zero-Shot Neural Network Evaluation with Sample-Wise Activation Patterns

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

SWAP-Score evaluates neural networks without training by quantifying sample-wise activation patterns, achieving high correlation with true performance on CIFAR-10 for CNNs and GLUE for Transformers while enabling fast NAS.

HyParLyVe: Hyperplane Partitioning for Neural Lyapunov Verification

eess.SY · 2026-05-05 · unverdicted · novelty 7.0

HyParLyVe verifies neural Lyapunov candidates soundly and completely by modeling shallow ReLU networks as hyperplane arrangements, enabling finite vertex evaluations for positive definiteness and bounded optimization for the decrease condition.

Sparsity is Combinatorial Depth: Quantifying MoE Expressivity via Tropical Geometry

cs.LG · 2026-02-03 · unverdicted · novelty 6.0

MoE Top-k routing equals the k-th elementary symmetric tropical polynomial, making sparsity combinatorial depth that scales capacity by binom(N,k) and gives MoE combinatorial resilience on manifolds.

A Lower Bound for the Number of Linear Regions of Ternary ReLU Regression Neural Networks

cs.LG · 2025-07-21 · unverdicted · novelty 6.0

Proves polynomial-in-width and exponential-in-depth lower bounds on linear regions for ternary ReLU regression networks, with width-doubling constructions achieving bounds comparable to unrestricted ReLU networks.

Complexity of Linear Regions in Self-supervised Deep ReLU Networks

cs.LG · 2026-04-27 · unverdicted · novelty 6.0

Self-supervised ReLU networks form substantially fewer linear regions than supervised models for comparable accuracy, with contrastive methods rapidly expanding regions and self-distillation consolidating them, enabling early geometric detection of representation collapse.

Axiomatizing Neural Networks via Pursuit of Subspaces

cs.LG · 2026-05-19 · unverdicted · novelty 5.0

Authors introduce the Pursuit of Subspaces (PoS) hypothesis, an axiomatic geometric framework that unifies explanations for representation, computation, and generalization in shallow and deep neural networks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

A Lower Bound for the Number of Linear Regions of Ternary ReLU Regression Neural Networks cs.LG · 2025-07-21 · unverdicted · none · ref 10 · internal anchor
Proves polynomial-in-width and exponential-in-depth lower bounds on linear regions for ternary ReLU regression networks, with width-doubling constructions achieving bounds comparable to unrestricted ReLU networks.

On the number of response regions of deep feed forward networks with piece-wise linear activations

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer