Interpretable global minima of deep ReLU neural networks on sequentially separable data

Patrícia Muñoz Ewald; Thomas Chen

arxiv: 2405.07098 · v3 · pith:HEOB7ZAMnew · submitted 2024-05-11 · 💻 cs.LG · cs.AI · math-ph · math.MP · math.OC · stat.ML

keywords: data, classes, global, neural, parameters, separable, sequentially, acting

We explicitly construct zero loss neural network classifiers. We write the weight matrices and bias vectors in terms of cumulative parameters, which determine truncation maps acting recursively on input space. The configurations for the training data considered are (i) sufficiently small, well separated clusters corresponding to each class, and (ii) equivalence classes which are sequentially linearly separable. In the best case, for $Q$ classes of data in $\mathbb{R}^M$, global minimizers can be described with $Q(M+2)$ parameters.
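As a rough illustration of two ideas in the abstract, the sketch below computes the best-case parameter count $Q(M+2)$ and applies one truncation map to a point. The abstract does not spell out the map, so the form used here, $\tau_{W,b}(x) = W^{-1}(\sigma(Wx+b) - b)$ with $W$ square and invertible and $\sigma$ the ReLU, is an assumption made for illustration, and the names `parameter_count` and `truncation_map` are hypothetical.

```python
import numpy as np


def parameter_count(Q: int, M: int) -> int:
    """Best-case count Q(M+2) quoted in the abstract for Q classes in R^M."""
    return Q * (M + 2)


def relu(x: np.ndarray) -> np.ndarray:
    """Componentwise ReLU activation."""
    return np.maximum(x, 0.0)


def truncation_map(W: np.ndarray, b: np.ndarray, x: np.ndarray) -> np.ndarray:
    """ASSUMED form of a truncation map: W^{-1} (relu(W x + b) - b).

    W is taken to be square and invertible. Coordinates of W x + b that are
    already nonnegative pass through unchanged; negative ones are clipped.
    """
    return np.linalg.solve(W, relu(W @ x + b) - b)


if __name__ == "__main__":
    # Hypothetical sizes: 10 classes of data in R^784.
    print(parameter_count(10, 784))

    # Toy example in R^2 with W = identity and b = (1, 1).
    W = np.eye(2)
    b = np.array([1.0, 1.0])
    x = np.array([1.0, -2.0])
    print(truncation_map(W, b, x))
```

Under this assumed form, a point with $Wx + b \ge 0$ in every coordinate is left fixed, while other points are moved onto the boundary of that region. Composing such maps layer by layer is one way to read the abstract's "acting recursively on input space".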
