pith. sign in

arxiv: 1810.03372 · v1 · pith:KXQ2F5Z7new · submitted 2018-10-08 · 💻 cs.LG · stat.ML

Detecting Memorization in ReLU Networks

classification 💻 cs.LG stat.ML
keywords memorizationnon-linearityactivationapplyingfindmatrixnetworksnon-negative
0
0 comments X
read the original abstract

We propose a new notion of `non-linearity' of a network layer with respect to an input batch that is based on its proximity to a linear system, which is reflected in the non-negative rank of the activation matrix. We measure this non-linearity by applying non-negative factorization to the activation matrix. Considering batches of similar samples, we find that high non-linearity in deep layers is indicative of memorization. Furthermore, by applying our approach layer-by-layer, we find that the mechanism for memorization consists of distinct phases. We perform experiments on fully-connected and convolutional neural networks trained on several image and audio datasets. Our results demonstrate that as an indicator for memorization, our technique can be used to perform early stopping.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.