Reducing Overfitting in Deep Networks by Decorrelating Representations

Dhruv Batra; Faruk Ahmed; Larry Zitnick; Michael Cogswell; Ross Girshick

arxiv: 1511.06068 · v4 · pith:KWGEBX3Tnew · submitted 2015-11-19 · 💻 cs.LG · stat.ML

Reducing Overfitting in Deep Networks by Decorrelating Representations

Michael Cogswell , Faruk Ahmed , Ross Girshick , Larry Zitnick , Dhruv Batra This is my paper

classification 💻 cs.LG stat.ML

keywords overfittingbeendeepnetworksperformanceregularizeralwaysdata

0 comments

read the original abstract

One major challenge in training Deep Neural Networks is preventing overfitting. Many techniques such as data augmentation and novel regularizers such as Dropout have been proposed to prevent overfitting without requiring a massive amount of training data. In this work, we propose a new regularizer called DeCov which leads to significantly reduced overfitting (as indicated by the difference between train and val performance), and better generalization. Our regularizer encourages diverse or non-redundant representations in Deep Neural Networks by minimizing the cross-covariance of hidden activations. This simple intuition has been explored in a number of past works but surprisingly has never been applied as a regularizer in supervised learning. Experiments across a range of datasets and network architectures show that this loss always reduces overfitting while almost always maintaining or increasing generalization performance and often improving performance over Dropout.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Discovering Data Encoding Strategies for Quantum-Classical Neural Networks Using Monte Carlo Tree Search
quant-ph 2026-05 conditional novelty 7.0

MCTS discovers superior data encoding circuits for QCCNNs that outperform standard encodings on medical datasets, with effective rank of feature maps serving as a performance predictor.
Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities
cs.LG 2026-05 conditional novelty 4.0

Model collapse threatens AI democratization by disproportionately degrading data and efficiency for low-resource communities.