Why m heads are better than one: Training a diverse ensemble of deep networks

· 2015 · cs.CV · arXiv 1511.06314

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Convolutional Neural Networks have achieved state-of-the-art performance on a wide range of tasks. Most benchmarks are led by ensembles of these powerful learners, but ensembling is typically treated as a post-hoc procedure implemented by averaging independently trained models with model variation induced by bagging or random initialization. In this paper, we rigorously treat ensembling as a first-class problem to explicitly address the question: what are the best strategies to create an ensemble? We first compare a large number of ensembling strategies, and then propose and evaluate novel strategies, such as parameter sharing (through a new family of models we call TreeNets) as well as training under ensemble-aware and diversity-encouraging losses. We demonstrate that TreeNets can improve ensemble performance and that diverse ensembles can be trained end-to-end under a unified loss, achieving significantly higher "oracle" accuracies than classical ensembles.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Anatomy of a failure: When, how, and why deep vision fails in scientific domains

cs.CV · 2026-05-05 · unverdicted · novelty 6.0

Deep learning on information-rich scientific images collapses to one-dimensional predictions due to a mismatch between data priors and the model's simplicity bias, even after robustification techniques.

As easy as 1, 2... 4? Uncertainty in counting tasks for medical imaging

eess.IV · 2019-07-25 · unverdicted · novelty 4.0

A multi-task network is introduced to generate narrow predictive intervals for counts in medical images while maintaining target coverage, tested on cell and white matter hyperintensity counting.

citing papers explorer

Showing 2 of 2 citing papers.

Anatomy of a failure: When, how, and why deep vision fails in scientific domains cs.CV · 2026-05-05 · unverdicted · none · ref 115
Deep learning on information-rich scientific images collapses to one-dimensional predictions due to a mismatch between data priors and the model's simplicity bias, even after robustification techniques.
As easy as 1, 2... 4? Uncertainty in counting tasks for medical imaging eess.IV · 2019-07-25 · unverdicted · none · ref 3 · internal anchor
A multi-task network is introduced to generate narrow predictive intervals for counts in medical images while maintaining target coverage, tested on cell and white matter hyperintensity counting.

Why m heads are better than one: Training a diverse ensemble of deep networks

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer