Realistic Evaluation of Deep Semi-Supervised Learning Algorithms

Augustus Odena; Avital Oliver; Colin Raffel; Ekin D. Cubuk; Ian J. Goodfellow

arxiv: 1804.09170 · v4 · pith:WDDCOUI7new · submitted 2018-04-24 · 💻 cs.LG · stat.ML

Realistic Evaluation of Deep Semi-Supervised Learning Algorithms

Avital Oliver , Augustus Odena , Colin Raffel , Ekin D. Cubuk , Ian J. Goodfellow This is my paper

classification 💻 cs.LG stat.ML

keywords unlabeledalgorithmsdataaddressdeepevaluationissueslearning

0 comments

read the original abstract

Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain. SSL algorithms based on deep neural networks have recently proven successful on standard benchmark tasks. However, we argue that these benchmarks fail to address many issues that these algorithms would face in real-world applications. After creating a unified reimplementation of various widely-used SSL techniques, we test them in a suite of experiments designed to address these issues. We find that the performance of simple baselines which do not use unlabeled data is often underreported, that SSL methods differ in sensitivity to the amount of labeled and unlabeled data, and that performance can degrade substantially when the unlabeled dataset contains out-of-class examples. To help guide SSL research towards real-world applicability, we make our unified reimplemention and evaluation platform publicly available.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

TabTransformer: Tabular Data Modeling Using Contextual Embeddings
cs.LG 2020-12 unverdicted novelty 6.0

TabTransformer uses Transformer self-attention to generate contextual embeddings from categorical features in tabular data, outperforming prior deep learning methods by at least 1% mean AUC and matching tree-based ens...