Learning from Noisy Labels with Distillation

Jianchao Yang; Jiebo Luo; Liangliang Cao; Li-Jia Li; Yale Song; Yuncheng Li

arxiv: 1703.02391 · v2 · pith:AGVT2FQZnew · submitted 2017-03-07 · cs.CV · cs.LG · stat.ML

classification cs.CV cs.LG stat.ML

keywords labels, noisy, learning, label, approaches, been, distillation, domains

The ability to learn from noisy labels is very useful in many visual recognition tasks, as vast amounts of data with noisy labels are relatively easy to obtain. Traditionally, label noise has been treated as a set of statistical outliers, and approaches such as importance re-weighting and bootstrapping have been proposed to alleviate the problem. According to our observation, real-world noisy labels exhibit multi-mode characteristics, as the true labels do, rather than behaving like independent random outliers. In this work, we propose a unified distillation framework that uses side information, including a small clean dataset and label relations in a knowledge graph, to "hedge the risk" of learning from noisy labels. Furthermore, unlike traditional approaches, which are evaluated on simulated label noise, we propose a suite of new benchmark datasets, in the Sports, Species and Artifacts domains, to evaluate the task of learning from noisy labels in a practical setting. The empirical study demonstrates the effectiveness of our proposed method in all the domains.
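
The abstract does not spell out how the distillation step works, so the following is only a minimal sketch of one common way to "hedge" a noisy label with side information. It assumes the noisy one-hot label is blended with the class probabilities predicted by an auxiliary model trained on the small clean dataset. The names `distill_labels` and `cross_entropy` and the mixing weight `lam` are hypothetical and are not taken from the abstract.

```python
import math


def distill_labels(noisy_label, teacher_probs, lam=0.5):
    """Blend a noisy one-hot label with an auxiliary model's prediction.

    Assumption: the soft target is the convex combination
    lam * noisy_label + (1 - lam) * teacher_probs.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if len(noisy_label) != len(teacher_probs):
        raise ValueError("label and prediction must have the same length")
    return [lam * y + (1.0 - lam) * s for y, s in zip(noisy_label, teacher_probs)]


def cross_entropy(target, predicted, eps=1e-12):
    """Cross-entropy between a (soft) target and a predicted distribution."""
    return -sum(t * math.log(p + eps) for t, p in zip(target, predicted))


# Hypothetical three-class example: the noisy label says class 1,
# while the model trained on clean data favours class 0.
noisy = [0.0, 1.0, 0.0]
teacher = [0.7, 0.2, 0.1]
soft = distill_labels(noisy, teacher, lam=0.5)

# Loss of a hypothetical student prediction against the blended target.
student = [0.5, 0.4, 0.1]
loss = cross_entropy(soft, student)
```

With `lam=1.0` the sketch reduces to training on the noisy labels alone, and with `lam=0.0` it trusts the auxiliary model completely; intermediate values spread the risk between the two sources.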

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Self-Distillation is Optimal Among Spectral Shrinkage Estimators in Spiked Covariance Models
math.ST · 2026-05 · unverdicted · novelty 7.0

s-step self-distillation is optimal among spectral shrinkage estimators for s-spiked covariance matrices and necessary for optimality.