pith. sign in

arxiv: 1711.00941 · v1 · pith:XB3EZZ2Rnew · submitted 2017-11-02 · 💻 cs.LG

Deep Active Learning over the Long Tail

classification 💻 cs.LG
keywords learningsamplingactivealgorithmdeepneuralrandomuncertainty
0
0 comments X
read the original abstract

This paper is concerned with pool-based active learning for deep neural networks. Motivated by coreset dataset compression ideas, we present a novel active learning algorithm that queries consecutive points from the pool using farthest-first traversals in the space of neural activation over a representation layer. We show consistent and overwhelming improvement in sample complexity over passive learning (random sampling) for three datasets: MNIST, CIFAR-10, and CIFAR-100. In addition, our algorithm outperforms the traditional uncertainty sampling technique (obtained using softmax activations), and we identify cases where uncertainty sampling is only slightly better than random sampling.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. TinyUSFM: Towards Compact and Efficient Ultrasound Foundation Models

    eess.IV 2025-10 unverdicted novelty 6.0

    TinyUSFM distills a large ultrasound foundation model into a lightweight version using feature-gradient coreset selection and domain-separated masked image modeling, matching performance on a new 18-dataset benchmark ...

  2. Discriminative Active Learning

    cs.LG 2019-07 unverdicted novelty 6.0

    DAL poses batch active learning as a binary classification task between labeled and unlabeled data to select informative examples for labeling.

  3. Are Candidate Models Really Needed for Active Learning?

    cs.CV 2026-05 unverdicted novelty 5.0

    Active learning with randomly initialized models achieves comparable results to traditional candidate-model methods, with low-confidence sampling proving most effective.