pith. sign in

arxiv: 1906.07975 · v1 · pith:QTRMC44Knew · submitted 2019-06-19 · 💻 cs.LG · stat.ML

Batch Active Learning Using Determinantal Point Processes

classification 💻 cs.LG stat.ML
keywords learningactivebatchdatasamplescomputationalmethodspoint
0
0 comments X
read the original abstract

Data collection and labeling is one of the main challenges in employing machine learning algorithms in a variety of real-world applications with limited data. While active learning methods attempt to tackle this issue by labeling only the data samples that give high information, they generally suffer from large computational costs and are impractical in settings where data can be collected in parallel. Batch active learning methods attempt to overcome this computational burden by querying batches of samples at a time. To avoid redundancy between samples, previous works rely on some ad hoc combination of sample quality and diversity. In this paper, we present a new principled batch active learning method using Determinantal Point Processes, a repulsive point process that enables generating diverse batches of samples. We develop tractable algorithms to approximate the mode of a DPP distribution, and provide theoretical guarantees on the degree of approximation. We further demonstrate that an iterative greedy method for DPP maximization, which has lower computational costs but worse theoretical guarantees, still gives competitive results for batch active learning. Our experiments show the value of our methods on several datasets against state-of-the-art baselines.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Active learning for photonic crystals

    physics.optics 2026-01 unverdicted novelty 5.0

    Analytic LL-BNN active learning achieves up to 2.7x reduction in training data for band gap prediction in 2D two-tone photonic crystals while maintaining accuracy.