hub

Deep batch active learning by diverse, uncertain gradient lower bounds

Jordan T Ash, Chicheng Zhang, Akshay Krishnamurthy, John Langford, Alekh Agarwal · 1906 · arXiv 1906.03671

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

read on arXiv browse 13 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3

citation-polarity summary

background 2 unclear 1

representative citing papers

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

MASS-DPO derives a Plackett-Luce-specific log-determinant Fisher information objective to select non-redundant negative samples, matching or exceeding multi-negative DPO performance with substantially fewer negatives across four benchmarks and three model families.

Clip-level Uncertainty and Temporal-aware Active Learning for End-to-End Multi-Object Tracking

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

CUTAL scores multi-frame clips for uncertainty and enforces temporal diversity to train transformer MOT models to near full-supervision performance with 50% of the labels.

UnIte: Uncertainty-based Iterative Document Sampling for Domain Adaptation in Information Retrieval

cs.IR · 2026-04-28 · unverdicted · novelty 7.0

UnIte selects target-domain documents for pseudo-query generation by filtering high aleatoric uncertainty and prioritizing high epistemic uncertainty, yielding +2.45 to +3.49 nDCG@10 gains on BEIR with ~4k samples.

Active Learning with Selective Time-Step Acquisition for PDEs

cs.LG · 2025-11-22 · unverdicted · novelty 7.0

STAP reduces training data costs for PDE surrogates by selectively acquiring key time steps per trajectory instead of full simulations.

Towards Multimodal Active Learning: Efficient Learning with Limited Paired Data

cs.LG · 2025-09-25 · unverdicted · novelty 7.0

Introduces the first active learning framework for unaligned multimodal data that selects alignments using uncertainty and diversity to cut annotation costs by up to 40% on benchmarks while preserving accuracy.

Active Statistical Inference

stat.ML · 2024-03-05 · unverdicted · novelty 7.0

Active inference adapts label collection via ML uncertainty to deliver valid statistical inference with substantially fewer samples than standard non-adaptive methods across any data distribution.

Fine-Tuning Language Models from Human Preferences

cs.CL · 2019-09-18 · unverdicted · novelty 7.0

Language models fine-tuned via RL on 5k-60k human preference comparisons produce stylistically better text continuations and human-preferred summaries that sometimes copy input sentences.

Select Smarter, Not More: Prompt-Aware Evaluation Scheduling with Submodular Guarantees

cs.AI · 2026-04-13 · unverdicted · novelty 6.0

POES frames prompt evaluation as online adaptive testing and uses a provably submodular objective to pick informative examples, delivering 6.2% higher average accuracy and 35-60% token savings versus naive full-set scoring.

Are Candidate Models Really Needed for Active Learning?

cs.CV · 2026-05-14 · unverdicted · novelty 5.0

Active learning with randomly initialized models achieves comparable results to traditional candidate-model methods, with low-confidence sampling proving most effective.

Uncertainty-Guided Edge Learning for Deep Image Regression in Remote Sensing

cs.CV · 2026-05-07 · unverdicted · novelty 5.0

UGEL employs deep beta regression to estimate uncertainty in one forward pass, enabling faster convergence in edge learning for remote sensing image regression than active or semi-supervised baselines.

Neural Operator Representation of Granular Micromechanics-based Failure Envelope

physics.comp-ph · 2026-04-21 · unverdicted · novelty 5.0

A differentiable neural operator learns the mapping from granular microstructure configurations to failure envelopes, with physics-informed convexity enforcement and active learning for efficient training.

Labeled TrustSet Guided: Batch Active Learning with Reinforcement Learning

cs.LG · 2026-04-14 · unverdicted · novelty 5.0

BRAL-T uses TrustSet-guided reinforcement learning for batch active learning and reports state-of-the-art results on 10 image classification benchmarks plus 2 fine-tuning tasks.

ShieldGemma: Generative AI Content Moderation Based on Gemma

cs.CL · 2024-07-31 · unverdicted · novelty 4.0

ShieldGemma delivers a family of Gemma2-based classifiers that outperform Llama Guard and WildCard on public safety benchmarks while introducing a synthetic-data curation pipeline for safety tasks.

citing papers explorer

Showing 13 of 13 citing papers.

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization cs.LG · 2026-05-11 · unverdicted · none · ref 5
MASS-DPO derives a Plackett-Luce-specific log-determinant Fisher information objective to select non-redundant negative samples, matching or exceeding multi-negative DPO performance with substantially fewer negatives across four benchmarks and three model families.
Clip-level Uncertainty and Temporal-aware Active Learning for End-to-End Multi-Object Tracking cs.CV · 2026-05-11 · unverdicted · none · ref 15
CUTAL scores multi-frame clips for uncertainty and enforces temporal diversity to train transformer MOT models to near full-supervision performance with 50% of the labels.
UnIte: Uncertainty-based Iterative Document Sampling for Domain Adaptation in Information Retrieval cs.IR · 2026-04-28 · unverdicted · none · ref 2
UnIte selects target-domain documents for pseudo-query generation by filtering high aleatoric uncertainty and prioritizing high epistemic uncertainty, yielding +2.45 to +3.49 nDCG@10 gains on BEIR with ~4k samples.
Active Learning with Selective Time-Step Acquisition for PDEs cs.LG · 2025-11-22 · unverdicted · none · ref 1
STAP reduces training data costs for PDE surrogates by selectively acquiring key time steps per trajectory instead of full simulations.
Towards Multimodal Active Learning: Efficient Learning with Limited Paired Data cs.LG · 2025-09-25 · unverdicted · none · ref 1
Introduces the first active learning framework for unaligned multimodal data that selects alignments using uncertainty and diversity to cut annotation costs by up to 40% on benchmarks while preserving accuracy.
Active Statistical Inference stat.ML · 2024-03-05 · unverdicted · none · ref 4
Active inference adapts label collection via ML uncertainty to deliver valid statistical inference with substantially fewer samples than standard non-adaptive methods across any data distribution.
Fine-Tuning Language Models from Human Preferences cs.CL · 2019-09-18 · unverdicted · none · ref 1
Language models fine-tuned via RL on 5k-60k human preference comparisons produce stylistically better text continuations and human-preferred summaries that sometimes copy input sentences.
Select Smarter, Not More: Prompt-Aware Evaluation Scheduling with Submodular Guarantees cs.AI · 2026-04-13 · unverdicted · none · ref 37
POES frames prompt evaluation as online adaptive testing and uses a provably submodular objective to pick informative examples, delivering 6.2% higher average accuracy and 35-60% token savings versus naive full-set scoring.
Are Candidate Models Really Needed for Active Learning? cs.CV · 2026-05-14 · unverdicted · none · ref 151
Active learning with randomly initialized models achieves comparable results to traditional candidate-model methods, with low-confidence sampling proving most effective.
Uncertainty-Guided Edge Learning for Deep Image Regression in Remote Sensing cs.CV · 2026-05-07 · unverdicted · none · ref 4
UGEL employs deep beta regression to estimate uncertainty in one forward pass, enabling faster convergence in edge learning for remote sensing image regression than active or semi-supervised baselines.
Neural Operator Representation of Granular Micromechanics-based Failure Envelope physics.comp-ph · 2026-04-21 · unverdicted · none · ref 1
A differentiable neural operator learns the mapping from granular microstructure configurations to failure envelopes, with physics-informed convexity enforcement and active learning for efficient training.
Labeled TrustSet Guided: Batch Active Learning with Reinforcement Learning cs.LG · 2026-04-14 · unverdicted · none · ref 1
BRAL-T uses TrustSet-guided reinforcement learning for batch active learning and reports state-of-the-art results on 10 image classification benchmarks plus 2 fine-tuning tasks.
ShieldGemma: Generative AI Content Moderation Based on Gemma cs.CL · 2024-07-31 · unverdicted · none · ref 2
ShieldGemma delivers a family of Gemma2-based classifiers that outperform Llama Guard and WildCard on public safety benchmarks while introducing a synthetic-data curation pipeline for safety tasks.

Deep batch active learning by diverse, uncertain gradient lower bounds

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer