AdaPT: An interactive procedure for multiple testing with side information

Lihua Lei; William Fithian

arxiv: 1609.06035 · v4 · pith:YKFMWBEJnew · submitted 2016-09-20 · 📊 stat.ME

AdaPT: An interactive procedure for multiple testing with side information

Lihua Lei , William Fithian This is my paper

classification 📊 stat.ME

keywords thresholdinformationprocedureadapthypothesismultiplep-valuetesting

0 comments

read the original abstract

We consider the problem of multiple hypothesis testing with generic side information: for each hypothesis $H_i$ we observe both a p-value $p_i$ and some predictor $x_i$ encoding contextual information about the hypothesis. For large-scale problems, adaptively focusing power on the more promising hypotheses (those more likely to yield discoveries) can lead to much more powerful multiple testing procedures. We propose a general iterative framework for this problem, called the Adaptive p-value Thresholding (AdaPT) procedure, which adaptively estimates a Bayes-optimal p-value rejection threshold and controls the false discovery rate (FDR) in finite samples. At each iteration of the procedure, the analyst proposes a rejection threshold and observes partially censored p-values, estimates the false discovery proportion (FDP) below the threshold, and either stops to reject or proposes another threshold, until the estimated FDP is below $\alpha$. Our procedure is adaptive in an unusually strong sense, permitting the analyst to use any statistical or machine learning method she chooses to estimate the optimal threshold, and to switch between different models at each iteration as information accrues. We demonstrate the favorable performance of AdaPT by comparing it to state-of-the-art methods in five real applications and two simulation studies.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Controlling False Discovery in Arbitrarily Structured Hypothesis Spaces via Reproducing Kernels
stat.ME 2026-05 unverdicted novelty 5.0

A kernel-based regularized learning framework for FDR control that unifies arbitrary structures and supplies provably valid decision rules with likelihood-based tuning.