pith. machine review for the scientific record. sign in

arxiv: 1012.4397 · v1 · submitted 2010-12-20 · 📊 stat.ME

Recognition: unknown

Control of the False Discovery Rate Under Arbitrary Covariance Dependence

Authors on Pith no claims yet
classification 📊 stat.ME
keywords dependenceapplicationsarbitrarydiscoveryfalseapproachcommoncontrol
0
0 comments X
read the original abstract

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any genes are associated with some traits and those tests are correlated. When test statistics are correlated, false discovery control becomes very challenging under arbitrary dependence. In the current paper, we propose a new methodology based on principal factor approximation, which successfully substracts the common dependence and weakens significantly the correlation structure, to deal with an arbitrary dependence structure. We derive the theoretical distribution for false discovery proportion (FDP) in large scale multiple testing when a common threshold is used and provide a consistent FDP. This result has important applications in controlling FDR and FDP. Our estimate of FDP compares favorably with Efron (2007)'s approach, as demonstrated by in the simulated examples. Our approach is further illustrated by some real data applications.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Nonparametric f-Modeling for Empirical Bayes Inference with Unequal and Unknown Variances

    stat.ME 2026-04 unverdicted novelty 7.0

    A generalized Tweedie identity and moment-generating-function representation enable nonparametric recovery of full posteriors for heteroscedastic normal means with unknown variances without specifying a prior.