A More Powerful Two-Sample Test in High Dimensions using Random Projection
read the original abstract
We consider the hypothesis testing problem of detecting a shift between the means of two multivariate normal distributions in the high-dimensional setting, allowing for the data dimension p to exceed the sample size n. Specifically, we propose a new test statistic for the two-sample test of means that integrates a random projection with the classical Hotelling T^2 statistic. Working under a high-dimensional framework with (p,n) tending to infinity, we first derive an asymptotic power function for our test, and then provide sufficient conditions for it to achieve greater power than other state-of-the-art tests. Using ROC curves generated from synthetic data, we demonstrate superior performance against competing tests in the parameter regimes anticipated by our theoretical results. Lastly, we illustrate an advantage of our procedure's false positive rate with comparisons on high-dimensional gene expression data involving the discrimination of different types of cancer.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Tests for the mean of high-dimensional data
Bootstrap test for high-dimensional mean using squared-norm statistic V_n with asymptotic level-alpha validity via l2 embedding and a new CLT, without sparsity or covariance structure assumptions.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.