pith. machine review for the scientific record. sign in

arxiv: 1306.3171 · v2 · submitted 2013-06-13 · 📊 stat.ME · cs.IT· cs.LG· math.IT

Recognition: unknown

Confidence Intervals and Hypothesis Testing for High-Dimensional Regression

Authors on Pith no claims yet
classification 📊 stat.ME cs.ITcs.LGmath.IT
keywords confidenceintervalsparameterhigh-dimensionalcertainconstructingdatahypothesis
0
0 comments X
read the original abstract

Fitting high-dimensional statistical models often requires the use of non-linear parameter estimation procedures. As a consequence, it is generally impossible to obtain an exact characterization of the probability distribution of the parameter estimates. This in turn implies that it is extremely challenging to quantify the \emph{uncertainty} associated with a certain parameter estimate. Concretely, no commonly accepted procedure exists for computing classical measures of uncertainty and statistical significance as confidence intervals or $p$-values for these models. We consider here high-dimensional linear regression problem, and propose an efficient algorithm for constructing confidence intervals and $p$-values. The resulting confidence intervals have nearly optimal size. When testing for the null hypothesis that a certain parameter is vanishing, our method has nearly optimal power. Our approach is based on constructing a `de-biased' version of regularized M-estimators. The new construction improves over recent work in the field in that it does not assume a special structure on the design matrix. We test our method on synthetic data and a high-throughput genomic data set about riboflavin production rate.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Nonparametric f-Modeling for Empirical Bayes Inference with Unequal and Unknown Variances

    stat.ME 2026-04 unverdicted novelty 7.0

    A generalized Tweedie identity and moment-generating-function representation enable nonparametric recovery of full posteriors for heteroscedastic normal means with unknown variances without specifying a prior.