pith. sign in

arxiv: 1701.07011 · v2 · pith:CYKOVNPOnew · submitted 2017-01-24 · 📊 stat.ME · q-bio.MN· q-bio.QM· stat.AP

Controlling false discoveries in Bayesian gene networks with lasso regression p-values

classification 📊 stat.ME q-bio.MNq-bio.QMstat.AP
keywords bayesianlassopvnetworkslassonetworkregressionexistingfalse
0
0 comments X
read the original abstract

Bayesian networks can represent directed gene regulations and therefore are favored over co-expression networks. However, hardly any Bayesian network study concerns the false discovery control (FDC) of network edges, leading to low accuracies due to systematic biases from inconsistent false discovery levels in the same study. We design four empirical tests to examine the FDC of Bayesian networks from three p-value based lasso regression variable selections --- two existing and one we originate. Our method, lassopv, computes p-values for the critical regularization strength at which a predictor starts to contribute to lasso regression. Using null and Geuvadis datasets, we find that lassopv obtains optimal FDC in Bayesian gene networks, whilst existing methods have defective p-values. The FDC concept and tests extend to most network inference scenarios and will guide the design and improvement of new and existing methods. Our novel variable selection method with lasso regression also allows FDC on other datasets and questions, even beyond network inference and computational biology. Lassopv is implemented in R and freely available at https://github.com/lingfeiwang/lassopv and https://cran.r-project.org/package=lassopv

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.