arxiv: 1712.00336 · v1 · pith:RY7XGB4Rnew · submitted 2017-12-01 · 🧬 q-bio.QM

BioMM: Biologically-informed Multi-stage Machine learning for identification of epigenetic fingerprints

classification 🧬 q-bio.QM
keywords biomm · data · learning · machine · biological · high-dimensional · approaches · framework

The identification of reproducible biological patterns from high-dimensional data is a bottleneck for understanding the biology of complex illnesses such as schizophrenia. To address this, we developed a biologically informed, multi-stage machine learning (BioMM) framework. BioMM incorporates biological pathway information to stratify and aggregate high-dimensional biological data. We demonstrate the utility of this method using genome-wide DNA methylation data and show that it substantially outperforms conventional machine learning approaches. The BioMM framework may therefore be a fruitful machine learning strategy for high-dimensional data and a basis for future integrative analysis approaches.
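
The abstract says only that pathway information is used to stratify and aggregate the data. It does not specify the learners or the aggregation rule. As a rough illustration of what a two-stage scheme of this kind could look like, the sketch below groups features by pathway, reduces each pathway to one score per sample (stage 1), and then fits a second learner on those pathway-level scores (stage 2). Everything in it is an assumption made for illustration: the function names `centroid_scores`, `select_columns`, and `two_stage_predict`, the nearest-centroid base learner, the pathway names, and the toy data are hypothetical and are not taken from the paper.

```python
from statistics import mean


def centroid_scores(train_X, train_y, X):
    """Score each row of X by how much closer it lies to the class-1 centroid.

    Hypothetical stand-in base learner: a positive score means the row is
    closer to the class-1 centroid than to the class-0 centroid.
    """
    cols = range(len(train_X[0]))
    c0 = [mean(r[j] for r, y in zip(train_X, train_y) if y == 0) for j in cols]
    c1 = [mean(r[j] for r, y in zip(train_X, train_y) if y == 1) for j in cols]
    scores = []
    for row in X:
        d0 = sum((row[j] - c0[j]) ** 2 for j in cols)
        d1 = sum((row[j] - c1[j]) ** 2 for j in cols)
        scores.append(d0 - d1)
    return scores


def select_columns(X, idx):
    """Keep only the feature columns listed in idx."""
    return [[row[j] for j in idx] for row in X]


def two_stage_predict(train_X, train_y, test_X, pathways):
    """Two-stage prediction: per-pathway scores, then a learner on those scores."""
    train_Z = [[] for _ in train_X]
    test_Z = [[] for _ in test_X]
    # Stage 1: stratify features by pathway and reduce each pathway to one score.
    for name in sorted(pathways):
        idx = pathways[name]
        sub_train = select_columns(train_X, idx)
        sub_test = select_columns(test_X, idx)
        # Simplification (assumption): training scores are resubstituted here;
        # a careful version would use cross-validated scores to limit optimism.
        for z, s in zip(train_Z, centroid_scores(sub_train, train_y, sub_train)):
            z.append(s)
        for z, s in zip(test_Z, centroid_scores(sub_train, train_y, sub_test)):
            z.append(s)
    # Stage 2: aggregate the pathway-level scores with a second learner.
    final = centroid_scores(train_Z, train_y, test_Z)
    return [1 if s > 0 else 0 for s in final]


# Toy data (made up): features 0-1 separate the classes, features 2-3 are noise.
train_X = [
    [0.1, 0.2, 0.5, 0.4],
    [0.2, 0.1, 0.4, 0.6],
    [0.9, 0.8, 0.5, 0.5],
    [0.8, 0.9, 0.6, 0.4],
]
train_y = [0, 0, 1, 1]
test_X = [[0.15, 0.15, 0.5, 0.5], [0.85, 0.85, 0.5, 0.5]]
pathways = {"pathway_A": [0, 1], "pathway_B": [2, 3]}

predictions = two_stage_predict(train_X, train_y, test_X, pathways)
```

The point of the design is that the second stage sees one value per pathway rather than one per feature, so its input dimension equals the number of pathways. Any classifier could replace the centroid rule at either stage.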
