pith. sign in

arxiv: 1707.08504 · v1 · pith:FQYCQYM2new · submitted 2017-07-26 · 🧬 q-bio.GN · q-bio.QM· q-fin.ST

Mutation Clusters from Cancer Exome

classification 🧬 q-bio.GN q-bio.QMq-fin.ST
keywords cancerexomemutationsamplesstabletypesclusteringout-of-sample
0
0 comments X
read the original abstract

We apply our statistically deterministic machine learning/clustering algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to 10,656 published exome samples for 32 cancer types. A majority of cancer types exhibit mutation clustering structure. Our results are in-sample stable. They are also out-of-sample stable when applied to 1,389 published genome samples across 14 cancer types. In contrast, we find in- and out-of-sample instabilities in cancer signatures extracted from exome samples via nonnegative matrix factorization (NMF), a computationally costly and non-deterministic method. Extracting stable mutation structures from exome data could have important implications for speed and cost, which are critical for early-stage cancer diagnostics such as novel blood-test methods currently in development.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.