pith. sign in

arxiv: 1506.01286 · v1 · pith:HWQ5NHOLnew · submitted 2015-06-03 · 📊 stat.ML · q-bio.GN

PeakSegJoint: fast supervised peak detection via joint segmentation of multiple count data samples

classification 📊 stat.ML q-bio.GN
keywords segmentationdetectionmodelpeakpeaksegjointproposesamplesalgorithms
0
0 comments X
read the original abstract

Joint peak detection is a central problem when comparing samples in genomic data analysis, but current algorithms for this task are unsupervised and limited to at most 2 sample types. We propose PeakSegJoint, a new constrained maximum likelihood segmentation model for any number of sample types. To select the number of peaks in the segmentation, we propose a supervised penalty learning model. To infer the parameters of these two models, we propose to use a discrete optimization heuristic for the segmentation, and convex optimization for the penalty learning. In comparisons with state-of-the-art peak detection algorithms, PeakSegJoint achieves similar accuracy, faster speeds, and a more interpretable model with overlapping peaks that occur in exactly the same positions across all samples.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.