pith. sign in

arxiv: 1703.03352 · v1 · pith:36WZOSBMnew · submitted 2017-03-09 · 📊 stat.CO · q-bio.GN· stat.ML

A log-linear time algorithm for constrained changepoint detection

classification 📊 stat.CO q-bio.GNstat.ML
keywords datadetectionaccuracyalgorithmchangepointtimeconstrainedconstraints
0
0 comments X
read the original abstract

Changepoint detection is a central problem in time series and genomic data. For some applications, it is natural to impose constraints on the directions of changes. One example is ChIP-seq data, for which adding an up-down constraint improves peak detection accuracy, but makes the optimization problem more complicated. We show how a recently proposed functional pruning technique can be adapted to solve such constrained changepoint detection problems. This leads to a new algorithm which can solve problems with arbitrary affine constraints on adjacent segment means, and which has empirical time complexity that is log-linear in the amount of data. This algorithm achieves state-of-the-art accuracy in a benchmark of several genomic data sets, and is orders of magnitude faster than existing algorithms that have similar accuracy. Our implementation is available as the PeakSegPDPA function in the coseg R package, https://github.com/tdhock/coseg

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.