arxiv: cs/0512016 · v2 · submitted 2005-12-04 · 💻 cs.DS · cs.CE

A linear-time algorithm for finding the longest segment which scores above a given threshold

classification: cs.DS, cs.CE
keywords: algorithm, longest, scores, above, finding, linear-time, problem, sequence

This paper describes a linear-time algorithm that finds the longest stretch in a sequence of real numbers ("scores") in which the sum exceeds an input parameter. The algorithm also solves the problem of finding the longest interval in which the average of the scores is above a fixed threshold. The problem originates from molecular sequence analysis: for instance, the algorithm can be employed to identify long GC-rich regions in DNA sequences. The algorithm can also be used to trim low-quality ends of shotgun sequences in a preprocessing step of whole-genome assembly.
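
To make the problem statement concrete, here is a minimal Python sketch of one well-known linear-time approach: prefix sums combined with a stack of strict prefix minima. It is an illustration under stated assumptions and is not necessarily the method the paper uses. The function names `longest_segment_above` and `longest_segment_avg_above`, and the half-open `(start, end)` return convention, are hypothetical choices made for this example. The average variant relies on the observation that a segment's average exceeds a threshold exactly when the sum of the scores, each reduced by that threshold, exceeds zero.

```python
from typing import Optional, Sequence, Tuple


def longest_segment_above(
    scores: Sequence[float], threshold: float
) -> Optional[Tuple[int, int]]:
    """Return (start, end) such that scores[start:end] is a longest non-empty
    segment whose sum is strictly greater than threshold, or None if no such
    segment exists. Runs in O(n) time. (Hypothetical helper, not from the paper.)
    """
    n = len(scores)

    # prefix[k] is the sum of scores[0:k]; a segment [i, j) sums to
    # prefix[j] - prefix[i].
    prefix = [0.0] * (n + 1)
    for k, s in enumerate(scores):
        prefix[k + 1] = prefix[k] + s

    # Only strict prefix minima can be the left end of a longest segment:
    # any other index i has an earlier index with a prefix value at most
    # prefix[i], which yields a longer segment with at least the same sum.
    stack = [0]
    for i in range(1, n + 1):
        if prefix[i] < prefix[stack[-1]]:
            stack.append(i)

    # Scan right ends from the far right. The stack top holds the smallest
    # prefix value, so if it fails for this j, every deeper entry fails too.
    best: Optional[Tuple[int, int]] = None
    for j in range(n, 0, -1):
        while stack and prefix[j] - prefix[stack[-1]] > threshold:
            i = stack.pop()
            # Entries with i >= j cannot start a non-empty segment ending at
            # j or earlier, so they are discarded without being recorded.
            if i < j and (best is None or j - i > best[1] - best[0]):
                best = (i, j)
    return best


def longest_segment_avg_above(
    scores: Sequence[float], threshold: float
) -> Optional[Tuple[int, int]]:
    """Longest segment whose average is strictly greater than threshold."""
    shifted = [s - threshold for s in scores]
    return longest_segment_above(shifted, 0.0)


if __name__ == "__main__":
    print(longest_segment_above([1, -5, 2, 3, -1, 4, -10], 5))  # → (2, 6)

    # GC-rich region: score 1 for G or C, 0 otherwise (toy input).
    gc = [1 if base in "GC" else 0 for base in "ATGCGCGCAT"]
    print(longest_segment_avg_above(gc, 0.75))
```

Each index is pushed onto the stack at most once and popped at most once, which is what keeps the scan linear. With floating-point scores, the strict comparison against the threshold is subject to rounding in the prefix sums, so a real application may want integer or scaled scores.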
