Fast construction of FM-index for long sequence reads

Heng Li

REVIEW

Fast construction of FM-index for long sequence reads

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1406.0426 v1 pith:LOYFKA5X submitted 2014-06-02 q-bio.GN cs.DS

Fast construction of FM-index for long sequence reads

Heng Li This is my paper

classification q-bio.GN cs.DS

keywords readsfm-indeximplementationlongsequenceshortsortingalgorithm

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

Summary: We present a new method to incrementally construct the FM-index for both short and long sequence reads, up to the size of a genome. It is the first algorithm that can build the index while implicitly sorting the sequences in the reverse (complement) lexicographical order without a separate sorting step. The implementation is among the fastest for indexing short reads and the only one that practically works for reads of averaged kilobases in length. Availability and implementation: https://github.com/lh3/ropebwt2 Contact: hengli@broadinstitute.org

Fast construction of FM-index for long sequence reads

discussion (0)