pith. machine review for the scientific record. sign in

arxiv: 1307.6462 · v1 · submitted 2013-07-24 · 💻 cs.DS · cs.CE

Recognition: unknown

AliBI: An Alignment-Based Index for Genomic Datasets

Authors on Pith no claims yet
classification 💻 cs.DS cs.CE
keywords indexstandardgenomesindexeslz77techniqueaccessaddressed
0
0 comments X
read the original abstract

With current hardware and software, a standard computer can now hold in RAM an index for approximate pattern matching on about half a dozen human genomes. Sequencing technologies have improved so quickly, however, that scientists will soon demand indexes for thousands of genomes. Whereas most researchers who have addressed this problem have proposed completely new kinds of indexes, we recently described a simple technique that scales standard indexes to work on more genomes. Our main idea was to filter the dataset with LZ77, build a standard index for the filtered file, and then create a hybrid of that standard index and an LZ77-based index. In this paper we describe how to our technique to use alignments instead of LZ77, in order to simplify and speed up both preprocessing and random access.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.