pith. sign in

arxiv: 1610.03007 · v1 · pith:FPWTP2YTnew · submitted 2016-10-10 · 💻 cs.DS · cs.DC

Scalable Construction of Text Indexes

classification 💻 cs.DS cs.DC
keywords dataarrayconstructionsuffixalgorithmsprocessingadaptedadvanced
0
0 comments X
read the original abstract

The suffix array is the key to efficient solutions for myriads of string processing problems in different applications domains, like data compression, data mining, or Bioinformatics. With the rapid growth of available data, suffix array construction algorithms had to be adapted to advanced computational models such as external memory and distributed computing. In this article, we present five suffix array construction algorithms utilizing the new algorithmic big data batch processing framework Thrill, which allows us to process input sizes in orders of magnitude that have not been considered before.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.