pith. sign in

arxiv: 1607.08176 · v1 · pith:MHFRZL3Wnew · submitted 2016-07-27 · 💻 cs.DS

Suffix arrays with a twist

classification 💻 cs.DS
keywords dataarraylayoutprefixessearchsuffixsuffixesadding
0
0 comments X
read the original abstract

The suffix array is a classic full-text index, combining effectiveness with simplicity. We discuss three approaches aiming to improve its efficiency even more: changes to the navigation, data layout and adding extra data. In short, we show that $(i)$ how we search for the right interval boundary impacts significantly the overall search speed, $(ii)$ a B-tree data layout easily wins over the standard one, $(iii)$ the well-known idea of a lookup table for the prefixes of the suffixes can be refined with using compression, $(iv)$ caching prefixes of the suffixes in a helper array can pose a(nother) practical space-time tradeoff.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.