pith. sign in

arxiv: 1902.01280 · v1 · pith:CLWF6OPFnew · submitted 2019-02-04 · 💻 cs.DS

A New Class of Searchable and Provably Highly Compressible String Transformations

classification 💻 cs.DS
keywords stringclasstheytransformationtransformationsburrows-wheelercompressiblefamily
0
0 comments X
read the original abstract

The Burrows-Wheeler Transform is a string transformation that plays a fundamental role for the design of self-indexing compressed data structures. Over the years, researchers have successfully extended this transformation outside the domains of strings. However, efforts to find non-trivial alternatives of the original, now 25 years old, Burrows-Wheeler string transformation have met limited success. In this paper we bring new lymph to this area by introducing a whole new family of transformations that have all the myriad virtues of the BWT: they can be computed and inverted in linear time, they produce provably highly compressible strings, and they support linear time pattern search directly on the transformed string. This new family is a special case of a more general class of transformations based on context adaptive alphabet orderings, a concept introduced here. This more general class includes also the Alternating BWT, another invertible string transforms recently introduced in connection with a generalization of Lyndon words.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The Alternating BWT: an algorithmic perspective

    cs.DS 2019-07 unverdicted novelty 6.0

    ABWT is shown to be rank-invertible like BWT, supports generalized backward search for indices, and admits efficient computation via difference-cover suffix sorting plus alternating-order rotation finding.