pith. sign in

arxiv: 1102.1746 · v1 · pith:5CL47J63new · submitted 2011-02-08 · 💻 cs.DS

Algorithms for Jumbled Pattern Matching in Strings

classification 💻 cs.DS
keywords textvectorparikhalgorithmsindexlinearsizetime
0
0 comments X
read the original abstract

The Parikh vector p(s) of a string s is defined as the vector of multiplicities of the characters. Parikh vector q occurs in s if s has a substring t with p(t)=q. We present two novel algorithms for searching for a query q in a text s. One solves the decision problem over a binary text in constant time, using a linear size index of the text. The second algorithm, for a general finite alphabet, finds all occurrences of a given Parikh vector q and has sub-linear expected time complexity; we present two variants, which both use a linear size index of the text.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.