pith. sign in

arxiv: 1402.2097 · v1 · pith:NXB3FSJCnew · submitted 2014-02-10 · 💻 cs.DS

Longest Common Subsequence in k-length substrings

classification 💻 cs.DS
keywords caseinputspacetimealgorithmdefinelengthorder
0
0 comments X
read the original abstract

In this paper we define a new problem, motivated by computational biology, $LCSk$ aiming at finding the maximal number of $k$ length $substrings$, matching in both input strings while preserving their order of appearance. The traditional LCS definition is a special case of our problem, where $k = 1$. We provide an algorithm, solving the general case in $O(n^2)$ time, where $n$ is the length of the input strings, equaling the time required for the special case of $k=1$. The space requirement of the algorithm is $O(kn)$. %, however, in order to enable %backtracking of the solution, $O(n^2)$ space is needed. We also define a complementary $EDk$ distance measure and show that $EDk(A,B)$ can be computed in $O(nm)$ time and $O(km)$ space, where $m$, $n$ are the lengths of the input sequences $A$ and $B$ respectively.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The anti-lexicographic SUS-anchor: a near-optimal k=1 sampling scheme

    cs.DS 2026-05 unverdicted novelty 6.0

    The anti-lexicographic SUS-anchor achieves sampling densities less than 1% above the lower bound for alphabet size 4 and k=1, substantially outperforming bidirectional anchors.