arxiv: 1705.06826 · v1 · pith:VHGN3IJFnew · submitted 2017-05-18 · 🧮 math.PR

Simulations, Computations, and Statistics for Longest Common Subsequences

classification 🧮 math.PR
keywords common · longest · sequences · similarity · subsequences · approach · Chvátal-Sankoff · behavior

The length of the longest common subsequences (LCSs) is often used as a similarity measure to compare two (or more) random words. Below we study its statistical behavior in mean and variance using a Monte Carlo approach, from which we then develop a hypothesis testing method for sequence similarity. Finally, theoretical upper bounds are obtained for the Chvátal-Sankoff constant of multiple sequences.
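
To make the Monte Carlo idea in the abstract concrete, here is a minimal sketch of how the mean and variance of the LCS length of two random words could be estimated. It is not the paper's code. It assumes two words of equal length `n` with i.i.d. uniform letters, and the names `lcs_length` and `simulate_lcs` are hypothetical.

```python
import random
import statistics


def lcs_length(x, y):
    """Length of a longest common subsequence of x and y.

    Textbook dynamic program, O(len(x) * len(y)) time, keeping one row at a time.
    """
    prev = [0] * (len(y) + 1)
    for a in x:
        curr = [0]
        for j, b in enumerate(y, start=1):
            if a == b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def simulate_lcs(n, alphabet_size, trials, seed=0):
    """Monte Carlo estimate of the mean and sample variance of the LCS length.

    Assumption (for illustration only): both words have length n and their
    letters are i.i.d. uniform on {0, ..., alphabet_size - 1}.
    """
    rng = random.Random(seed)
    lengths = []
    for _ in range(trials):
        x = [rng.randrange(alphabet_size) for _ in range(n)]
        y = [rng.randrange(alphabet_size) for _ in range(n)]
        lengths.append(lcs_length(x, y))
    return statistics.mean(lengths), statistics.variance(lengths)


if __name__ == "__main__":
    n = 200
    mean, var = simulate_lcs(n, alphabet_size=2, trials=100)
    # mean / n is a finite-n estimate of the normalized expected LCS length.
    print(f"n={n}  mean={mean:.2f}  mean/n={mean / n:.4f}  variance={var:.2f}")
```

For two sequences, the ratio `mean / n` estimates the normalized expected LCS length, whose limit as `n` grows is the Chvátal-Sankoff constant. At any finite `n` the expected ratio lies below that limit, so a simulated value should be read as a rough, noisy indication rather than a bound.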
