pith. sign in

arxiv: cond-mat/0310649 · v1 · submitted 2003-10-28 · ❄️ cond-mat.stat-mech · cond-mat.soft· q-bio.GN

In the search for the low-complexity sequences in prokaryotic and eukaryotic genomes: how to derive a coherent picture from global and local entropy measures

classification ❄️ cond-mat.stat-mech cond-mat.softq-bio.GN
keywords genomeschoicecorrelationlow-complexitynseisequencestechniqueamount
0
0 comments X
read the original abstract

We investigate on a possible way to connect the presence of Low-Complexity Sequences (LCS) in DNA genomes and the nonstationary properties of base correlations. Under the hypothesis that these variations signal a change in the DNA function, we use a new technique, called Non-Stationarity Entropic Index (NSEI) method, and we prove that this technique is an efficient way to detect functional changes with respect to a random baseline. The remarkable aspect is that NSEI does not imply any training data or fitting parameter, the only arbitrarity being the choice of a marker in the sequence. We make this choice on the basis of biological information about LCS distributions in genomes. We show that there exists a correlation between changing the amount in LCS and the ratio of long- to short-range correlation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.