pith. sign in

arxiv: cond-mat/0102348 · v1 · submitted 2001-02-20 · ❄️ cond-mat.soft · cond-mat.stat-mech· q-bio

Long-Tail Feature of DNA Words Over- and Under-Representation in Coding Sequences

classification ❄️ cond-mat.soft cond-mat.stat-mechq-bio
keywords geneslonglong-tailnucleotidesoligonucleotidesover-representedsequencesshort
0
0 comments X
read the original abstract

We have analyzed DNA sequences of known genes from 16 yeast chromosomes (Saccharomyces cerevisiae) in terms of oligonucleotides. We have noticed that the relative abundances of oligonucleotide usage in the genome follow a long-tail Levy-like distribution. We have observed that long genes often use strongly over-represented and under-represented nucleotides, whereas it was not the case for the short genes (shorter than 300 nucleotides) under consideration. If selection on the extremely over-represented/under-represented oligonucleotides was strong, long genes would be more affected by spontaneous mutations than short ones.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.