Self-organizing Approach for Automated Gene Identification in Whole Genomes
classification
⚛️ physics.bio-ph
q-bio.GN
keywords
approachbeenfrequenciesgenomesidentificationproposedspacetriplet
read the original abstract
An approach based on using the idea of distinguished coding phase in explicit form for identification of protein-coding regions (exons) in whole genome has been proposed. For several genomes an optimal window length for averaging GC-content function and calculating codon frequencies has been found. Self-training procedure based on clustering in multidimensional space of triplet frequencies is proposed. For visualization of data in the space of triplet requiencies method of elastic maps was applied.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.