Entropic Approach for Reduction of Amino Acid Alphabets
classification
⚛️ physics.bio-ph
q-bio.QM
keywords
aminoacidclusterclusteringcountsdatascoreacids
read the original abstract
The primitive data for deducing the Miyazawa-Jernigan contact energy or BLOSUM score metrix are the pair frequency counts. Each amino acid corresponds to a distribution. Taking the Kullback-Leibler distance of two probability distributions as resemblance coefficient and relating cluster to mixed population, we perform cluster analysis of amino acids based on the frequecy counts data. Furthermore, Ward's clustering is also obtained by adopting the average score as an objective function. An ordinal cophenetic is introduced to compare results from different clustering methods.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.