pith. sign in

arxiv: 1810.04707 · v1 · pith:PUGXVAC6new · submitted 2018-10-02 · 🧬 q-bio.OT · cs.LO

An Inductive Logic Programming Approach to Validate Hexose Binding Biochemical Knowledge

classification 🧬 q-bio.OT cs.LO
keywords biochemicalfindingsbindingsitesblack-boxclassifiersdatahexose
0
0 comments X
read the original abstract

Hexoses are simple sugars that play a key role in many cellular pathways, and in the regulation of development and disease mechanisms. Current protein-sugar computational models are based, at least partially, on prior biochemical findings and knowledge. They incorporate different parts of these findings in predictive black-box models. We investigate the empirical support for biochemical findings by comparing Inductive Logic Programming (ILP) induced rules to actual biochemical results. We mine the Protein Data Bank for a representative data set of hexose binding sites, non-hexose binding sites and surface grooves. We build an ILP model of hexose-binding sites and evaluate our results against several baseline machine learning classifiers. Our method achieves an accuracy similar to that of other black-box classifiers while providing insight into the discriminating process. In addition, it confirms wet-lab findings and reveals a previously unreported Trp-Glu amino acids dependency.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.