pith. sign in

arxiv: 1804.11105 · v1 · pith:KCQF2SZKnew · submitted 2018-04-30 · 💻 cs.AI · cs.CL

Fast and scalable learning of neuro-symbolic representations of biomedical knowledge

classification 💻 cs.AI cs.CL
keywords biologicalembeddingslearningclassifiersembeddingfastknowledgeneuro-symbolic
0
0 comments X
read the original abstract

In this work we address the problem of fast and scalable learning of neuro-symbolic representations for general biological knowledge. Based on a recently published comprehensive biological knowledge graph (Alshahrani, 2017) that was used for demonstrating neuro-symbolic representation learning, we show how to train fast (under 1 minute) log-linear neural embeddings of the entities. We utilize these representations as inputs for machine learning classifiers to enable important tasks such as biological link prediction. Classifiers are trained by concatenating learned entity embeddings to represent entity relations, and training classifiers on the concatenated embeddings to discern true relations from automatically generated negative examples. Our simple embedding methodology greatly improves on classification error compared to previously published state-of-the-art results, yielding a maximum increase of $+0.28$ F-measure and $+0.22$ ROC AUC scores for the most difficult biological link prediction problem. Finally, our embedding approach is orders of magnitude faster to train ($\leq$ 1 minute vs. hours), much more economical in terms of embedding dimensions ($d=50$ vs. $d=512$), and naturally encodes the directionality of the asymmetric biological relations, that can be controlled by the order with which we concatenate the embeddings.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. T-TExTS (Teaching Text Expansion for Teacher Scaffolding): Enhancing Text Selection in High School Literature through Knowledge Graph-Based Recommendation

    cs.IR 2025-06 conditional novelty 5.0

    T-TExTS builds a domain ontology into a knowledge graph and tests four embedding methods, finding Node2Vec yields the highest AUC (0.9642-0.9750) while a hybrid embedding balances ranking quality with interpretability...