BioBLP: A Modular Framework for Learning on Multimodal Biomedical Knowledge Graphs
Pith reviewed 2026-05-24 07:43 UTC · model grok-4.3
The pith
A modular framework encodes mixed attribute types in biomedical knowledge graphs into embeddings that improve drug-protein predictions for low-degree entities.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that a modular architecture with modality-specific encoders can produce entity embeddings from multimodal attributes in biomedical KGs, handling missing data, and that these embeddings yield competitive performance on link prediction and superior results on drug-protein interaction prediction for low-degree entities, with an efficient pretraining strategy further improving outcomes and reducing runtime.
What carries the argument
A modular framework using separate encoders for each data modality to map attributes into a shared embedding space for use with a graph-based link prediction model.
If this is right
- Embeddings remain defined for entities that lack data in one or more modalities.
- Performance advantages concentrate on the low-degree entities that constitute a large share of the graph.
- Pretraining on attribute data alone raises final accuracy while cutting overall training runtime.
- The same modular design supports adding new attribute types without redesigning the graph model.
Where Pith is reading between the lines
- The same separation of modality encoders could be tested on knowledge graphs outside biomedicine that combine sequences, structures, and text.
- Reporting results broken down by entity degree would become a standard check for any embedding method that claims to use side information.
- One could measure whether the shared embedding space preserves modality-specific information by probing how well each original attribute can be reconstructed from the final vector.
Load-bearing premise
That modality-specific encoders can be combined into one embedding space without introducing systematic bias that would erase any benefit from using the attribute data.
What would settle it
If a head-to-head test on the drug-protein interaction task shows no outperformance over attribute-free baselines when restricted to the low-degree entity subset, the main performance claim would not hold.
read the original abstract
Knowledge graphs (KGs) are an important tool for representing complex relationships between entities in the biomedical domain. Several methods have been proposed for learning embeddings that can be used to predict new links in such graphs. Some methods ignore valuable attribute data associated with entities in biomedical KGs, such as protein sequences, or molecular graphs. Other works incorporate such data, but assume that entities can be represented with the same data modality. This is not always the case for biomedical KGs, where entities exhibit heterogeneous modalities that are central to their representation in the subject domain. We propose a modular framework for learning embeddings in KGs with entity attributes, that allows encoding attribute data of different modalities while also supporting entities with missing attributes. We additionally propose an efficient pretraining strategy for reducing the required training runtime. We train models using a biomedical KG containing approximately 2 million triples, and evaluate the performance of the resulting entity embeddings on the tasks of link prediction, and drug-protein interaction prediction, comparing against methods that do not take attribute data into account. In the standard link prediction evaluation, the proposed method results in competitive, yet lower performance than baselines that do not use attribute data. When evaluated in the task of drug-protein interaction prediction, the method compares favorably with the baselines. We find settings involving low degree entities, which make up for a substantial amount of the set of entities in the KG, where our method outperforms the baselines. Our proposed pretraining strategy yields significantly higher performance while reducing the required training runtime. Our implementation is available at https://github.com/elsevier-AI-Lab/BioBLP .
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces BioBLP, a modular framework for learning embeddings on biomedical KGs that encodes heterogeneous entity attributes (e.g., sequences, molecular graphs) while supporting missing attributes, plus an efficient pretraining strategy. On a KG of ~2M triples, standard link prediction yields competitive but lower performance than attribute-ignoring baselines; drug-protein interaction prediction shows favorable results, especially for low-degree entities; pretraining improves performance and reduces runtime. Implementation is released on GitHub.
Significance. If the results hold under detailed scrutiny, the work could be significant for biomedical KG embedding by addressing the common reality of multimodal, incomplete attributes without forcing uniform modality assumptions. The focus on low-degree entities (a substantial portion of real KGs) and the pretraining efficiency gain address practical limitations of prior methods.
major comments (1)
- Abstract: the central empirical claims (favorable drug-protein results, especially low-degree; pretraining gains) are presented without methodological details on modality-specific encoders, baseline descriptions, statistical tests, or ablation on missing-attribute handling, preventing assessment of whether gains are attributable to the modular design or other factors.
minor comments (1)
- Abstract: 'significantly higher performance' from pretraining is stated without naming the metric or quantifying the runtime reduction.
Simulated Author's Rebuttal
We thank the referee for their review and the opportunity to clarify points from the manuscript. We address the major comment below.
read point-by-point responses
-
Referee: [—] Abstract: the central empirical claims (favorable drug-protein results, especially low-degree; pretraining gains) are presented without methodological details on modality-specific encoders, baseline descriptions, statistical tests, or ablation on missing-attribute handling, preventing assessment of whether gains are attributable to the modular design or other factors.
Authors: We acknowledge that the abstract is necessarily concise and omits explicit methodological details to adhere to length limits. Modality-specific encoders (e.g., for protein sequences via ESM and molecular graphs via GCN) and the mechanism for handling missing attributes are described in Section 3.2 and 3.3. Baselines (attribute-ignoring models such as TransE, DistMult, and ComplEx) are specified in Section 4.2. Standard ranking metrics (MRR and Hits@K) are reported without additional statistical significance tests, consistent with common practice in KG embedding papers; we can add paired t-tests if requested. Ablation studies on missing-attribute handling appear in Section 5.3. The drug-protein interaction results (Table 3) and low-degree entity analysis (Figure 4) compare directly against the attribute-ignoring baselines on the same KG, isolating the contribution of the multimodal encoders. Pretraining gains are quantified in Section 5.5 with runtime and performance deltas. We will revise the abstract to briefly reference these elements and the modular design's role. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The paper presents an empirical modular framework for multimodal KG embeddings, with performance evaluated on link prediction and drug-protein interaction tasks against external baselines that ignore attribute data. No load-bearing step reduces by construction to fitted inputs, self-definitions, or self-citation chains; pretraining is described as an efficiency technique yielding higher performance without circular reduction to the target metrics. Claims are grounded in comparisons to independent methods and stated qualifications on low-degree entities, making the derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Large-Scale Analysis of Genetic and Clinical Patient Data
Ritchie MD. Large-Scale Analysis of Genetic and Clinical Patient Data. Annual Review of Biomedical Data Science. 2018 Jul;1(1):263–274
work page 2018
-
[2]
Big Data: astronomical or genomical? PLoS biology
Stephens ZD, et al. Big Data: astronomical or genomical? PLoS biology. 2015;13(7):e1002195
work page 2015
-
[3]
Big Data and Artificial Intelligence Modeling for Drug Discovery
Zhu H. Big Data and Artificial Intelligence Modeling for Drug Discovery. Annual Review of Pharmacology and Toxicology. 2020 Jan;60(1):573–589
work page 2020
-
[4]
The FAIR Guiding Principles for scientific data management and stewardship
Wilkinson MD, Dumontier M, et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data. 2016;3(1):160018
work page 2016
-
[5]
Wikidata as a knowledge graph for the life sciences
Waagmeester A, Stupp G, et al. Wikidata as a knowledge graph for the life sciences. eLife. 2020 mar;9
work page 2020
-
[6]
Bio2RDF: towards a mashup to build bioinformatics knowledge systems
Belleau F, Nolin MA, Tourigny N, Rigault P, Morissette J. Bio2RDF: towards a mashup to build bioinformatics knowledge systems. Journal of biomedical informatics. 2008;41(5):706–716. 25 Table A1: Classification performance on DPI benchmark with ratio 1:10. Reported are the mean (std) over the 5 best models scored on the test folds. DPI-FDA Feature Classifi...
work page 2008
-
[7]
Open PHACTS: semantic interoperability for drug discovery
Williams AJ, Harland L, Groth P, Pettifer S, Chichester C, Willighagen EL, et al. Open PHACTS: semantic interoperability for drug discovery. Drug discovery today. 2012;17(21-22):1188–1198
work page 2012
-
[8]
Domingo-Fern´ andez D, Baksi S, et al. COVID-19 Knowledge Graph: a com- putable, multi-modal, cause-and-effect knowledge model of COVID-19 pathophys- iology. Bioinformatics. 2020 09
work page 2020
-
[9]
Systematic integration of biomedical knowledge prioritizes drugs for repurposing
Himmelstein DS, Lizee A, et al. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. eLife. 2017 sep;6:e26726
work page 2017
-
[10]
Hogan A, Blomqvist E, Cochez M, d’Amato C, de Melo G, Guti´ errez C, et al. Knowledge Graphs. No. 22 in Synthesis Lectures on Data, Semantics, and Knowledge. Springer; 2021. Available from: https://kgbook.org/. 26
work page 2021
-
[11]
Drug discovery FAQs: workflows for answering multidomain drug discovery questions
Chichester C, Digles D, et al. Drug discovery FAQs: workflows for answering multidomain drug discovery questions. Drug Discovery Today. 2015;20(4):399– 405
work page 2015
-
[12]
DrugBank 3.0: a Comprehensive Resource for ‘omics’ Research on Drugs
Knox C, Law V, Jewison T, Liu P, et al. DrugBank 3.0: a Comprehensive Resource for ‘omics’ Research on Drugs. Nucleic acids research. 2010;39(suppl 1):D1035– D1041
work page 2010
-
[13]
UniProt: a hub for protein information
Bateman A, Martin MJ, O’Donovan C, Magrane M, Apweiler R, Alpi E, et al. UniProt: a hub for protein information. Nucleic Acids Res. 2015 Jan;43(Database issue):D204–212
work page 2015
-
[14]
Medical Subject Headings (MeSH)
Lipscomb CE. Medical Subject Headings (MeSH). Bull Med Libr Assoc. 2000;88(3): 265–266
work page 2000
-
[15]
Network medicine framework for identifying drug-repurposing opportunities for COVID-
Morselli Gysi D, Do Valle ´I, Zitnik M, Ameli A, Gan X, Varol O, et al. Network medicine framework for identifying drug-repurposing opportunities for COVID-
-
[16]
Proceedings of the National Academy of Sciences. 2021;118(19):e2025581118
work page 2021
-
[17]
A Review of Relational Machine Learning for Knowledge Graphs
Nickel M, Murphy K, Tresp V, Gabrilovich E. A Review of Relational Machine Learning for Knowledge Graphs. Proc IEEE. 2016;104(1):11–33. https://doi.org/ 10.1109/JPROC.2015.2483592
-
[18]
Knowledge Graph Embedding: A Survey of Approaches and Applications
Wang Q, Mao Z, Wang B, Guo L. Knowledge Graph Embedding: A Survey of Approaches and Applications. IEEE Trans Knowl Data Eng. 2017;29(12):2724–
work page 2017
-
[19]
https://doi.org/10.1109/TKDE.2017.2754499
-
[20]
Translating Embeddings for Modeling Multi-relational Data
Bordes A, Usunier N, Garc´ ıa-Dur´ an A, Weston J, Yakhnenko O. Translating Embeddings for Modeling Multi-relational Data. In: Burges CJC, Bottou L, Ghahramani Z, Weinberger KQ, editors. Advances in Neural Information Pro- cessing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, ...
work page 2013
-
[21]
Complex Embeddings for Simple Link Prediction
Trouillon T, Welbl J, Riedel S, Gaussier ´E, Bouchard G. Complex Embeddings for Simple Link Prediction. In: Balcan M, Weinberger KQ, editors. Proceedings of the 33nd International Conference on Machine Learning, ICML 2016, New York City, NY, USA, June 19-24, 2016. vol. 48 of JMLR Workshop and Conference Proceedings. JMLR.org; 2016. p. 2071–2080. Available...
work page 2016
-
[22]
RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space
Sun Z, Deng Z, Nie J, Tang J. RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space. In: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9,
work page 2019
-
[23]
Available from: https://openreview.net/forum?id= HkgEQnRqYQ
OpenReview.net; 2019. Available from: https://openreview.net/forum?id= HkgEQnRqYQ. 27
work page 2019
-
[24]
Representation Learning of Knowledge Graphs with Entity Descriptions
Xie R, Liu Z, Jia J, Luan H, Sun M. Representation Learning of Knowledge Graphs with Entity Descriptions. In: Schuurmans D, Wellman MP, editors. Pro- ceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17, 2016, Phoenix, Arizona, USA. AAAI Press; 2016. p. 2659–2665. Available from: http://www.aaai.org/ocs/index.php/AAAI/AAAI1...
work page 2016
-
[25]
Inductive Relation Prediction by Subgraph Reasoning
Teru KK, Denis EG, Hamilton WL. Inductive Relation Prediction by Subgraph Reasoning. In: Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event. vol. 119 of Proceedings of Machine Learning Research. PMLR; 2020. p. 9448–9457. Available from: http: //proceedings.mlr.press/v119/teru20a.html
work page 2020
-
[26]
Inductive Entity Representations from Text via Link Prediction
Daza D, Cochez M, Groth P. Inductive Entity Representations from Text via Link Prediction. In: Leskovec J, Grobelnik M, Najork M, Tang J, Zia L, editors. WWW ’21: The Web Conference 2021, Virtual Event / Ljubljana, Slovenia, April 19-23, 2021. ACM / IW3C2; 2021. p. 798–808. Available from: https://doi.org/ 10.1145/3442381.3450141
-
[27]
NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs
Galkin M, Denis EG, Wu J, Hamilton WL. NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs. In: The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net; 2022. p. 1–14. Available from: https:// openreview.net/forum?id=xMJWUKJnFSw
work page 2022
-
[28]
Image-embodied Knowledge Representation Learn- ing
Xie R, Liu Z, Luan H, Sun M. Image-embodied Knowledge Representation Learn- ing. In: Sierra C, editor. Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, Melbourne, Australia, August 19-25, 2017. ijcai.org; 2017. p. 3140–3146. Available from: https://doi.org/10. 24963/ijcai.2017/438
work page 2017
-
[29]
Multi-Task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs
Tay Y, Tuan LA, Phan MC, Hui SC. Multi-Task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs. In: Lim E, Winslett M, Sanderson M, Fu AW, Sun J, Culpepper JS, et al., editors. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM 2017, Singapore, November 06 - 10, 2017. ACM; 2017. p. 1029–1038. Avai...
-
[30]
Knowledge Graph Embedding with Numeric Attributes of Enti- ties
Wu Y, Wang Z. Knowledge Graph Embedding with Numeric Attributes of Enti- ties. In: Proceedings of the Third Workshop on Representation Learning for NLP. Melbourne, Australia: Association for Computational Linguistics; 2018. p. 132–136. Available from: https://aclanthology.org/W18-3017
work page 2018
-
[31]
Embedding Multimodal Relational Data for Knowledge Base Completion
Pezeshkpour P, Chen L, Singh S. Embedding Multimodal Relational Data for Knowledge Base Completion. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Brussels, Belgium: Asso- ciation for Computational Linguistics; 2018. p. 3208–3218. Available from: https://aclanthology.org/D18-1359. 28
work page 2018
-
[32]
Incorporating Literals into Knowledge Graph Embeddings
Kristiadi A, Khan MA, Lukovnikov D, Lehmann J, Fischer A. Incorporating Literals into Knowledge Graph Embeddings. In: Ghidini C, Hartig O, Maleshkova M, Sv´ atek V, Cruz IF, Hogan A, et al., editors. The Semantic Web - ISWC 2019 - 18th International Semantic Web Conference, Auckland, New Zealand, October 26-30, 2019, Proceedings, Part I. vol. 11778 of Lec...
work page 2019
-
[33]
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation
Wang X, Gao T, Zhu Z, Zhang Z, Liu Z, Li J, et al. KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation. Trans Assoc Comput Linguistics. 2021;9:176–194. https://doi.org/10.1162/tacl a 00360
work page internal anchor Pith review doi:10.1162/tacl 2021
-
[34]
Multimodal learning with graphs
Ektefaie Y, Dasoulas G, Noori A, Farhat M, Zitnik M. Multimodal learning with graphs. Nature Machine Intelligence. 2023 Apr;https://doi.org/10.1038/ s42256-023-00624-6
work page 2023
-
[35]
SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models
Wang L, Zhao W, Wei Z, Liu J. SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models. In: Muresan S, Nakov P, Villavi- cencio A, editors. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22-27, 2022. Association for Computational ...
-
[36]
StATIK: Structure and Text for Inductive Knowledge Graph Com- pletion
Markowitz E, Balasubramanian K, Mirtaheri M, Annavaram M, Galstyan A, Steeg GV. StATIK: Structure and Text for Inductive Knowledge Graph Com- pletion. In: Carpuat M, de Marneffe M, Ru´ ız IVM, editors. Findings of the Association for Computational Linguistics: NAACL 2022, Seattle, WA, United States, July 10-15, 2022. Association for Computational Linguist...
-
[37]
CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction
Safavi T, Downey D, Hope T. CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction. CoRR. 2022;abs/2205.08012. https://doi.org/10.48550/ arXiv.2205.08012. 2205.08012
-
[38]
BERT : Pre-training of deep bidirectional transformers for language understanding
Devlin J, Chang M, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirec- tional Transformers for Language Understanding. In: Burstein J, Doran C, Solorio T, editors. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-...
-
[39]
BioKEEN: a library for learning and evaluating biological knowledge graph embeddings
Ali M, Hoyt CT, ndez D, Lehmann J, Jabeen H. BioKEEN: a library for learning and evaluating biological knowledge graph embeddings. Bioinformatics. 2019 Sep;35(18):3538–3540. 29
work page 2019
-
[40]
To embed or not: network embedding as a paradigm in computational biology
Nelson W, Zitnik M, Wang B, Leskovec J, Goldenberg A, Sharan R. To embed or not: network embedding as a paradigm in computational biology. Frontiers in genetics. 2019;10:381
work page 2019
-
[41]
BioKG: A knowledge graph for relational learning on biological data
Walsh B, Mohamed SK, Nov´ aˇ cek V. BioKG: A knowledge graph for relational learning on biological data. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management; 2020. p. 3173–3180
work page 2020
-
[42]
Discovering protein drug targets using knowledge graph embeddings
Mohamed SK, ek V, Nounu A. Discovering protein drug targets using knowledge graph embeddings. Bioinformatics. 2020 Jan;36(2):603–610
work page 2020
-
[43]
Application and evaluation of knowledge graph embeddings in biomedical data
Alshahrani M, Thafar MA, Essack M. Application and evaluation of knowledge graph embeddings in biomedical data. PeerJ Comput Sci. 2021;7:e341
work page 2021
-
[44]
A Knowledge Graph-Enhanced Tensor Factorisation Model for Discovering Drug Targets
Ye C, Swiers R, Bonner S, Barrett I. A Knowledge Graph-Enhanced Tensor Factorisation Model for Discovering Drug Targets. IEEE/ACM Trans Comput Biol Bioinform. 2022 Aug;PP
work page 2022
-
[45]
Gema AP, Grabarczyk D, De Wulf W, Borole P, Alfaro JA, Minervini P, et al. Knowledge Graph Embeddings in the Biomedical Domain: Are They Useful? A Look at Link Prediction, Rule Learning, and Downstream Polypharmacy Tasks. CoRR. 2022;abs/2305.19979. https://doi.org/10.48550/arXiv.2305.19979. 2305.19979
-
[46]
Karim MR, Cochez M, Jares JB, Uddin M, Beyan OD, Decker S. Drug-Drug Inter- action Prediction Based on Knowledge Graph Embeddings and Convolutional- LSTM Network. In: Shi XM, Buck M, Ma J, Veltri P, editors. Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2019, Niagara Falls, NY, US...
work page 2019
-
[47]
ACM; 2019. p. 113–123. Available from: https://doi.org/10.1145/3307339. 3342161
-
[48]
Rossi A, Matinata A. Knowledge graph embeddings: Are relation-learning models learning relations? In: EDBT/ICDT Workshops; 2020
work page 2020
-
[49]
Choi W, Lee H. Identifying disease-gene associations using a convolutional neural network-based model by embedding a biological knowledge graph with entity descriptions. Plos one. 2021;16(10):e0258626
work page 2021
-
[50]
Alshahrani M, Almansour A, Alkhaldi A, Thafar MA, Uludag M, Essack M, et al. Combining biomedical knowledge graphs and text to improve predictions for drug-target interactions and drug-indications. PeerJ. 2022;10:e13061
work page 2022
-
[51]
Ren Z, You Z, Yu C, Li L, Guan Y, Guo L, et al. A biomedical knowledge graph- based method for drug-drug interactions prediction through combining local and global features with deep neural networks. Briefings Bioinform. 2022;23(5). https: //doi.org/10.1093/bib/bbac363. 30
-
[52]
Attention-based knowledge graph represen- tation learning for predicting drug-drug interactions
Su X, Hu L, You Z, Hu P, Zhao B. Attention-based knowledge graph represen- tation learning for predicting drug-drug interactions. Briefings in bioinformatics. 2022;23(3):bbac140
work page 2022
-
[53]
MKGE: Knowledge graph embedding with molecular structure information
Zhang Y, Li Z, Duan B, Qin L, Peng J. MKGE: Knowledge graph embedding with molecular structure information. Comput Biol Chem. 2022;100:107730. https: //doi.org/10.1016/j.compbiolchem.2022.107730
-
[54]
Multimodal reasoning based on knowledge graph embedding for specific diseases
Zhu C, Yang Z, Xia X, Li N, Zhong F, Liu L. Multimodal reasoning based on knowledge graph embedding for specific diseases. Bioinform. 2022;38(8):2235–
work page 2022
-
[55]
https://doi.org/10.1093/bioinformatics/btac085
-
[56]
You CAN Teach an Old Dog New Tricks! On Training Knowledge Graph Embeddings
Ruffinelli D, Broscheit S, Gemulla R. You CAN Teach an Old Dog New Tricks! On Training Knowledge Graph Embeddings. In: 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30,
work page 2020
-
[57]
Available from: https://openreview.net/forum?id= BkxSmlBFvr
OpenReview.net; 2020. Available from: https://openreview.net/forum?id= BkxSmlBFvr
work page 2020
-
[58]
Ali M, Berrendorf M, Hoyt CT, Vermue L, Galkin M, Sharifzadeh S, et al. Bringing Light Into the Dark: A Large-Scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework. IEEE Trans Pattern Anal Mach Intell. 2022;44(12):8825–8845. https://doi.org/10.1109/TPAMI.2021.3124805
-
[59]
Understanding the performance of knowledge graph embeddings in drug discovery
Bonner S, Barrett IP, Ye C, Swiers R, Engkvist O, Hoyt CT, et al. Understanding the performance of knowledge graph embeddings in drug discovery. Artificial Intelligence in the Life Sciences. 2022;2:100036. https://doi.org/https://doi.org/ 10.1016/j.ailsci.2022.100036
-
[60]
DrugBank: a knowledgebase for drugs, drug actions and drug targets
Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, et al. DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res. 2008 Jan;36(Database issue):D901–906
work page 2008
-
[61]
Prediction of drug- target interaction networks from the integration of chemical and genomic spaces
Yamanishi Y, Araki M, Gutteridge A, Honda W, Kanehisa M. Prediction of drug- target interaction networks from the integration of chemical and genomic spaces. Bioinformatics. 2008 Jul;24(13):i232–240
work page 2008
-
[62]
Prottrans: Toward understanding the language of life through self-supervised learning
Elnaggar A, Heinzinger M, Dallago C, Rehawi G, Wang Y, Jones L, et al. Prottrans: Toward understanding the language of life through self-supervised learning. IEEE transactions on pattern analysis and machine intelligence. 2021;44(10):7112–7127
work page 2021
-
[63]
Predicting Binding from Screening Assays with Transformer Network Embeddings
Morris P, St Clair R, Hahn WE, Barenholtz E. Predicting Binding from Screening Assays with Transformer Network Embeddings. Journal of Chemical Information and Modeling. 2020 Jun;https://doi.org/10.1021/acs.jcim.9b01212
-
[64]
BioBERT: a pre-trained biomedical language representation model for biomedical text 31 mining
Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, et al. BioBERT: a pre-trained biomedical language representation model for biomedical text 31 mining. Bioinformatics. 2019 09;36(4):1234–1240. https://doi.org/10. 1093/bioinformatics/btz682. https://academic.oup.com/bioinformatics/article- pdf/36/4/1234/48983216/bioinformatics 36 4 1234.pdf
work page 2019
-
[65]
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is All you Need. In: Guyon I, von Luxburg U, Bengio S, Wal- lach HM, Fergus R, Vishwanathan SVN, et al., editors. Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA; 20...
work page 2017
-
[66]
PyKEEN 1.0: A Python Library for Training and Evaluating Knowledge Graph Embeddings
Ali M, Berrendorf M, Hoyt CT, Vermue L, Sharifzadeh S, Tresp V, et al. PyKEEN 1.0: A Python Library for Training and Evaluating Knowledge Graph Embeddings. J Mach Learn Res. 2021;22:82:1–82:6
work page 2021
-
[67]
A multiple kernel learning algorithm for drug-target interaction prediction
Nascimento AC, Prudˆ encio RB, Costa IG. A multiple kernel learning algorithm for drug-target interaction prediction. BMC bioinformatics. 2016;17:1–16
work page 2016
-
[68]
Predicting drug-target interactions by dual-network integrated logistic matrix factorization
Hao M, Bryant SH, Wang Y. Predicting drug-target interactions by dual-network integrated logistic matrix factorization. Scientific reports. 2017;7(1):1–11
work page 2017
-
[69]
Olayan RS, Ashoor H, Bajic VB. DDR: efficient computational method to predict drug–target interactions using graph mining and machine learning approaches. Bioinformatics. 2018;34(7):1164–1173
work page 2018
-
[70]
Takaya MS, Rehmsmeier. The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets. PLOS ONE. 2015 3;10:1–21. https://doi.org/10.1371/journal.pone.0118432
-
[71]
Implications of topological imbalance for representation learning on biomedical knowl- edge graphs
Bonner S, Kirik U, Engkvist O, Tang J, Barrett IP. Implications of topological imbalance for representation learning on biomedical knowl- edge graphs. Briefings in Bioinformatics. 2022 07;23(5). Bbac279. https://doi.org/10.1093/bib/bbac279. https://academic.oup.com/bib/article- pdf/23/5/bbac279/45937607/sup main bbac279.pdf. 32
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.