Large Language Models Meet Biomedical Knowledge Graphs for Mechanistically Grounded Therapeutic Prioritization

Betty Tyler; Chih-Hsuan Wei; Chin-Hsien Tai; Chi-Ping Day; Christine C. Alewine; David Saraf; Hasan Slika; Joey Chan; Robert Leaman; Zhiyong Lu

arxiv: 2604.19815 · v1 · submitted 2026-04-17 · 💻 cs.AI

Large Language Models Meet Biomedical Knowledge Graphs for Mechanistically Grounded Therapeutic Prioritization

Chih-Hsuan Wei , Chi-Ping Day , Zhizheng Wang , Christine C. Alewine , Betty Tyler , Hasan Slika , David Saraf , Chin-Hsien Tai

show 3 more authors

Joey Chan Robert Leaman Zhiyong Lu

This is my paper

Pith reviewed 2026-05-10 08:40 UTC · model grok-4.3

classification 💻 cs.AI

keywords drug repurposingknowledge graphslarge language modelsmechanistic reasoningtherapeutic prioritizationbiomedical data integrationtranscriptional signaturescancer survival

0 comments

The pith

A hybrid system merges knowledge graphs with large language models to rank drug candidates by biological mechanism instead of historical use.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces DrugKLM to combine the relational facts stored in biomedical knowledge graphs with the explanatory reasoning of large language models. This produces ranked lists of therapy candidates that rest on coherent biological stories for a given disease rather than on patterns of past prescriptions. The method records higher accuracy on standard test collections than approaches using only graphs or only language models. Its ranking scores track gene expression changes that correlate with longer survival in patients across twelve cancer types and favor signals of actual biological change over simple repetition of old indications. Disease specialists reviewing the outputs for five cancers note that the system surfaces candidates backed by clear mechanistic links and context-specific evidence.

Core claim

DrugKLM integrates biomedical knowledge graph structure with large language model-based mechanistic reasoning to enable mechanistically grounded therapeutic prioritization. Across benchmark datasets the framework outperforms knowledge graph-only and language model-only baselines. Its confidence scores exhibit functional alignment with molecular phenotypes such that higher scores associate with transcriptional signatures linked to improved survival across twelve TCGA cancers. The scoring framework preferentially captures biologically perturbational signals rather than historical indication patterns. Expert curation across five cancers reveals systematic differences in prioritization behavior,

What carries the argument

The DrugKLM hybrid scoring framework that fuses knowledge-graph edges with large-language-model-generated mechanistic explanations to assign priority to therapeutic candidates.

If this is right

Higher accuracy in surfacing biologically plausible repurposing candidates on existing benchmarks
Priority scores that correspond to gene activity patterns tied to clinical survival outcomes
Reduced reliance on historical prescription patterns in favor of current mechanistic fit
More coherent candidate lists when reviewed by experts for specific disease contexts

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The approach could be tested on non-cancer indications to check whether the same mechanistic preference holds outside oncology data.
Tighter coupling of language-model output to graph constraints might limit the impact of incomplete biomedical coverage.
Prospective clinical validation would reveal whether the higher mechanistic scores translate into improved trial success rates.
The method suggests a route to more auditable AI recommendations by requiring explicit mechanistic links for every ranked candidate.

Load-bearing premise

Large language model reasoning supplies reliable biological explanations that integrate cleanly with the knowledge graph without bias from training data or missing graph connections.

What would settle it

A new set of patient cohorts where high DrugKLM scores fail to predict the expected transcriptional shifts or survival benefit would falsify the alignment claim.

read the original abstract

Drug repurposing is often framed as a candidate identification task, but existing approaches provide limited guidance for distinguishing biologically plausible candidates from historically well-connected ones. Here we introduce DrugKLM, a hybrid framework that integrates biomedical knowledge graph structure with large language model-based mechanistic reasoning to enable mechanistically grounded therapeutic prioritization. Across benchmark datasets, DrugKLM outperforms knowledge graph-only and language model-only baselines, including TxGNN. Beyond improved recall, DrugKLM confidence scores exhibit functional alignment with molecular phenotypes: higher scores are associated with transcriptional signatures linked to improved survival across 12 TCGA cancers. The scoring framework preferentially captures biologically perturbational signals rather than historical indication patterns. Expert curation across five cancers further reveals systematic differences in prioritization behavior, with DrugKLM elevating candidates supported by coherent mechanistic rationale and disease-specific clinical context. Together, these results establish DrugKLM as an evidence-integrative framework that translates heterogeneous biomedical data into mechanistically interpretable and clinically grounded therapeutic hypotheses.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities. The framework implicitly rests on the assumption that existing biomedical knowledge graphs are sufficiently complete and accurate for mechanistic grounding and that LLM outputs can be trusted as biologically valid without additional verification steps.

axioms (1)

domain assumption Biomedical knowledge graphs contain reliable and sufficiently complete representations of molecular relationships for mechanistic reasoning.
The hybrid prioritization depends on the KG structure being a trustworthy foundation that LLMs can build upon.

pith-pipeline@v0.9.0 · 5505 in / 1274 out tokens · 41978 ms · 2026-05-10T08:40:03.840880+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

58 extracted references · 58 canonical work pages

[1]

ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning

Matsumoto, N., et al. ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning. Bioinformatics 41, btaf031 (2025)

work page 2025
[2]

K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction

Abdullahi, T., et al. K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction. in Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2 5-16 (2025)

work page 2025
[3]

A foundation model for clinician-centered drug repurposing

Huang, K., et al. A foundation model for clinician-centered drug repurposing. Nature Medicine 30, 3601-3613 (2024)

work page 2024
[4]

DrugSim2DR: systematic prediction of drug functional similarities in the context of specific disease for drug repurposing

Wu, J., et al. DrugSim2DR: systematic prediction of drug functional similarities in the context of specific disease for drug repurposing. GigaScience 12, giad104 (2023)

work page 2023
[5]

Shao, M., Jiang, L., Meng, Z. & Xu, J. Computational drug repurposing based on a recommendation system and drug–drug functional pathway similarity. Molecules 27, 1404 (2022)

work page 2022
[6]

Computational drug repurposing by exploiting large-scale gene expression data: Strategy, methods and applications

He, H., et al. Computational drug repurposing by exploiting large-scale gene expression data: Strategy, methods and applications. Computers in biology and medicine 155, 106671 (2023)

work page 2023
[7]

& Akhmedov, M

Kwee, I., Martinelli, A., Khayal, L.A. & Akhmedov, M. metaLINCS: an R package for meta-level analysis of LINCS L1000 drug signatures using stratified connectivity mapping. Bioinformatics Advances 2, vbac064 (2022)

work page 2022
[8]

& Pesquita, C

Nunes, S., Badreddine, S. & Pesquita, C. Rewarding explainability in drug repurposing with knowledge graphs. arXiv preprint arXiv:2509.02276 (2025)

work page arXiv 2025
[9]

Biomistral: A collection of open-source pretrained large language models for medical domains

Labrak, Y ., et al. Biomistral: A collection of open-source pretrained large language models for medical domains. arXiv preprint arXiv:2402.10373 (2024). 23

work page arXiv 2024
[10]

Drug repurposing for Alzheimer’s disease using a graph-of-thoughts based large language model to infer drug-disease relationships in a comprehensive knowledge graph

Wang, Z.P ., et al. Drug repurposing for Alzheimer’s disease using a graph-of-thoughts based large language model to infer drug-disease relationships in a comprehensive knowledge graph. BioData Mining 18, 51 (2025)

work page 2025
[11]

DrugReX: an explainable drug repurposing system powered by large language models and literature-based knowledge graph

Huang, L.-C., et al. DrugReX: an explainable drug repurposing system powered by large language models and literature-based knowledge graph. Research Square, rs. 3. rs-6728958 (2025)

work page 2025
[12]

& Nematbakhsh, M

Safaei, A.A., Saboori, P ., Ramezani, R. & Nematbakhsh, M. KGLM-QA: A Novel Approach for Knowledge Graph-Enhanced Large Language Models for Question Answering. in 2024 15th International Conference on Information and Knowledge Technology (IKT) 234-240 (IEEE, 2024)

work page 2024
[13]

Drugagent: Automating ai-aided drug discovery programming through llm multi-agent collaboration

Liu, S., et al. Drugagent: Automating ai-aided drug discovery programming through llm multi- agent collaboration. arXiv preprint arXiv:2411.15692 (2024)

work page arXiv 2024
[14]

& Lai, L

Zhang, F., Zhao, Y ., Zhang, W. & Lai, L. BioScientist Agent: Designing LLM-Biomedical Agents with KG-Augmented RL Reasoning Modules for Drug Repurposing and Mechanistic of Action Elucidation. bioRxiv, 2025.2008. 2008.669291 (2025)

work page arXiv 2025
[15]

DrugReAlign: a multisource prompt framework for drug repurposing based on large language models

Wei, J., et al. DrugReAlign: a multisource prompt framework for drug repurposing based on large language models. BMC biology 22, 226 (2024)

work page 2024
[16]

Txagent: an ai agent for therapeutic reasoning across a universe of tools.arXiv preprint arXiv:2503.10970, 2025

Gao, S., et al. TxAgent: An AI agent for therapeutic reasoning across a universe of tools. arXiv preprint arXiv:2503.10970 (2025)

work page arXiv 2025
[17]

& Ricketts, C.J

Linehan, W.M. & Ricketts, C.J. The Cancer Genome Atlas of renal cell carcinoma: findings and clinical implications. Nature Reviews Urology 16, 539-552 (2019)

work page 2019
[18]

& Ide, N.C

Zarin, D.A., Tse, T., Williams, R.J., Califf, R.M. & Ide, N.C. The ClinicalTrials. gov results database— update and key issues. New England Journal of Medicine 364, 852-860 (2011)

work page 2011
[19]

& Wang, J

Zhang, Z., Cai, J., Zhang, Y . & Wang, J. Learning hierarchy-aware knowledge graph embeddings for link prediction. in Proceedings of the AAAI conference on artificial intelligence, Vol. 34 3065- 3072 (2020)

work page 2020
[20]

& Zitnik, M

Chandak, P ., Huang, K. & Zitnik, M. Building a knowledge graph to enable precision medicine. Scientific Data 10, 67 (2023)

work page 2023
[21]

Comparative toxicogenomics database’s 20th anniversary: update 2025

Davis, A.P ., et al. Comparative toxicogenomics database’s 20th anniversary: update 2025. Nucleic acids research 53, D1328-D1334 (2025)

work page 2025
[22]

PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge

Wei, C.-H., et al. PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge. Nucleic Acids Research 52, W540-W546 (2024)

work page 2024
[23]

SigCom LINCS: data and metadata search engine for a million gene expression signatures

Evangelista, J.E., et al. SigCom LINCS: data and metadata search engine for a million gene expression signatures. Nucleic acids research 50, W697-W709 (2022)

work page 2022
[24]

A landscape of pharmacogenomic interactions in cancer

Iorio, F., et al. A landscape of pharmacogenomic interactions in cancer. Cell 166, 740-754 (2016)

work page 2016
[25]

MEDICASCY: a machine learning approach for predicting small-molecule drug side effects, indications, efficacy, and modes of action

Zhou, H., et al. MEDICASCY: a machine learning approach for predicting small-molecule drug side effects, indications, efficacy, and modes of action. Molecular pharmaceutics 17, 1558-1574 (2020)

work page 2020
[26]

Systematic integration of biomedical knowledge prioritizes drugs for repurposing

Himmelstein, D.S., et al. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. elife 6, e26726 (2017)

work page 2017
[27]

Systemic therapy for mucosal, acral, and uveal melanoma

Rapisuwon, S., et al. Systemic therapy for mucosal, acral, and uveal melanoma. in Cutaneous Melanoma 1301-1335 (Springer, 2020)

work page 2020
[28]

Conserved interferon-γ signaling drives clinical response to immune checkpoint blockade therapy in melanoma

Grasso, C.S., et al. Conserved interferon-γ signaling drives clinical response to immune checkpoint blockade therapy in melanoma. Cancer cell 38, 500-515. e503 (2020)

work page 2020
[29]

B cells sustain inflammation and predict response to immune checkpoint blockade in human melanoma

Griss, J., et al. B cells sustain inflammation and predict response to immune checkpoint blockade in human melanoma. Nature communications 10, 4186 (2019)

work page 2019
[30]

& Hunte, C

Birth, D., Kao, W.-C. & Hunte, C. Structural analysis of atovaquone-inhibited cytochrome bc 1 complex reveals the molecular basis of antimalarial drug action. Nature communications 5, 4029 (2014). 24

work page 2014
[31]

Synthetic lethality-mediated precision oncology via the tumor transcriptome

Lee, J.S., et al. Synthetic lethality-mediated precision oncology via the tumor transcriptome. Cell 184, 2487-2502. e2413 (2021)

work page 2021
[32]

Tumor and microenvironment evolution during immunotherapy with nivolumab

Riaz, N., et al. Tumor and microenvironment evolution during immunotherapy with nivolumab. Cell 171, 934-949. e916 (2017)

work page 2017
[33]

Genomic and transcriptomic features of response to anti-PD-1 therapy in metastatic melanoma

Hugo, W., et al. Genomic and transcriptomic features of response to anti-PD-1 therapy in metastatic melanoma. Cell 165, 35-44 (2016)

work page 2016
[34]

& Hacohen, N

Rooney, M.S., Shukla, S.A., Wu, C.J., Getz, G. & Hacohen, N. Molecular and genetic properties of tumors associated with local immune cytolytic activity. Cell 160, 48-61 (2015)

work page 2015
[35]

K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction

Abdullahi, T., et al. K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction. arXiv preprint arXiv:2502.13344 (2025)

work page arXiv 2025
[36]

Personalized prediction of anticancer potential of non-oncology drugs through learning from genome derived molecular pathways

Dong, X., et al. Personalized prediction of anticancer potential of non-oncology drugs through learning from genome derived molecular pathways. NPJ Precision Oncology 9, 36 (2025)

work page 2025
[37]

& Lee, S

Song, H., Bang, D., Koo, B., Kim, S. & Lee, S. LLM-Integrated Representative Path Selection for Context-Aware Drug Repurposing on Biomedical Knowledge Graphs. in NeurIPS 2025 2nd Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences

work page 2025
[38]

TheraMind: A Multi-LLM Agent for Accelerating Drug Repurposing in Lung Cancer via Case Report Mining

More, V., et al. TheraMind: A Multi-LLM Agent for Accelerating Drug Repurposing in Lung Cancer via Case Report Mining. (2025)

work page 2025
[39]

Medcpt: Contrastive pre-trained transformers with large-scale pubmed search logs for zero-shot biomedical information retrieval

Jin, Q., et al. Medcpt: Contrastive pre-trained transformers with large-scale pubmed search logs for zero-shot biomedical information retrieval. Bioinformatics 39, btad651 (2023)

work page 2023
[40]

DGIdb 5.0: rebuilding the drug–gene interaction database for precision medicine and drug discovery platforms

Cannon, M., et al. DGIdb 5.0: rebuilding the drug–gene interaction database for precision medicine and drug discovery platforms. Nucleic acids research 52, D1227-D1235 (2024)

work page 2024
[41]

Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells

Yang, W., et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic acids research 41, D955-D961 (2012)

work page 2012
[42]

Enrichr: a comprehensive gene set enrichment analysis web server 2016 update

Kuleshov, M.V., et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic acids research 44, W90-W97 (2016)

work page 2016
[43]

Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1

Barbie, D.A., et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature 462, 108-112 (2009). 25 Supplementary materials: Fig. S1 Relevance scoring of TxGNN using TCGA survival outcomes and ClinicalTrials.gov trial statuses. (a) Correlation between TxGNN confidence scores and TCGA dataset survival relevance across d...

work page 2009
[44]

Disease–Drug Evidence

work page
[45]

Aggregated Disease–Gene and Drug–Gene Evidence

work page
[46]

GSEA pathway evidence

work page
[47]

Disease":

Case JSON for disease context [subtype_statements] Scoring Rules (apply independently): Disease–Drug evidence: - If direct disease–drug evidence is supported by clinical trial reports or FDA-approved indications → add 40 points. - If disease–drug evidence is indirect or preclinical only (e.g., cell line or animal studies) → add 20 points. Gene-level evide...

work page
[48]

Output must be valid JSON. 29

work page
[49]

Each score must be between 0 and 100

work page
[50]

All reasoning must be grounded in: * The provided study information * Established scientific knowledge of oncology drug development

work page
[51]

Do not use bullets or symbols that are not on a standard keyboard

work page
[52]

Include both: a) a detailed explanation for each scoring category b) the numeric score

work page
[53]

this_year-start_year

Use the evaluation features and importance levels listed below. --- EVALUATION FEATURES AND THEIR IMPORTANCE Mechanistic rationale (Importance: Highest) * How strongly the drug mechanism links to the disease biology. * Whether the target is known to be relevant. * Whether similar mechanisms have proven successful. Preclinical evidence (Importance: High) *...

work page
[54]

Read the clinical trial study from INPUT:[Input]

work page
[55]

Produce the JSON described above

work page
[56]

The JSON must be comprehensive and self-contained

work page
[57]

Avoid any special characters not available on a standard keyboard

work page
[58]

--- INPUT: [Input] Target_Disease: [Disease] Target_Drug: [Drug] Fig

When assigning overall_confidence, the model must treat the score of result_status as a major determinant. --- INPUT: [Input] Target_Disease: [Disease] Target_Drug: [Drug] Fig. S4. Scoring prompt for automated ClinicalTrials.gov–based relevance evaluation

work page

[1] [1]

ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning

Matsumoto, N., et al. ESCARGOT: an AI agent leveraging large language models, dynamic graph of thoughts, and biomedical knowledge graphs for enhanced reasoning. Bioinformatics 41, btaf031 (2025)

work page 2025

[2] [2]

K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction

Abdullahi, T., et al. K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction. in Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2 5-16 (2025)

work page 2025

[3] [3]

A foundation model for clinician-centered drug repurposing

Huang, K., et al. A foundation model for clinician-centered drug repurposing. Nature Medicine 30, 3601-3613 (2024)

work page 2024

[4] [4]

DrugSim2DR: systematic prediction of drug functional similarities in the context of specific disease for drug repurposing

Wu, J., et al. DrugSim2DR: systematic prediction of drug functional similarities in the context of specific disease for drug repurposing. GigaScience 12, giad104 (2023)

work page 2023

[5] [5]

Shao, M., Jiang, L., Meng, Z. & Xu, J. Computational drug repurposing based on a recommendation system and drug–drug functional pathway similarity. Molecules 27, 1404 (2022)

work page 2022

[6] [6]

Computational drug repurposing by exploiting large-scale gene expression data: Strategy, methods and applications

He, H., et al. Computational drug repurposing by exploiting large-scale gene expression data: Strategy, methods and applications. Computers in biology and medicine 155, 106671 (2023)

work page 2023

[7] [7]

& Akhmedov, M

Kwee, I., Martinelli, A., Khayal, L.A. & Akhmedov, M. metaLINCS: an R package for meta-level analysis of LINCS L1000 drug signatures using stratified connectivity mapping. Bioinformatics Advances 2, vbac064 (2022)

work page 2022

[8] [8]

& Pesquita, C

Nunes, S., Badreddine, S. & Pesquita, C. Rewarding explainability in drug repurposing with knowledge graphs. arXiv preprint arXiv:2509.02276 (2025)

work page arXiv 2025

[9] [9]

Biomistral: A collection of open-source pretrained large language models for medical domains

Labrak, Y ., et al. Biomistral: A collection of open-source pretrained large language models for medical domains. arXiv preprint arXiv:2402.10373 (2024). 23

work page arXiv 2024

[10] [10]

Drug repurposing for Alzheimer’s disease using a graph-of-thoughts based large language model to infer drug-disease relationships in a comprehensive knowledge graph

Wang, Z.P ., et al. Drug repurposing for Alzheimer’s disease using a graph-of-thoughts based large language model to infer drug-disease relationships in a comprehensive knowledge graph. BioData Mining 18, 51 (2025)

work page 2025

[11] [11]

DrugReX: an explainable drug repurposing system powered by large language models and literature-based knowledge graph

Huang, L.-C., et al. DrugReX: an explainable drug repurposing system powered by large language models and literature-based knowledge graph. Research Square, rs. 3. rs-6728958 (2025)

work page 2025

[12] [12]

& Nematbakhsh, M

Safaei, A.A., Saboori, P ., Ramezani, R. & Nematbakhsh, M. KGLM-QA: A Novel Approach for Knowledge Graph-Enhanced Large Language Models for Question Answering. in 2024 15th International Conference on Information and Knowledge Technology (IKT) 234-240 (IEEE, 2024)

work page 2024

[13] [13]

Drugagent: Automating ai-aided drug discovery programming through llm multi-agent collaboration

Liu, S., et al. Drugagent: Automating ai-aided drug discovery programming through llm multi- agent collaboration. arXiv preprint arXiv:2411.15692 (2024)

work page arXiv 2024

[14] [14]

& Lai, L

Zhang, F., Zhao, Y ., Zhang, W. & Lai, L. BioScientist Agent: Designing LLM-Biomedical Agents with KG-Augmented RL Reasoning Modules for Drug Repurposing and Mechanistic of Action Elucidation. bioRxiv, 2025.2008. 2008.669291 (2025)

work page arXiv 2025

[15] [15]

DrugReAlign: a multisource prompt framework for drug repurposing based on large language models

Wei, J., et al. DrugReAlign: a multisource prompt framework for drug repurposing based on large language models. BMC biology 22, 226 (2024)

work page 2024

[16] [16]

Txagent: an ai agent for therapeutic reasoning across a universe of tools.arXiv preprint arXiv:2503.10970, 2025

Gao, S., et al. TxAgent: An AI agent for therapeutic reasoning across a universe of tools. arXiv preprint arXiv:2503.10970 (2025)

work page arXiv 2025

[17] [17]

& Ricketts, C.J

Linehan, W.M. & Ricketts, C.J. The Cancer Genome Atlas of renal cell carcinoma: findings and clinical implications. Nature Reviews Urology 16, 539-552 (2019)

work page 2019

[18] [18]

& Ide, N.C

Zarin, D.A., Tse, T., Williams, R.J., Califf, R.M. & Ide, N.C. The ClinicalTrials. gov results database— update and key issues. New England Journal of Medicine 364, 852-860 (2011)

work page 2011

[19] [19]

& Wang, J

Zhang, Z., Cai, J., Zhang, Y . & Wang, J. Learning hierarchy-aware knowledge graph embeddings for link prediction. in Proceedings of the AAAI conference on artificial intelligence, Vol. 34 3065- 3072 (2020)

work page 2020

[20] [20]

& Zitnik, M

Chandak, P ., Huang, K. & Zitnik, M. Building a knowledge graph to enable precision medicine. Scientific Data 10, 67 (2023)

work page 2023

[21] [21]

Comparative toxicogenomics database’s 20th anniversary: update 2025

Davis, A.P ., et al. Comparative toxicogenomics database’s 20th anniversary: update 2025. Nucleic acids research 53, D1328-D1334 (2025)

work page 2025

[22] [22]

PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge

Wei, C.-H., et al. PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge. Nucleic Acids Research 52, W540-W546 (2024)

work page 2024

[23] [23]

SigCom LINCS: data and metadata search engine for a million gene expression signatures

Evangelista, J.E., et al. SigCom LINCS: data and metadata search engine for a million gene expression signatures. Nucleic acids research 50, W697-W709 (2022)

work page 2022

[24] [24]

A landscape of pharmacogenomic interactions in cancer

Iorio, F., et al. A landscape of pharmacogenomic interactions in cancer. Cell 166, 740-754 (2016)

work page 2016

[25] [25]

MEDICASCY: a machine learning approach for predicting small-molecule drug side effects, indications, efficacy, and modes of action

Zhou, H., et al. MEDICASCY: a machine learning approach for predicting small-molecule drug side effects, indications, efficacy, and modes of action. Molecular pharmaceutics 17, 1558-1574 (2020)

work page 2020

[26] [26]

Systematic integration of biomedical knowledge prioritizes drugs for repurposing

Himmelstein, D.S., et al. Systematic integration of biomedical knowledge prioritizes drugs for repurposing. elife 6, e26726 (2017)

work page 2017

[27] [27]

Systemic therapy for mucosal, acral, and uveal melanoma

Rapisuwon, S., et al. Systemic therapy for mucosal, acral, and uveal melanoma. in Cutaneous Melanoma 1301-1335 (Springer, 2020)

work page 2020

[28] [28]

Conserved interferon-γ signaling drives clinical response to immune checkpoint blockade therapy in melanoma

Grasso, C.S., et al. Conserved interferon-γ signaling drives clinical response to immune checkpoint blockade therapy in melanoma. Cancer cell 38, 500-515. e503 (2020)

work page 2020

[29] [29]

B cells sustain inflammation and predict response to immune checkpoint blockade in human melanoma

Griss, J., et al. B cells sustain inflammation and predict response to immune checkpoint blockade in human melanoma. Nature communications 10, 4186 (2019)

work page 2019

[30] [30]

& Hunte, C

Birth, D., Kao, W.-C. & Hunte, C. Structural analysis of atovaquone-inhibited cytochrome bc 1 complex reveals the molecular basis of antimalarial drug action. Nature communications 5, 4029 (2014). 24

work page 2014

[31] [31]

Synthetic lethality-mediated precision oncology via the tumor transcriptome

Lee, J.S., et al. Synthetic lethality-mediated precision oncology via the tumor transcriptome. Cell 184, 2487-2502. e2413 (2021)

work page 2021

[32] [32]

Tumor and microenvironment evolution during immunotherapy with nivolumab

Riaz, N., et al. Tumor and microenvironment evolution during immunotherapy with nivolumab. Cell 171, 934-949. e916 (2017)

work page 2017

[33] [33]

Genomic and transcriptomic features of response to anti-PD-1 therapy in metastatic melanoma

Hugo, W., et al. Genomic and transcriptomic features of response to anti-PD-1 therapy in metastatic melanoma. Cell 165, 35-44 (2016)

work page 2016

[34] [34]

& Hacohen, N

Rooney, M.S., Shukla, S.A., Wu, C.J., Getz, G. & Hacohen, N. Molecular and genetic properties of tumors associated with local immune cytolytic activity. Cell 160, 48-61 (2015)

work page 2015

[35] [35]

K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction

Abdullahi, T., et al. K-paths: Reasoning over graph paths for drug repurposing and drug interaction prediction. arXiv preprint arXiv:2502.13344 (2025)

work page arXiv 2025

[36] [36]

Personalized prediction of anticancer potential of non-oncology drugs through learning from genome derived molecular pathways

Dong, X., et al. Personalized prediction of anticancer potential of non-oncology drugs through learning from genome derived molecular pathways. NPJ Precision Oncology 9, 36 (2025)

work page 2025

[37] [37]

& Lee, S

Song, H., Bang, D., Koo, B., Kim, S. & Lee, S. LLM-Integrated Representative Path Selection for Context-Aware Drug Repurposing on Biomedical Knowledge Graphs. in NeurIPS 2025 2nd Workshop on Multi-modal Foundation Models and Large Language Models for Life Sciences

work page 2025

[38] [38]

TheraMind: A Multi-LLM Agent for Accelerating Drug Repurposing in Lung Cancer via Case Report Mining

More, V., et al. TheraMind: A Multi-LLM Agent for Accelerating Drug Repurposing in Lung Cancer via Case Report Mining. (2025)

work page 2025

[39] [39]

Medcpt: Contrastive pre-trained transformers with large-scale pubmed search logs for zero-shot biomedical information retrieval

Jin, Q., et al. Medcpt: Contrastive pre-trained transformers with large-scale pubmed search logs for zero-shot biomedical information retrieval. Bioinformatics 39, btad651 (2023)

work page 2023

[40] [40]

DGIdb 5.0: rebuilding the drug–gene interaction database for precision medicine and drug discovery platforms

Cannon, M., et al. DGIdb 5.0: rebuilding the drug–gene interaction database for precision medicine and drug discovery platforms. Nucleic acids research 52, D1227-D1235 (2024)

work page 2024

[41] [41]

Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells

Yang, W., et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic acids research 41, D955-D961 (2012)

work page 2012

[42] [42]

Enrichr: a comprehensive gene set enrichment analysis web server 2016 update

Kuleshov, M.V., et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic acids research 44, W90-W97 (2016)

work page 2016

[43] [43]

Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1

Barbie, D.A., et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature 462, 108-112 (2009). 25 Supplementary materials: Fig. S1 Relevance scoring of TxGNN using TCGA survival outcomes and ClinicalTrials.gov trial statuses. (a) Correlation between TxGNN confidence scores and TCGA dataset survival relevance across d...

work page 2009

[44] [44]

Disease–Drug Evidence

work page

[45] [45]

Aggregated Disease–Gene and Drug–Gene Evidence

work page

[46] [46]

GSEA pathway evidence

work page

[47] [47]

Disease":

Case JSON for disease context [subtype_statements] Scoring Rules (apply independently): Disease–Drug evidence: - If direct disease–drug evidence is supported by clinical trial reports or FDA-approved indications → add 40 points. - If disease–drug evidence is indirect or preclinical only (e.g., cell line or animal studies) → add 20 points. Gene-level evide...

work page

[48] [48]

Output must be valid JSON. 29

work page

[49] [49]

Each score must be between 0 and 100

work page

[50] [50]

All reasoning must be grounded in: * The provided study information * Established scientific knowledge of oncology drug development

work page

[51] [51]

Do not use bullets or symbols that are not on a standard keyboard

work page

[52] [52]

Include both: a) a detailed explanation for each scoring category b) the numeric score

work page

[53] [53]

this_year-start_year

Use the evaluation features and importance levels listed below. --- EVALUATION FEATURES AND THEIR IMPORTANCE Mechanistic rationale (Importance: Highest) * How strongly the drug mechanism links to the disease biology. * Whether the target is known to be relevant. * Whether similar mechanisms have proven successful. Preclinical evidence (Importance: High) *...

work page

[54] [54]

Read the clinical trial study from INPUT:[Input]

work page

[55] [55]

Produce the JSON described above

work page

[56] [56]

The JSON must be comprehensive and self-contained

work page

[57] [57]

Avoid any special characters not available on a standard keyboard

work page

[58] [58]

--- INPUT: [Input] Target_Disease: [Disease] Target_Drug: [Drug] Fig

When assigning overall_confidence, the model must treat the score of result_status as a major determinant. --- INPUT: [Input] Target_Disease: [Disease] Target_Drug: [Drug] Fig. S4. Scoring prompt for automated ClinicalTrials.gov–based relevance evaluation

work page