OOD-GraphLLM: Graph Large Language Model for Out-of-Distribution Generalized Drug Synergy Prediction

Linxin Xiao; Wenwu Zhu; Xin Wang; Yang Yao

arxiv: 2605.30247 · v1 · pith:SQHNO3RUnew · submitted 2026-05-28 · 💻 cs.LG · cs.MM

OOD-GraphLLM: Graph Large Language Model for Out-of-Distribution Generalized Drug Synergy Prediction

Xin Wang , Linxin Xiao , Yang Yao , Wenwu Zhu This is my paper

Pith reviewed 2026-06-29 08:34 UTC · model grok-4.3

classification 💻 cs.LG cs.MM

keywords drug synergy predictionout-of-distribution generalizationgraph large language modelmolecular graph representationbiomedical language modelretrieval-augmented tuningdrug combination

0 comments

The pith

A graph large language model predicts drug synergy accurately for molecules with unseen structures by unifying graph and language representations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper seeks to establish that drug synergy can be predicted reliably even when new compounds introduce molecular structures absent from training data. Standard methods assume training and test drugs share similar distributions, but novel compounds vary in scaffolds and sizes, breaking this assumption. The work proposes a unified model that learns both the topological structure of molecules through graphs and their semantic properties through language, using retrieval to connect the two during tuning. A reader would care because this could allow faster identification of effective drug pairs without collecting exhaustive new data for every emerging compound.

Core claim

OOD-GraphLLM achieves out-of-distribution generalized drug synergy prediction by jointly optimizing molecular graph representations and biomedical semantic language representations in a unified framework, achieved through finetuning DrugSyn-LLM and applying retrieval-augmented biomedical instruction tuning to align topological and semantic molecular information for language-based reasoning.

What carries the argument

The OOD-GraphLLM framework, which integrates graph neural architectures for molecular topology with retrieval-augmented instruction tuning on a biomedical large language model to align structural and semantic information.

If this is right

The model identifies structurally relevant and irrelevant molecular features relative to specific cell targets under OOD conditions.
Optimal graph neural network designs emerge for calculating molecular representations that support OOD generalization.
Joint use of structural graphs and language semantics enables language-based reasoning about drug combinations not seen in training.
The released model supports interactive predictions for new drug pairs via a web interface.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same alignment of graph topology with language semantics might transfer to other molecular prediction tasks that face scaffold shifts, such as toxicity forecasting.
If the alignment holds, it could lower the volume of labeled experimental data required to train reliable models for emerging compounds.
The approach points toward testing whether similar retrieval mechanisms improve generalization when cell contexts themselves vary beyond the training distribution.

Load-bearing premise

Retrieval-augmented instruction tuning aligns molecular topological information with semantic information well enough to overcome distribution shifts in drug synergy data.

What would settle it

Measure prediction accuracy on a test set of drug pairs whose molecular scaffolds and sizes differ markedly from the training distribution and compare against ground-truth synergy labels obtained from cell assays.

Figures

Figures reproduced from arXiv: 2605.30247 by Linxin Xiao, Wenwu Zhu, Xin Wang, Yang Yao.

**Figure 2.** Figure 2: The overall framework of OOD-GraphLLM . OOD-GraphLLM is able to conduct accurate O.O.D. generalized DSP by [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Chemical space visualization. Dataset. We derive all drug combination data from DrugComb [46], which contains totally 1,432,351 unique <drug, drug, cell line> triplets. Each triplet is annotated with synergy measurements under four scoring schemes, namely Loewe, Bliss, HSA, and ZIP. Detailed definitions and computation rules for these synergy scores are provided in Appendix A. Drug-related information and … view at source ↗

**Figure 4.** Figure 4: Ablation studies of OOD-GraphLLM. 4.3 Ablation Study To assess the impact of individual components within our OODGraphLLM, we perform an ablation study on HSA score dataset under both splits. Model performance is evaluated using accuracy for classification and RMSE for regression. We design multiple model variants, where each variant excludes a particular component: • w/o R-IT: We remove the retrieval-au… view at source ↗

**Figure 6.** Figure 6: A case study on 5-Fluorouracil and Vorinostat. [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗

**Figure 5.** Figure 5: Hyperparameter sensitivity analysis for 𝛼, 𝛽, different prompts and prototypes per layer. As shown in the figure, the model achieves the best overall performance when 𝛼 = 5e−3, while increasing or decreasing 𝛼 leads to noticeable performance degradation across different tasks. In contrast, increasing 𝛽 beyond this range has a relatively minor effect on both accuracy and RMSE, whereas reducing 𝛽 results in… view at source ↗

**Figure 7.** Figure 7: Detailed input prompt design for DrugSyn-LLM. [PITH_FULL_IMAGE:figures/full_fig_p011_7.png] view at source ↗

**Figure 8.** Figure 8: Structural-group preferences of diverse message [PITH_FULL_IMAGE:figures/full_fig_p012_8.png] view at source ↗

read the original abstract

Drug synergy prediction (DSP) aims to identify efficacious drug combinations under various cellular contexts with different targets. However, the continual emergence of novel compounds results in variations in molecular scaffolds and sizes, causing drug synergy data to exhibit out-of-distribution (O.O.D.) shifts with respect to topological structure. Existing works rely on in-distribution (I.D.) assumption, failing to handle the O.O.D. shifts. To solve this problem, we study out-of-distribution generalized drug synergy prediction through a graph large language model for the first time. Nevertheless, O.O.D. generalized DSP is highly non-trivial, posing several challenges: i) how to discover structurally relevant and irrelevant molecular representations with respect to cell targets; ii) how to find the optimal graph neural architectures that accurately calculate molecular representations; and iii) how to jointly leverage molecular structural and semantic information in LLMs. To address these challenges, we propose OOD-GraphLLM, a novel graphLLM framework which is able to accurately predict drug synergy under O.O.D. settings via jointly optimizing molecular graph representation and biomedical semantic language representations in a unified manner. Furthermore, we finetune DrugSyn-LLM, a biomedical LLM, and employ a retrieval-augmented biomedical instruction tuning strategy to align molecular topological information and molecular semantic information with language-based reasoning for O.O.D. generalized DSP. Both the source code (https://github.com/EkkoXiao/Bio-GraphLLM) and released model (https://mn.cs.tsinghua.edu.cn/bio-graphllm/) are publicly available, where users are allowed to download model resources and interactively use the system through a web interface.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper introduces the first graph LLM for OOD drug synergy prediction but the abstract offers no results to evaluate the claims.

read the letter

The one thing to know is that this is the first paper to tackle out-of-distribution drug synergy prediction with a graph LLM, using a joint optimization of molecular graphs and semantic language reps via retrieval-augmented tuning.

What the paper does is identify a real issue: new compounds change scaffolds and sizes, breaking the usual in-distribution assumption in DSP. They lay out three specific challenges and propose OOD-GraphLLM to address them by finetuning a biomedical LLM and aligning topo and semantic info. Making the code and model available with a web demo is helpful for anyone who wants to experiment.

The approach sounds sensible for combining GNNs with LLMs in this domain.

Where it falls short right now is the lack of any supporting evidence. The abstract states the model can accurately predict under OOD but gives no metrics, no ablation results on the tuning strategy, and no description of the retrieval corpus or how alignment is verified. That makes the central mechanism an unproven claim at this stage. If the full paper has experiments showing gains on OOD splits from novel scaffolds, that would change things.

This paper is for specialists in AI for pharma and graph-based molecular modeling. Someone already working on robustness in drug combination prediction would get the most from the framework details.

It deserves a serious referee. The idea is new enough and the application practical enough that reviewers should see the full experiments and judge if the joint optimization delivers on OOD generalization.

Referee Report

2 major / 1 minor

Summary. The paper introduces OOD-GraphLLM, the first graph large language model framework for out-of-distribution generalized drug synergy prediction (DSP). It addresses OOD shifts arising from novel molecular scaffolds and sizes by jointly optimizing molecular graph representations and biomedical semantic language representations in a unified manner, finetuning DrugSyn-LLM and applying a retrieval-augmented biomedical instruction tuning strategy to align topological and semantic information for OOD generalization.

Significance. If the OOD generalization results hold with proper validation, the work could be significant for advancing DSP under distribution shifts, as the first application of graphLLMs to this problem; the public release of source code and the interactive model at the provided GitHub and web links is a clear strength for reproducibility and community use.

major comments (2)

[Experiments] The central claim that the retrieval-augmented biomedical instruction tuning successfully aligns molecular topological (graph) representations with semantic (language) information to handle OOD shifts is not supported by isolated experimental validation. No ablation isolates the tuning strategy's contribution to OOD metrics versus standard fine-tuning or graph-only baselines (see § on experiments and the description of the tuning strategy).
[Method] The method section provides no quantitative measure or metric for how alignment between molecular topological information and molecular semantic information is achieved or verified during joint optimization, leaving the mechanism for OOD handling as an untested assumption rather than a demonstrated result.

minor comments (1)

[Abstract] The abstract states the model 'is able to accurately predict' under OOD settings but contains no experimental results, error metrics, or baseline comparisons to ground this claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and positive assessment of the work's potential significance and reproducibility. We address the major comments point by point below, agreeing that additional validation would strengthen the claims.

read point-by-point responses

Referee: [Experiments] The central claim that the retrieval-augmented biomedical instruction tuning successfully aligns molecular topological (graph) representations with semantic (language) information to handle OOD shifts is not supported by isolated experimental validation. No ablation isolates the tuning strategy's contribution to OOD metrics versus standard fine-tuning or graph-only baselines (see § on experiments and the description of the tuning strategy).

Authors: We agree that the current experiments do not include an isolated ablation specifically quantifying the retrieval-augmented biomedical instruction tuning's contribution to OOD performance relative to standard fine-tuning or graph-only baselines. In the revised manuscript we will add these ablations, reporting OOD metrics (e.g., synergy prediction accuracy under scaffold and size shifts) for the full model versus the indicated variants to directly support the alignment claim. revision: yes
Referee: [Method] The method section provides no quantitative measure or metric for how alignment between molecular topological information and molecular semantic information is achieved or verified during joint optimization, leaving the mechanism for OOD handling as an untested assumption rather than a demonstrated result.

Authors: We acknowledge that the method section currently lacks an explicit quantitative metric (such as embedding similarity or alignment loss) to verify the degree of alignment between topological graph representations and semantic language representations. In revision we will introduce and report such a metric, computed before and after the joint optimization and retrieval-augmented tuning steps, to demonstrate the alignment mechanism. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation is self-contained architectural proposal

full rationale

The paper proposes OOD-GraphLLM as a graphLLM framework for OOD drug synergy prediction via joint optimization of molecular graph and biomedical semantic representations, plus retrieval-augmented instruction tuning of DrugSyn-LLM. The provided text (abstract and description) contains no equations, parameter-fitting procedures, uniqueness theorems, or derivation chains that could reduce a claimed prediction or result to its own inputs by construction. Claims rest on model architecture and (unshown) empirical results rather than self-definitional mappings, fitted inputs renamed as predictions, or load-bearing self-citations. This is the normal case of a self-contained empirical ML proposal with no inspectable circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no information on free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5841 in / 968 out tokens · 25996 ms · 2026-06-29T08:34:31.925372+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

47 extracted references · 11 canonical work pages · 2 internal anchors

[1]

Iz Beltagy, Kyle Lo, and Arman Cohan. 2019. SciBERT: A pretrained language model for scientific text.arXiv preprint arXiv:1903.10676(2019). Conference’17, July 2017, Washington, DC, USA Xin Wang et al

work page arXiv 2019
[2]

Ziwei Chai, Tianjie Zhang, Liang Wu, Kaiqiang Han, Xiaohai Hu, Xuanwen Huang, and Yang Yang. 2025. Graphllm: Boosting graph reasoning ability of large language model.IEEE Transactions on Big Data(2025)

2025
[3]

UniProt Consortium. 2019. UniProt: a worldwide hub of protein knowledge. Nucleic acids research47, D1 (2019), D506–D515

2019
[4]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). 4171–4186

2019
[5]

Yunyun Dong, Yunqing Chang, Yuxiang Wang, Qixuan Han, Xiaoyuan Wen, Ziting Yang, Yan Zhang, Yan Qiang, Kun Wu, Xiaole Fan, et al. 2024. MFSynDCP: multi-source feature collaborative interactive learning for drug combination synergy prediction.BMC bioinformatics25, 1 (2024), 140

2024
[6]

Mohamed Reda El Khili, Safyan Aman Memon, and Amin Emad. 2023. MARSY: a multitask deep-learning framework for prediction of drug combination synergy scores.Bioinformatics39, 4 (2023), btad177

2023
[7]

Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, and Xiang Wang. 2024. Moltc: Towards molecular relational modeling in language models.arXiv preprint arXiv:2402.03781(2024)

work page arXiv 2024
[8]

Yue Guo, Haitao Hu, Wenbo Chen, Hao Yin, Jian Wu, Chang-Yu Hsieh, Qiaojun He, and Ji Cao. 2024. SynergyX: a multi-modality mutual attention network for interpretable drug synergy prediction.Briefings in Bioinformatics25, 2 (2024), bbae015

2024
[9]

Betül Güvenç Paltun, Samuel Kaski, and Hiroshi Mamitsuka. 2021. Machine learning approaches for drug combination therapies.Briefings in Bioinfor- matics22, 6 (08 2021), bbab293. arXiv:https://academic.oup.com/bib/article- pdf/22/6/bbab293/41088416/bbab293.pdf doi:10.1093/bib/bbab293

work page doi:10.1093/bib/bbab293 2021
[10]

Edward J Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen, et al. 2022. Lora: Low-rank adaptation of large language models.ICLR1, 2 (2022), 3

2022
[11]

Jing Hu, Jie Gao, Xiaomin Fang, Zijing Liu, Fan Wang, Weili Huang, Hua Wu, and Guodong Zhao. 2022. DTSyn: a dual-transformer-based neural network to predict synergistic drug combinations.Briefings in Bioinformatics23, 5 (2022), bbac302

2022
[12]

Chao Huang, Xubin Ren, Jiabin Tang, Dawei Yin, and Nitesh Chawla. 2024. Large language models for graphs: Progresses and directions. InCompanion Proceedings of the ACM Web Conference 2024. 1284–1287

2024
[13]

Aleksandr Ianevski, Anil K Giri, and Tero Aittokallio. 2022. SynergyFinder 3.0: an interactive analysis and consensus interpretation of multi-drug synergies across multiple samples.Nucleic acids research50, W1 (2022), W739–W743

2022
[14]

Francesco Iorio, Theo A Knijnenburg, Daniel J Vis, Graham R Bignell, Michael P Menden, Michael Schubert, Nanne Aben, Emanuel Gonçalves, Syd Barthorpe, Howard Lightfoot, et al. 2016. A landscape of pharmacogenomic interactions in cancer.Cell166, 3 (2016), 740–754

2016
[15]

Joseph D Janizek, Safiye Celik, and Su-In Lee. 2018. Explainable machine learn- ing prediction of synergistic drug combinations for precision cancer medicine. BioRxiv(2018), 331769

2018
[16]

Yuanfeng Ji, Lu Zhang, Jiaxiang Wu, Bingzhe Wu, Lanqing Li, Long-Kai Huang, Tingyang Xu, Yu Rong, Jie Ren, Ding Xue, et al . 2023. Drugood: Out-of- distribution dataset curator and benchmark for ai-aided drug discovery–a focus on affinity prediction problems with noise annotations. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. ...

2023
[17]

Bowen Jin, Gang Liu, Chi Han, Meng Jiang, Heng Ji, and Jiawei Han. 2024. Large language models on graphs: A comprehensive survey.IEEE Transactions on Knowledge and Data Engineering(2024)

2024
[18]

Bowen Jin, Wentao Zhang, Yu Zhang, Yu Meng, Xinyang Zhang, Qi Zhu, and Jiawei Han. 2023. Patton: Language model pretraining on text-rich networks. arXiv preprint arXiv:2305.12268(2023)

work page arXiv 2023
[19]

Craig Knox, Mike Wilson, Christen M Klinger, Mark Franklin, Eponine Oler, Alex Wilson, Allison Pon, Jordan Cox, Na Eun Chin, Seth A Strawbridge, et al. 2024. DrugBank 6.0: the DrugBank knowledgebase for 2024.Nucleic acids research52, D1 (2024), D1265–D1275

2024
[20]

Halil Ibrahim Kuru, Oznur Tastan, and A Ercument Cicek. 2021. MatchMaker: a deep learning framework for drug synergy prediction.IEEE/ACM transactions on computational biology and bioinformatics19, 4 (2021), 2334–2344

2021
[21]

Greg Landrum. 2013. Rdkit documentation.Release1, 1-79 (2013), 4

2013
[22]

Huijun Li, Lin Zou, Jamal AH Kowah, Dongqiong He, Lisheng Wang, Mingqing Yuan, and Xu Liu. 2023. Predicting drug synergy and discovering new drug combinations based on a graph autoencoder and convolutional neural network. Interdisciplinary Sciences: Computational Life Sciences15, 2 (2023), 316–330

2023
[23]

Junnan Li, Dongxu Li, Silvio Savarese, and Steven Hoi. 2023. Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models. InInternational conference on machine learning. PMLR, 19730–19742

2023
[24]

Lei Li, Hongyu Zhang, Chunhou Zheng, and Yansen Su. 2025. A review of deep learning approaches for drug synergy prediction in cancer.npj Drug Discovery2, 1 (Dec. 2025), 30. doi:10.1038/s44386-025-00034-1

work page doi:10.1038/s44386-025-00034-1 2025
[25]

Tianhao Li, Sandesh Shetty, Advaith Kamath, Ajay Jaiswal, Xiaoqian Jiang, Ying Ding, and Yejin Kim. 2024. CancerGPT for few shot drug pair synergy prediction using large pretrained language models.NPJ Digital Medicine7, 1 (2024), 40

2024
[26]

Xueliang Li, Bihan Shen, Fangyoumin Feng, Kunshi Li, Zhixuan Tang, Liangxiao Ma, and Hong Li. 2024. Dual-view jointly learning improves personalized drug synergy prediction.Bioinformatics40, 10 (2024), btae604

2024
[27]

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, and Alexander Rives. 2023. Evolutionary-scale prediction of atomic-level pro- tein structure with a language model.Science379, 6637 (2023), 1123–

2023
[28]

1126/science.ade2574

arXiv:https://www.science.org/doi/pdf/10.1126/science.ade2574 doi:10. 1126/science.ade2574

work page doi:10.1126/science.ade2574
[29]

Qiao Liu and Lei Xie. 2021. TranSynergy: Mechanism-driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinations.PLoS computational biology17, 2 (2021), e1008653

2021
[30]

Tianyu Liu, Tinyi Chu, Xiao Luo, and Hongyu Zhao. 2025. Building a unified model for drug synergy analysis powered by large language models.Nature Communications16, 1 (2025), 4537

2025
[31]

Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101(2017)

work page internal anchor Pith review Pith/arXiv arXiv 2017
[32]

Kristina Preuer, Richard PI Lewis, Sepp Hochreiter, Andreas Bender, Krishna C Bulusu, and Günter Klambauer. 2018. DeepSynergy: predicting anti-cancer drug synergy with deep learning.Bioinformatics34, 9 (2018), 1538–1546

2018
[33]

Kyriakos Schwarz, Alicia Pliego-Mendieta, Amina Mollaysa, Lara Planas-Paz, Chantal Pauli, Ahmed Allam, and Michael Krauthammer. 2022. Ddos: a graph neural network based drug synergy prediction algorithm.arXiv preprint arXiv:2210.00802(2022)

work page arXiv 2022
[34]

Aravind Subramanian, Rajiv Narayan, Steven M Corsello, David D Peck, Ted E Natoli, Xiaodong Lu, Joshua Gould, John F Davis, Andrew A Tubelli, Jacob K Asiedu, et al. 2017. A next generation connectivity map: L1000 platform and the first 1,000,000 profiles.Cell171, 6 (2017), 1437–1452

2017
[35]

Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Lixin Su, Suqi Cheng, Dawei Yin, and Chao Huang. 2024. Graphgpt: Graph instruction tuning for large language models. InProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 491–500

2024
[36]

Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, and Robert Sto- jnic. 2022. Galactica: A large language model for science.arXiv preprint arXiv:2211.09085(2022)

work page internal anchor Pith review Pith/arXiv arXiv 2022
[37]

Heng Wang, Shangbin Feng, Tianxing He, Zhaoxuan Tan, Xiaochuang Han, and Yulia Tsvetkov. 2023. Can language models solve graph problems in natural language?Advances in Neural Information Processing Systems36 (2023), 30840– 30861

2023
[38]

Jinxian Wang, Xuejun Liu, Siyuan Shen, Lei Deng, and Hui Liu. 2022. DeepDDS: deep graph neural network with attention mechanism to predict synergistic drug combinations.Briefings in Bioinformatics23, 1 (2022)

2022
[39]

Tianshuo Wang, Ruheng Wang, and Leyi Wei. 2023. AttenSyn: an attention- based deep graph neural network for anticancer synergistic drug combination prediction.Journal of Chemical Information and Modeling64, 7 (2023), 2854–2862

2023
[40]

Xin Wang, Zeyang Zhang, Linxin Xiao, Haibo Chen, Chendi Ge, and Wenwu Zhu. 2025. Towards Multi-modal Graph Large Language Model.arXiv preprint arXiv:2506.09738(2025)

work page arXiv 2025
[41]

Linxin Xiao, Xin Wang, Zeyang Zhang, Yang Yao, and Wenwu Zhu. 2025. DyNAS- DDI: Dynamic Pairwise Architecture Search for Generalizable Drug-Drug Interac- tion LLM. InProceedings of the 33rd ACM International Conference on Multimedia. 2216–2225

2025
[42]

Mengdie Xu, Xinwei Zhao, Jingyu Wang, Wei Feng, Naifeng Wen, Chunyu Wang, Junjie Wang, Yun Liu, and Lingling Zhao. 2023. DFFNDDS: prediction of syner- gistic drug combinations with dual feature fusion networks.Journal of Chemin- formatics15, 1 (2023), 33

2023
[43]

Wanjuan Yang, Jorge Soares, Patricia Greninger, Elena J Edelman, Howard Light- foot, Simon Forbes, Nidhi Bindal, Dave Beare, James A Smith, I Richard Thompson, et al. 2012. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for thera- peutic biomarker discovery in cancer cells.Nucleic acids research41, D1 (2012), D955–D961

2012
[44]

Ruosong Ye, Caiqi Zhang, Runhui Wang, Shuyuan Xu, and Yongfeng Zhang. 2024. Language is all a graph needs. InFindings of the association for computational linguistics: EACL 2024. 1955–1973

2024
[45]

Barbara Zdrazil, Eloy Felix, Fiona Hunter, Emma J Manners, James Blackshaw, Sybilla Corbett, Marleen De Veij, Harris Ioannidis, David Mendez Lopez, Juan F Mosquera, et al. 2024. The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.Nucleic acids research 52, D1 (2024), D1180–D1192

2024
[46]

Jianan Zhao, Meng Qu, Chaozhuo Li, Hao Yan, Qian Liu, Rui Li, Xing Xie, and Jian Tang. 2022. Learning on large-scale text-attributed graphs via variational inference.arXiv preprint arXiv:2210.14709(2022). OOD-GraphLLM: Graph Large Language Model for Out-of-Distribution Generalized Drug Synergy Prediction Conference’17, July 2017, Washington, DC, USA

work page arXiv 2022
[47]

Shuyu Zheng, Jehad Aldahdooh, Tolou Shadbahr, Yinyin Wang, Dalal Aldah- dooh, Jie Bao, Wenyu Wang, and Jing Tang. 2021. DrugComb update: a more comprehensive drug sensitivity data repository and analysis portal.Nucleic acids research49, W1 (2021), W174–W184. A Experiment Details A.1 Operations To enable flexible architecture search over molecular graph en...

2021

[1] [1]

Iz Beltagy, Kyle Lo, and Arman Cohan. 2019. SciBERT: A pretrained language model for scientific text.arXiv preprint arXiv:1903.10676(2019). Conference’17, July 2017, Washington, DC, USA Xin Wang et al

work page arXiv 2019

[2] [2]

Ziwei Chai, Tianjie Zhang, Liang Wu, Kaiqiang Han, Xiaohai Hu, Xuanwen Huang, and Yang Yang. 2025. Graphllm: Boosting graph reasoning ability of large language model.IEEE Transactions on Big Data(2025)

2025

[3] [3]

UniProt Consortium. 2019. UniProt: a worldwide hub of protein knowledge. Nucleic acids research47, D1 (2019), D506–D515

2019

[4] [4]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). 4171–4186

2019

[5] [5]

Yunyun Dong, Yunqing Chang, Yuxiang Wang, Qixuan Han, Xiaoyuan Wen, Ziting Yang, Yan Zhang, Yan Qiang, Kun Wu, Xiaole Fan, et al. 2024. MFSynDCP: multi-source feature collaborative interactive learning for drug combination synergy prediction.BMC bioinformatics25, 1 (2024), 140

2024

[6] [6]

Mohamed Reda El Khili, Safyan Aman Memon, and Amin Emad. 2023. MARSY: a multitask deep-learning framework for prediction of drug combination synergy scores.Bioinformatics39, 4 (2023), btad177

2023

[7] [7]

Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, and Xiang Wang. 2024. Moltc: Towards molecular relational modeling in language models.arXiv preprint arXiv:2402.03781(2024)

work page arXiv 2024

[8] [8]

Yue Guo, Haitao Hu, Wenbo Chen, Hao Yin, Jian Wu, Chang-Yu Hsieh, Qiaojun He, and Ji Cao. 2024. SynergyX: a multi-modality mutual attention network for interpretable drug synergy prediction.Briefings in Bioinformatics25, 2 (2024), bbae015

2024

[9] [9]

Betül Güvenç Paltun, Samuel Kaski, and Hiroshi Mamitsuka. 2021. Machine learning approaches for drug combination therapies.Briefings in Bioinfor- matics22, 6 (08 2021), bbab293. arXiv:https://academic.oup.com/bib/article- pdf/22/6/bbab293/41088416/bbab293.pdf doi:10.1093/bib/bbab293

work page doi:10.1093/bib/bbab293 2021

[10] [10]

Edward J Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen, et al. 2022. Lora: Low-rank adaptation of large language models.ICLR1, 2 (2022), 3

2022

[11] [11]

Jing Hu, Jie Gao, Xiaomin Fang, Zijing Liu, Fan Wang, Weili Huang, Hua Wu, and Guodong Zhao. 2022. DTSyn: a dual-transformer-based neural network to predict synergistic drug combinations.Briefings in Bioinformatics23, 5 (2022), bbac302

2022

[12] [12]

Chao Huang, Xubin Ren, Jiabin Tang, Dawei Yin, and Nitesh Chawla. 2024. Large language models for graphs: Progresses and directions. InCompanion Proceedings of the ACM Web Conference 2024. 1284–1287

2024

[13] [13]

Aleksandr Ianevski, Anil K Giri, and Tero Aittokallio. 2022. SynergyFinder 3.0: an interactive analysis and consensus interpretation of multi-drug synergies across multiple samples.Nucleic acids research50, W1 (2022), W739–W743

2022

[14] [14]

Francesco Iorio, Theo A Knijnenburg, Daniel J Vis, Graham R Bignell, Michael P Menden, Michael Schubert, Nanne Aben, Emanuel Gonçalves, Syd Barthorpe, Howard Lightfoot, et al. 2016. A landscape of pharmacogenomic interactions in cancer.Cell166, 3 (2016), 740–754

2016

[15] [15]

Joseph D Janizek, Safiye Celik, and Su-In Lee. 2018. Explainable machine learn- ing prediction of synergistic drug combinations for precision cancer medicine. BioRxiv(2018), 331769

2018

[16] [16]

Yuanfeng Ji, Lu Zhang, Jiaxiang Wu, Bingzhe Wu, Lanqing Li, Long-Kai Huang, Tingyang Xu, Yu Rong, Jie Ren, Ding Xue, et al . 2023. Drugood: Out-of- distribution dataset curator and benchmark for ai-aided drug discovery–a focus on affinity prediction problems with noise annotations. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. ...

2023

[17] [17]

Bowen Jin, Gang Liu, Chi Han, Meng Jiang, Heng Ji, and Jiawei Han. 2024. Large language models on graphs: A comprehensive survey.IEEE Transactions on Knowledge and Data Engineering(2024)

2024

[18] [18]

Bowen Jin, Wentao Zhang, Yu Zhang, Yu Meng, Xinyang Zhang, Qi Zhu, and Jiawei Han. 2023. Patton: Language model pretraining on text-rich networks. arXiv preprint arXiv:2305.12268(2023)

work page arXiv 2023

[19] [19]

Craig Knox, Mike Wilson, Christen M Klinger, Mark Franklin, Eponine Oler, Alex Wilson, Allison Pon, Jordan Cox, Na Eun Chin, Seth A Strawbridge, et al. 2024. DrugBank 6.0: the DrugBank knowledgebase for 2024.Nucleic acids research52, D1 (2024), D1265–D1275

2024

[20] [20]

Halil Ibrahim Kuru, Oznur Tastan, and A Ercument Cicek. 2021. MatchMaker: a deep learning framework for drug synergy prediction.IEEE/ACM transactions on computational biology and bioinformatics19, 4 (2021), 2334–2344

2021

[21] [21]

Greg Landrum. 2013. Rdkit documentation.Release1, 1-79 (2013), 4

2013

[22] [22]

Huijun Li, Lin Zou, Jamal AH Kowah, Dongqiong He, Lisheng Wang, Mingqing Yuan, and Xu Liu. 2023. Predicting drug synergy and discovering new drug combinations based on a graph autoencoder and convolutional neural network. Interdisciplinary Sciences: Computational Life Sciences15, 2 (2023), 316–330

2023

[23] [23]

Junnan Li, Dongxu Li, Silvio Savarese, and Steven Hoi. 2023. Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models. InInternational conference on machine learning. PMLR, 19730–19742

2023

[24] [24]

Lei Li, Hongyu Zhang, Chunhou Zheng, and Yansen Su. 2025. A review of deep learning approaches for drug synergy prediction in cancer.npj Drug Discovery2, 1 (Dec. 2025), 30. doi:10.1038/s44386-025-00034-1

work page doi:10.1038/s44386-025-00034-1 2025

[25] [25]

Tianhao Li, Sandesh Shetty, Advaith Kamath, Ajay Jaiswal, Xiaoqian Jiang, Ying Ding, and Yejin Kim. 2024. CancerGPT for few shot drug pair synergy prediction using large pretrained language models.NPJ Digital Medicine7, 1 (2024), 40

2024

[26] [26]

Xueliang Li, Bihan Shen, Fangyoumin Feng, Kunshi Li, Zhixuan Tang, Liangxiao Ma, and Hong Li. 2024. Dual-view jointly learning improves personalized drug synergy prediction.Bioinformatics40, 10 (2024), btae604

2024

[27] [27]

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, and Alexander Rives. 2023. Evolutionary-scale prediction of atomic-level pro- tein structure with a language model.Science379, 6637 (2023), 1123–

2023

[28] [28]

1126/science.ade2574

arXiv:https://www.science.org/doi/pdf/10.1126/science.ade2574 doi:10. 1126/science.ade2574

work page doi:10.1126/science.ade2574

[29] [29]

Qiao Liu and Lei Xie. 2021. TranSynergy: Mechanism-driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinations.PLoS computational biology17, 2 (2021), e1008653

2021

[30] [30]

Tianyu Liu, Tinyi Chu, Xiao Luo, and Hongyu Zhao. 2025. Building a unified model for drug synergy analysis powered by large language models.Nature Communications16, 1 (2025), 4537

2025

[31] [31]

Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101(2017)

work page internal anchor Pith review Pith/arXiv arXiv 2017

[32] [32]

Kristina Preuer, Richard PI Lewis, Sepp Hochreiter, Andreas Bender, Krishna C Bulusu, and Günter Klambauer. 2018. DeepSynergy: predicting anti-cancer drug synergy with deep learning.Bioinformatics34, 9 (2018), 1538–1546

2018

[33] [33]

Kyriakos Schwarz, Alicia Pliego-Mendieta, Amina Mollaysa, Lara Planas-Paz, Chantal Pauli, Ahmed Allam, and Michael Krauthammer. 2022. Ddos: a graph neural network based drug synergy prediction algorithm.arXiv preprint arXiv:2210.00802(2022)

work page arXiv 2022

[34] [34]

Aravind Subramanian, Rajiv Narayan, Steven M Corsello, David D Peck, Ted E Natoli, Xiaodong Lu, Joshua Gould, John F Davis, Andrew A Tubelli, Jacob K Asiedu, et al. 2017. A next generation connectivity map: L1000 platform and the first 1,000,000 profiles.Cell171, 6 (2017), 1437–1452

2017

[35] [35]

Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Lixin Su, Suqi Cheng, Dawei Yin, and Chao Huang. 2024. Graphgpt: Graph instruction tuning for large language models. InProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 491–500

2024

[36] [36]

Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, and Robert Sto- jnic. 2022. Galactica: A large language model for science.arXiv preprint arXiv:2211.09085(2022)

work page internal anchor Pith review Pith/arXiv arXiv 2022

[37] [37]

Heng Wang, Shangbin Feng, Tianxing He, Zhaoxuan Tan, Xiaochuang Han, and Yulia Tsvetkov. 2023. Can language models solve graph problems in natural language?Advances in Neural Information Processing Systems36 (2023), 30840– 30861

2023

[38] [38]

Jinxian Wang, Xuejun Liu, Siyuan Shen, Lei Deng, and Hui Liu. 2022. DeepDDS: deep graph neural network with attention mechanism to predict synergistic drug combinations.Briefings in Bioinformatics23, 1 (2022)

2022

[39] [39]

Tianshuo Wang, Ruheng Wang, and Leyi Wei. 2023. AttenSyn: an attention- based deep graph neural network for anticancer synergistic drug combination prediction.Journal of Chemical Information and Modeling64, 7 (2023), 2854–2862

2023

[40] [40]

Xin Wang, Zeyang Zhang, Linxin Xiao, Haibo Chen, Chendi Ge, and Wenwu Zhu. 2025. Towards Multi-modal Graph Large Language Model.arXiv preprint arXiv:2506.09738(2025)

work page arXiv 2025

[41] [41]

Linxin Xiao, Xin Wang, Zeyang Zhang, Yang Yao, and Wenwu Zhu. 2025. DyNAS- DDI: Dynamic Pairwise Architecture Search for Generalizable Drug-Drug Interac- tion LLM. InProceedings of the 33rd ACM International Conference on Multimedia. 2216–2225

2025

[42] [42]

Mengdie Xu, Xinwei Zhao, Jingyu Wang, Wei Feng, Naifeng Wen, Chunyu Wang, Junjie Wang, Yun Liu, and Lingling Zhao. 2023. DFFNDDS: prediction of syner- gistic drug combinations with dual feature fusion networks.Journal of Chemin- formatics15, 1 (2023), 33

2023

[43] [43]

Wanjuan Yang, Jorge Soares, Patricia Greninger, Elena J Edelman, Howard Light- foot, Simon Forbes, Nidhi Bindal, Dave Beare, James A Smith, I Richard Thompson, et al. 2012. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for thera- peutic biomarker discovery in cancer cells.Nucleic acids research41, D1 (2012), D955–D961

2012

[44] [44]

Ruosong Ye, Caiqi Zhang, Runhui Wang, Shuyuan Xu, and Yongfeng Zhang. 2024. Language is all a graph needs. InFindings of the association for computational linguistics: EACL 2024. 1955–1973

2024

[45] [45]

Barbara Zdrazil, Eloy Felix, Fiona Hunter, Emma J Manners, James Blackshaw, Sybilla Corbett, Marleen De Veij, Harris Ioannidis, David Mendez Lopez, Juan F Mosquera, et al. 2024. The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.Nucleic acids research 52, D1 (2024), D1180–D1192

2024

[46] [46]

Jianan Zhao, Meng Qu, Chaozhuo Li, Hao Yan, Qian Liu, Rui Li, Xing Xie, and Jian Tang. 2022. Learning on large-scale text-attributed graphs via variational inference.arXiv preprint arXiv:2210.14709(2022). OOD-GraphLLM: Graph Large Language Model for Out-of-Distribution Generalized Drug Synergy Prediction Conference’17, July 2017, Washington, DC, USA

work page arXiv 2022

[47] [47]

Shuyu Zheng, Jehad Aldahdooh, Tolou Shadbahr, Yinyin Wang, Dalal Aldah- dooh, Jie Bao, Wenyu Wang, and Jing Tang. 2021. DrugComb update: a more comprehensive drug sensitivity data repository and analysis portal.Nucleic acids research49, W1 (2021), W174–W184. A Experiment Details A.1 Operations To enable flexible architecture search over molecular graph en...

2021