Molecular Lead Optimization via Agentic Tool Planning

Bin Chen; Haobo Zhang; Jiayu Zhou; Lingxiao Li; Ruohao Fan

arxiv: 2605.28862 · v1 · pith:HWQO5V3Onew · submitted 2026-05-21 · 💻 cs.LG · q-bio.QM

Molecular Lead Optimization via Agentic Tool Planning

Lingxiao Li , Haobo Zhang , Ruohao Fan , Bin Chen , Jiayu Zhou This is my paper

Pith reviewed 2026-06-30 17:05 UTC · model grok-4.3

classification 💻 cs.LG q-bio.QM

keywords lead optimizationADMET propertiesLLM agentsequential decision makingmolecular designdrug discoverytrajectory planning

0 comments

The pith

A trajectory-aware LLM agent for choosing sequences of molecular tools improves lead optimization over one-step methods.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper claims that current one-step molecular optimization misses the long-term effects of design choices. TRACE instead has an LLM agent plan a sequence of tool uses, thinking ahead about the whole trajectory while keeping the core structure intact. This is tested on tasks to improve ADMET properties like absorption and toxicity. The agent shows better success rates, bigger gains in the target properties, more valid molecules, and similar structures to the starting compounds. A reader would care because better optimization could speed up turning early compounds into drug candidates.

Core claim

TRACE formulates the selection of molecular optimization tools as a sequential decision-making problem over action trajectories. This allows the LLM-reasoning agent to make forward-looking refinements under structural constraints. On multiple ADMET optimization tasks, it achieves higher optimization success, larger property improvements, higher validity, while preserving molecular similarity compared to baseline models.

What carries the argument

The trajectory-aware decision process over sequences of molecular optimization tools, powered by LLM reasoning.

If this is right

Optimization success rates increase on ADMET tasks.
Property improvements are larger.
Generated molecules have higher validity.
Molecular similarity to the lead is preserved better or equally.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar sequential planning could apply to other multi-step design tasks in chemistry or materials.
Integrating this with simulation tools might allow closed-loop optimization without human intervention.
Future work could test if the gains come mainly from the trajectory view or the LLM's reasoning ability.

Load-bearing premise

That planning over full trajectories of tool uses will produce better long-term molecular outcomes than choosing tools one at a time.

What would settle it

An ablation study on the same ADMET tasks where the agent is restricted to single-step decisions and shows equal or lower performance than the full trajectory version.

Figures

Figures reproduced from arXiv: 2605.28862 by Bin Chen, Haobo Zhang, Jiayu Zhou, Lingxiao Li, Ruohao Fan.

**Figure 2.** Figure 2: In-context self-correction across TRACE explo [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Multi-step exploration under similarity constraints. [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 5.** Figure 5: TRACE overview. The orchestrator maintains the [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗

**Figure 6.** Figure 6: Anchored multi-step exploration. Starting from lead [PITH_FULL_IMAGE:figures/full_fig_p005_6.png] view at source ↗

**Figure 7.** Figure 7: In-context self-correction across TRACE exploration steps on five ADMET tasks, reported with Error Rate (ER) and Rescue Rate (RR). System Configuration and Agent Variants. TRACE instantiates four lead optimization tools, each corresponding to a GeLLMo variant finetuned on MuMOInstruct but differing in backbone and the size of the chemical property power set used during training. Specifically, we use GeL… view at source ↗

**Figure 8.** Figure 8: Representative BBBP optimization trajectories produced by TRACE. Each row starts from an input lead and shows molecules generated across exploration steps; the highlighted molecule achieves the largest relative improvement (delta) along the trajectory, measured w.r.t. the original lead (prop denotes the predicted BBBP score). and chemically plausible, the kind of scaffold-preserving microoptimization that… view at source ↗

read the original abstract

Drug discovery is a lengthy and resource-intensive process composed of multiple stages. Among these stages, lead optimization plays a critical role in transforming early hit compounds into viable drug candidates. This stage requires improving ADMET-related properties through subtle structural refinement while preserving key molecular substructures responsible for binding affinity to disease targets. Recent advances in artificial intelligence have shown promise in accelerating various aspects of drug discovery; however, most existing approaches to lead optimization rely on one-step molecular optimization, which fail to account for the long-term consequences of sequential design decisions. To address this limitation, we propose TRACE, a trajectory-aware, LLM-reasoning agent for molecular lead optimization that formulates tool selection as a sequential decision-making problem over action trajectories. Given a lead molecule and an optimization objective, TRACE makes trajectory-aware decisions over molecular optimization tools, enabling forward-looking refinement under structural constraints. Experiments on multiple ADMET optimization tasks show that our agent achieves higher optimization success, larger property improvements, and higher validity, while preserving molecular similarity compared to baseline models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

TRACE's idea of trajectory-aware sequential planning for molecular optimization is plausible but the abstract gives no evidence or ablations showing that the sequential part drives the claimed ADMET gains.

read the letter

The main takeaway is that TRACE frames molecular tool selection as a sequential trajectory-aware planning task with an LLM agent, and reports better ADMET optimization results than baselines. This is a reasonable direction because lead optimization really does involve multiple steps where early choices constrain later ones, and one-step methods can miss that. The paper does well in spelling out why sequential decisions matter for keeping molecular similarity while improving properties. It correctly notes that most existing approaches are one-step and fail to account for long-term consequences.

The soft spots are significant though. The abstract gives no experimental details, no specific baselines, no metrics, and no ablations that would show the trajectory planning itself is the reason for any gains. The stress-test note is on point here; without a controlled comparison that varies only the sequential vs one-step aspect, you can't attribute the improvements to the key modeling choice. With only the abstract available, there's no way to check the actual implementation, the validity checks, or how it compares to existing work in the literature.

This kind of work is aimed at the AI-for-chemistry community. Someone looking for new agent architectures might find the high-level idea useful, but without the supporting evidence it doesn't hold up as a completed piece. I wouldn't bring this to a reading group. I wouldn't cite it. It doesn't seem ready for peer review; it needs the full experimental section with ablations and details before that.

Referee Report

2 major / 0 minor

Summary. The paper proposes TRACE, a trajectory-aware LLM-reasoning agent for molecular lead optimization. It formulates tool selection as a sequential decision-making problem over action trajectories to address limitations of one-step optimization methods, enabling forward-looking refinement of lead molecules under structural constraints for ADMET properties. Experiments on multiple ADMET optimization tasks are reported to show higher optimization success, larger property improvements, higher validity, and preserved molecular similarity relative to baseline models.

Significance. If the experimental results and attribution to the trajectory formulation hold after controlled validation, the work could meaningfully advance agentic AI methods in drug discovery by shifting from myopic to sequential planning in molecular design. The approach directly targets a stated limitation of prior one-step methods and could inform tool-use agents in other constrained optimization domains.

major comments (2)

[Abstract / Experimental Evaluation] Abstract and Experimental Evaluation: The central claim attributes performance gains to the trajectory-aware sequential formulation, yet no ablation is described that holds the tool set, LLM backbone, and prompting fixed while varying only the sequential trajectory decision model versus a one-step/myopic baseline. Without this isolation, the reported deltas on ADMET tasks cannot be confidently linked to the modeling choice rather than ancillary factors.
[Abstract] Abstract: The manuscript asserts superior performance (higher success, larger improvements, higher validity, preserved similarity) but the provided text supplies no concrete experimental details on tasks, baselines, metrics, statistical tests, number of runs, or validity checks. This prevents evaluation of whether the data-to-claim link is sound.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the two major comments point-by-point below and have revised the manuscript to strengthen the experimental claims and abstract.

read point-by-point responses

Referee: [Abstract / Experimental Evaluation] Abstract and Experimental Evaluation: The central claim attributes performance gains to the trajectory-aware sequential formulation, yet no ablation is described that holds the tool set, LLM backbone, and prompting fixed while varying only the sequential trajectory decision model versus a one-step/myopic baseline. Without this isolation, the reported deltas on ADMET tasks cannot be confidently linked to the modeling choice rather than ancillary factors.

Authors: We agree that an explicit ablation isolating the sequential trajectory formulation is necessary to attribute gains specifically to this modeling choice. The original experiments compared TRACE to one-step baselines, but these did not hold every other factor fixed. In the revised manuscript we have added a controlled ablation that uses identical tool set, LLM backbone, and prompting, differing only in whether tool selection is myopic (single-step) or trajectory-aware (sequential). The ablation shows statistically significant gains from the trajectory component on success rate and ADMET improvement, directly supporting the central claim. revision: yes
Referee: [Abstract] Abstract: The manuscript asserts superior performance (higher success, larger improvements, higher validity, preserved similarity) but the provided text supplies no concrete experimental details on tasks, baselines, metrics, statistical tests, number of runs, or validity checks. This prevents evaluation of whether the data-to-claim link is sound.

Authors: We accept that the original abstract omitted necessary experimental specifics. The revised abstract now states: experiments were performed on five ADMET tasks (aqueous solubility, logP, hERG inhibition, CYP inhibition, and permeability); baselines comprised one-step LLM optimizers and prior agent baselines; metrics were success rate (fraction of molecules meeting target thresholds), mean property delta, validity (fraction of chemically valid outputs), and Tanimoto similarity to the starting lead; all results are means over five independent runs with standard deviations and paired t-test p-values < 0.05. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical claims rest on external experimental comparisons

full rationale

The paper introduces TRACE as an LLM-based agent that treats tool selection as a sequential trajectory problem and reports superior ADMET optimization results versus baselines. No equations, fitted parameters, or self-citations are presented that reduce any claimed prediction or uniqueness result to the input data or prior self-work by construction. The load-bearing step is the experimental comparison itself, which is independent of the modeling choice and can be falsified by replication on the same tasks. This matches the default expectation of a non-circular empirical methods paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no information on free parameters, axioms, or invented entities used by the method.

pith-pipeline@v0.9.1-grok · 5712 in / 1151 out tokens · 31288 ms · 2026-06-30T17:05:20.080287+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

24 extracted references · 5 canonical work pages · 1 internal anchor

[1]

InProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

Liddia: Language- based intelligent drug discovery agent. InProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 12015–12039. Guy W Bemis and Mark A Murcko

2025
[2]

Vishal Dey, Xiao Hu, and Xia Ning

Molecular frameworks.Journal of medicinal chemistry39, 15 (1996), 2887–2893. Vishal Dey, Xiao Hu, and Xia Ning

1996
[3]

Robert P Hertzberg and Andrew J Pope

Gellm3o: Generalizing large language models for multi-property molecule optimization.arXiv preprint arXiv:2502.1339810 (2025). Robert P Hertzberg and Andrew J Pope

work page arXiv 2025
[4]

James P Hughes, Stephen Rees, S Barrett Kalindjian, and Karen L Philpott

High-throughput screening: new technology for the 21st century.Current opinion in chemical biology4, 4 (2000), 445–451. James P Hughes, Stephen Rees, S Barrett Kalindjian, and Karen L Philpott

2000
[5]

Yoshitaka Inoue, Tianci Song, Xinling Wang, Augustin Luna, and Tianfan Fu

Principles of early drug discovery.British journal of pharmacology162, 6 (2011), 1239–1249. Yoshitaka Inoue, Tianci Song, Xinling Wang, Augustin Luna, and Tianfan Fu

2011
[6]

Jan H Jensen

Drugagent: Multi-agent large language model-based reasoning for drug-target interaction prediction.ArXiv(2025), arXiv–2408. Jan H Jensen

2025
[7]

Wengong Jin, Regina Barzilay, and Tommi Jaakkola

A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space.Chemical science10, 12 (2019), 3567–3572. Wengong Jin, Regina Barzilay, and Tommi Jaakkola

2019
[8]

György M Keserü and Gergely M Makara

Efficient drug lead discovery and optimization.Accounts of chemical research42, 6 (2009), 724–733. György M Keserü and Gergely M Makara

2009
[9]

Greg Landrum and RDKit contributors

The influence of lead discovery strategies on the properties of drug candidates.nature reviews Drug Discovery8, 3 (2009), 203–212. Greg Landrum and RDKit contributors

2009
[10]

https://www.rdkit.org

RDKit: Open-source cheminformatics. https://www.rdkit.org. Accessed: 2026-02-06. Christopher A Lipinski, Franco Lombardo, Beryl W Dominy, and Paul J Feeney

2026
[11]

Andres M

Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings.Advanced drug delivery reviews64 (2012), 4–17. Andres M. Bran, Sam Cox, Oliver Schilter, Carlo Baldassari, Andrew D White, and Philippe Schwaller

2012
[12]

Nature Machine Intelligence6, 5 (2024), 525–535

Augmenting large language models with chemistry tools. Nature Machine Intelligence6, 5 (2024), 525–535. AD McNaughton, G Ramalaxmi, A Kruel, CR Knutson, RA Varikoti, and N Kumar. [n. d.]. CACTUS: Chemistry agent connecting tool-usage to science, arXiv,

2024
[13]

arXiv preprint arXiv:2405.0097210 ([n. d.]). Meta

work page arXiv
[14]

Augmented Language Models: a Survey

Augmented language models: a survey.arXiv preprint arXiv:2302.07842(2023). Mistral AI

work page internal anchor Pith review Pith/arXiv arXiv 2023
[15]

Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Eric Hambro, Luke Zettlemoyer, Nicola Cancedda, and Thomas Scialom

How to improve R&D productivity: the pharmaceutical industry’s grand challenge.Nature reviews Drug discovery9, 3 (2010), 203–214. Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Eric Hambro, Luke Zettlemoyer, Nicola Cancedda, and Thomas Scialom

2010
[16]

Kyle Swanson, Parker Walther, Jeremy Leitz, Souhrid Mukherjee, Joseph C Wu, Ra- bindra V Shivnaraine, and James Zou

Tool- former: Language models can teach themselves to use tools.Advances in Neural Information Processing Systems36 (2023), 68539–68551. Kyle Swanson, Parker Walther, Jeremy Leitz, Souhrid Mukherjee, Joseph C Wu, Ra- bindra V Shivnaraine, and James Zou

2023
[17]

Roberto Todeschini and Viviana Consonni

ADMET-AI: a machine learning ADMET platform for evaluation of large-scale chemical libraries.Bioinformatics40, 7 (2024), btae416. Roberto Todeschini and Viviana Consonni. 2008.Handbook of molecular descriptors. John Wiley & Sons. Daniel F Veber, Stephen R Johnson, Hung-Yuan Cheng, Brian R Smith, Keith W Ward, and Kenneth D Kopple

2024
[18]

Michael J Waring, John Arrowsmith, Andrew R Leach, Paul D Leeson, Sam Mandrell, Robert M Owen, Garry Pairaudeau, William D Pennie, Stephen D Pickett, Jibo 9 Li et al

Molecular properties that influence the oral bioavail- ability of drug candidates.Journal of medicinal chemistry45, 12 (2002), 2615–2623. Michael J Waring, John Arrowsmith, Andrew R Leach, Paul D Leeson, Sam Mandrell, Robert M Owen, Garry Pairaudeau, William D Pennie, Stephen D Pickett, Jibo 9 Li et al. Wang, et al

2002
[19]

Yoshihiro Yamanishi, Michihiro Araki, Alex Gutteridge, Wataru Honda, and Minoru Kanehisa

An analysis of the attrition of drug candidates from four major pharmaceutical companies.Nature reviews Drug discovery14, 7 (2015), 475–486. Yoshihiro Yamanishi, Michihiro Araki, Alex Gutteridge, Wataru Honda, and Minoru Kanehisa

2015
[20]

Botao Yu, Frazier N Baker, Ziqi Chen, Xia Ning, and Huan Sun

Prediction of drug–target interaction networks from the integration of chemical and genomic spaces.Bioinformatics24, 13 (2008), i232–i240. Botao Yu, Frazier N Baker, Ziqi Chen, Xia Ning, and Huan Sun

2008
[21]

arXiv preprint arXiv:2402.09391 , year=

Llasmol: Ad- vancing large language models for chemistry with a large-scale, comprehensive, high-quality instruction tuning dataset.arXiv preprint arXiv:2402.09391(2024). Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Wanli Ouyang, et al

work page arXiv 2024
[22]

arXiv preprint arXiv:2402.06852 , year=

Chemllm: A chemical large language model.arXiv preprint arXiv:2402.06852(2024). Zhenpeng Zhou, Steven Kearnes, Li Li, Richard N Zare, and Patrick Riley

work page arXiv 2024
[23]

Yiheng Zhu, Jialu Wu, Chaowen Hu, Jiahuan Yan, Tingjun Hou, Jian Wu, et al

Opti- mization of molecules via deep reinforcement learning.Scientific reports9, 1 (2019), 10752. Yiheng Zhu, Jialu Wu, Chaowen Hu, Jiahuan Yan, Tingjun Hou, Jian Wu, et al

2019
[24]

tool_calls

Sample-efficient multi-objective molecular optimization with gflownets.Advances in Neural Information Processing Systems36 (2023), 79667–79684. 10 Molecular Lead Optimization via Agentic Tool Planning A Framework Components A.1 LLM Reasoner The LLM reasoner serves as the high-level reasoning and decision- making component within the overall agentic framew...

2023

[1] [1]

InProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

Liddia: Language- based intelligent drug discovery agent. InProceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 12015–12039. Guy W Bemis and Mark A Murcko

2025

[2] [2]

Vishal Dey, Xiao Hu, and Xia Ning

Molecular frameworks.Journal of medicinal chemistry39, 15 (1996), 2887–2893. Vishal Dey, Xiao Hu, and Xia Ning

1996

[3] [3]

Robert P Hertzberg and Andrew J Pope

Gellm3o: Generalizing large language models for multi-property molecule optimization.arXiv preprint arXiv:2502.1339810 (2025). Robert P Hertzberg and Andrew J Pope

work page arXiv 2025

[4] [4]

James P Hughes, Stephen Rees, S Barrett Kalindjian, and Karen L Philpott

High-throughput screening: new technology for the 21st century.Current opinion in chemical biology4, 4 (2000), 445–451. James P Hughes, Stephen Rees, S Barrett Kalindjian, and Karen L Philpott

2000

[5] [5]

Yoshitaka Inoue, Tianci Song, Xinling Wang, Augustin Luna, and Tianfan Fu

Principles of early drug discovery.British journal of pharmacology162, 6 (2011), 1239–1249. Yoshitaka Inoue, Tianci Song, Xinling Wang, Augustin Luna, and Tianfan Fu

2011

[6] [6]

Jan H Jensen

Drugagent: Multi-agent large language model-based reasoning for drug-target interaction prediction.ArXiv(2025), arXiv–2408. Jan H Jensen

2025

[7] [7]

Wengong Jin, Regina Barzilay, and Tommi Jaakkola

A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space.Chemical science10, 12 (2019), 3567–3572. Wengong Jin, Regina Barzilay, and Tommi Jaakkola

2019

[8] [8]

György M Keserü and Gergely M Makara

Efficient drug lead discovery and optimization.Accounts of chemical research42, 6 (2009), 724–733. György M Keserü and Gergely M Makara

2009

[9] [9]

Greg Landrum and RDKit contributors

The influence of lead discovery strategies on the properties of drug candidates.nature reviews Drug Discovery8, 3 (2009), 203–212. Greg Landrum and RDKit contributors

2009

[10] [10]

https://www.rdkit.org

RDKit: Open-source cheminformatics. https://www.rdkit.org. Accessed: 2026-02-06. Christopher A Lipinski, Franco Lombardo, Beryl W Dominy, and Paul J Feeney

2026

[11] [11]

Andres M

Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings.Advanced drug delivery reviews64 (2012), 4–17. Andres M. Bran, Sam Cox, Oliver Schilter, Carlo Baldassari, Andrew D White, and Philippe Schwaller

2012

[12] [12]

Nature Machine Intelligence6, 5 (2024), 525–535

Augmenting large language models with chemistry tools. Nature Machine Intelligence6, 5 (2024), 525–535. AD McNaughton, G Ramalaxmi, A Kruel, CR Knutson, RA Varikoti, and N Kumar. [n. d.]. CACTUS: Chemistry agent connecting tool-usage to science, arXiv,

2024

[13] [13]

arXiv preprint arXiv:2405.0097210 ([n. d.]). Meta

work page arXiv

[14] [14]

Augmented Language Models: a Survey

Augmented language models: a survey.arXiv preprint arXiv:2302.07842(2023). Mistral AI

work page internal anchor Pith review Pith/arXiv arXiv 2023

[15] [15]

Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Eric Hambro, Luke Zettlemoyer, Nicola Cancedda, and Thomas Scialom

How to improve R&D productivity: the pharmaceutical industry’s grand challenge.Nature reviews Drug discovery9, 3 (2010), 203–214. Timo Schick, Jane Dwivedi-Yu, Roberto Dessì, Roberta Raileanu, Maria Lomeli, Eric Hambro, Luke Zettlemoyer, Nicola Cancedda, and Thomas Scialom

2010

[16] [16]

Kyle Swanson, Parker Walther, Jeremy Leitz, Souhrid Mukherjee, Joseph C Wu, Ra- bindra V Shivnaraine, and James Zou

Tool- former: Language models can teach themselves to use tools.Advances in Neural Information Processing Systems36 (2023), 68539–68551. Kyle Swanson, Parker Walther, Jeremy Leitz, Souhrid Mukherjee, Joseph C Wu, Ra- bindra V Shivnaraine, and James Zou

2023

[17] [17]

Roberto Todeschini and Viviana Consonni

ADMET-AI: a machine learning ADMET platform for evaluation of large-scale chemical libraries.Bioinformatics40, 7 (2024), btae416. Roberto Todeschini and Viviana Consonni. 2008.Handbook of molecular descriptors. John Wiley & Sons. Daniel F Veber, Stephen R Johnson, Hung-Yuan Cheng, Brian R Smith, Keith W Ward, and Kenneth D Kopple

2024

[18] [18]

Michael J Waring, John Arrowsmith, Andrew R Leach, Paul D Leeson, Sam Mandrell, Robert M Owen, Garry Pairaudeau, William D Pennie, Stephen D Pickett, Jibo 9 Li et al

Molecular properties that influence the oral bioavail- ability of drug candidates.Journal of medicinal chemistry45, 12 (2002), 2615–2623. Michael J Waring, John Arrowsmith, Andrew R Leach, Paul D Leeson, Sam Mandrell, Robert M Owen, Garry Pairaudeau, William D Pennie, Stephen D Pickett, Jibo 9 Li et al. Wang, et al

2002

[19] [19]

Yoshihiro Yamanishi, Michihiro Araki, Alex Gutteridge, Wataru Honda, and Minoru Kanehisa

An analysis of the attrition of drug candidates from four major pharmaceutical companies.Nature reviews Drug discovery14, 7 (2015), 475–486. Yoshihiro Yamanishi, Michihiro Araki, Alex Gutteridge, Wataru Honda, and Minoru Kanehisa

2015

[20] [20]

Botao Yu, Frazier N Baker, Ziqi Chen, Xia Ning, and Huan Sun

Prediction of drug–target interaction networks from the integration of chemical and genomic spaces.Bioinformatics24, 13 (2008), i232–i240. Botao Yu, Frazier N Baker, Ziqi Chen, Xia Ning, and Huan Sun

2008

[21] [21]

arXiv preprint arXiv:2402.09391 , year=

Llasmol: Ad- vancing large language models for chemistry with a large-scale, comprehensive, high-quality instruction tuning dataset.arXiv preprint arXiv:2402.09391(2024). Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Wanli Ouyang, et al

work page arXiv 2024

[22] [22]

arXiv preprint arXiv:2402.06852 , year=

Chemllm: A chemical large language model.arXiv preprint arXiv:2402.06852(2024). Zhenpeng Zhou, Steven Kearnes, Li Li, Richard N Zare, and Patrick Riley

work page arXiv 2024

[23] [23]

Yiheng Zhu, Jialu Wu, Chaowen Hu, Jiahuan Yan, Tingjun Hou, Jian Wu, et al

Opti- mization of molecules via deep reinforcement learning.Scientific reports9, 1 (2019), 10752. Yiheng Zhu, Jialu Wu, Chaowen Hu, Jiahuan Yan, Tingjun Hou, Jian Wu, et al

2019

[24] [24]

tool_calls

Sample-efficient multi-objective molecular optimization with gflownets.Advances in Neural Information Processing Systems36 (2023), 79667–79684. 10 Molecular Lead Optimization via Agentic Tool Planning A Framework Components A.1 LLM Reasoner The LLM reasoner serves as the high-level reasoning and decision- making component within the overall agentic framew...

2023