AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment

Jiangyu Chen

arxiv: 2603.03686 · v3 · submitted 2026-03-04 · 💻 cs.AI

AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment

Jiangyu Chen This is my paper

Pith reviewed 2026-05-15 17:30 UTC · model grok-4.3

classification 💻 cs.AI

keywords neuro-symbolic AIsolvent designMonte Carlo Tree Searchdifferentiable physicschemical formulationphotoresist developermaterials discoverysparse search

0 comments

The pith

AI4S-SDS uses sparse Monte Carlo Tree Search and differentiable physics to generate fully valid chemical formulations with higher exploration diversity than baselines.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a neuro-symbolic framework that pairs multi-agent reasoning with a specialized Monte Carlo Tree Search engine to automate the design of chemical solvents and formulations. It solves context-length limits in long searches by storing states sparsely and reconstructing paths on demand, then adds global-local planning and sibling-aware node expansion to avoid repetitive paths. A differentiable physics module translates symbolic choices into continuous mixing ratios that satisfy thermodynamic constraints through a hybrid loss with sparsity regularization. This combination yields complete validity under the chosen physical rules and surfaces a new photoresist developer that matches or exceeds a commercial standard in early lithography tests.

Core claim

AI4S-SDS establishes that a closed-loop neuro-symbolic architecture, built around sparse state storage, dynamic path reconstruction, global-local search, sibling-aware expansion, and a differentiable physics engine with hybrid normalized loss plus sparsity regularization, produces chemical formulations that satisfy all adopted HSP-based thermodynamic constraints while achieving substantially greater exploration diversity than standard agents; the system further identifies a novel photoresist developer formulation that performs competitively or better than a commercial benchmark in preliminary lithography experiments.

What carries the argument

Sparse State Storage with Dynamic Path Reconstruction inside a Monte Carlo Tree Search engine, paired with a Differentiable Physics Engine that enforces constraints via hybrid normalized loss and sparsity-inducing regularization.

If this is right

The framework reaches full validity for every generated formulation under the HSP constraints.
Exploration diversity increases markedly relative to baseline agents.
A novel photoresist developer is discovered that matches or exceeds commercial performance in lithography trials.
Arbitrarily deep search becomes possible within fixed token budgets by decoupling reasoning history from context length.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same sparse-MCTS and differentiable-physics loop could be applied to other high-dimensional formulation problems such as battery electrolytes or pharmaceutical blends.
Replacing the current HSP model with higher-fidelity molecular dynamics inside the differentiable engine would provide a direct test of whether physical accuracy improves downstream experimental success.
The memory-driven root reconfiguration might generalize to any long-horizon combinatorial design task where mode collapse is a risk.

Load-bearing premise

The HSP-based physical constraints together with the hybrid normalized loss and sparsity regularization are enough to ensure the generated formulations are thermodynamically feasible and practically effective in real use.

What would settle it

A laboratory test in which a formulation produced by the system violates thermodynamic stability, fails to meet HSP solubility requirements, or underperforms the commercial benchmark in actual lithography processing.

Figures

Figures reproduced from arXiv: 2603.03686 by Jiangyu Chen.

**Figure 1.** Figure 1: AI4S-SDS: AI for Science–Solvent Design System. The framework integrates LLMbased proposal generation, MCTS-based discrete search, and differentiable physics-informed ratio refinement. 4.2 Discrete Proposal Search AI4S-SDS performs search in the discrete topology space using a Monte Carlo Tree Search (MCTS) backbone. Each node stores only a lightweight semantic state, v = (a, r, n, Q), where a denotes the… view at source ↗

**Figure 2.** Figure 2: Diversity and distributional characteristics of generated formulations. (a) AI4S-SDS discovers more unique solvent topologies. (b) Higher Shannon entropy indicates more uniform exploration. (c) Lower top-5 usage concentration reflects reduced reliance on evaluator-favored templates. 5.3 Phase 3: Overcoming Global Mode Collapse via Planning Despite achieving high top scores, Naive MCTS exhibited severe glob… view at source ↗

**Figure 3.** Figure 3: Qualitative lithography comparison. Left: The commercial baseline is n-Butyl Acetate (nBA), a standard industry developer. Right: A representative formulation generated by AI4S-SDS. The example illustrates improved pattern definition under the tested conditions. This comparison serves as a proof-of-concept demonstration of the framework’s ability to identify candidates that satisfy the explicit physicochem… view at source ↗

read the original abstract

Automated design of chemical formulations is a cornerstone of materials science, yet it requires navigating a high-dimensional combinatorial space involving discrete compositional choices and continuous geometric constraints. Existing Large Language Model (LLM) agents face significant challenges in this setting, including context window limitations during long-horizon reasoning and path-dependent exploration that may lead to mode collapse. To address these issues, we introduce AI4S-SDS, a closed-loop neuro-symbolic framework that integrates multi-agent collaboration with a tailored Monte Carlo Tree Search (MCTS) engine. We propose a Sparse State Storage mechanism with Dynamic Path Reconstruction, which decouples reasoning history from context length and enables arbitrarily deep exploration under fixed token budgets. To reduce local convergence and improve coverage, we implement a Global--Local Search Strategy: a memory-driven planning module adaptively reconfigures the search root based on historical feedback, while a Sibling-Aware Expansion mechanism promotes orthogonal exploration at the node level. Furthermore, we bridge symbolic reasoning and physical feasibility through a Differentiable Physics Engine, employing a hybrid normalized loss with sparsity-inducing regularization to optimize continuous mixing ratios under thermodynamic constraints. Empirical results show that AI4S-SDS achieves full validity under the adopted HSP-based physical constraints and substantially improves exploration diversity compared to baseline agents. In preliminary lithography experiments, the framework identifies a novel photoresist developer formulation that demonstrates competitive or superior performance relative to a commercial benchmark, highlighting the potential of diversity-driven neuro-symbolic search for scientific discovery.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper shows a workable neuro-symbolic MCTS system for solvent design with some promising experimental output, but the reliance on HSP constraints leaves the real feasibility open to question.

read the letter

The key thing here is a neuro-symbolic framework that pairs a customized MCTS engine with a differentiable physics module to design solvents under physical constraints. It claims 100% validity and a new photoresist developer that matches or beats a commercial benchmark in lithography tests. What stands out as new are the Sparse State Storage and Dynamic Path Reconstruction to keep context fixed while allowing deep trees, along with the Sibling-Aware Expansion and Global-Local Search to boost diversity. These target the context window and mode collapse problems in LLM agents for long chemical reasoning chains. The hybrid loss with sparsity regularization in the physics engine is a direct attempt to make the outputs feasible. The approach looks like a solid engineering effort to make search more efficient in high-dimensional spaces. The multi-agent collaboration adds another layer for handling the discrete and continuous parts of formulation. On the downside, the physical constraints come from HSP parameters, which are rough empirical fits and don't capture kinetics or temperature effects that matter in real developers. The abstract mentions improved diversity and competitive performance but gives no specific metrics, baseline comparisons, or ablation results. Without those, it's difficult to tell how much the new mechanisms actually move the needle versus just satisfying the proxy constraints. The single preliminary experiment doesn't provide strong evidence either. This is aimed at people building AI tools for materials and chemical discovery. A reader focused on practical neuro-symbolic systems might get ideas from the search modifications. I would recommend sending it for peer review. The ideas are worth checking out in detail, particularly whether the physics alignment holds up under scrutiny.

Referee Report

3 major / 2 minor

Summary. The paper introduces AI4S-SDS, a closed-loop neuro-symbolic framework for automated chemical formulation design that integrates multi-agent collaboration with a Sparse Monte Carlo Tree Search (MCTS) engine featuring Dynamic Path Reconstruction, a Global-Local Search Strategy with memory-driven root reconfiguration, and Sibling-Aware Expansion. It further employs a Differentiable Physics Engine that optimizes continuous mixing ratios via a hybrid normalized loss with sparsity-inducing regularization under Hansen Solubility Parameter (HSP) thermodynamic constraints. The central claims are that the system achieves full validity under these constraints, substantially improves exploration diversity relative to baseline agents, and identifies a novel photoresist developer formulation with competitive or superior performance in preliminary lithography experiments.

Significance. If the empirical claims hold with rigorous validation, the work would advance neuro-symbolic AI for materials science by showing how sparse state storage and differentiable alignment can mitigate context-length and mode-collapse issues in long-horizon chemical design tasks. The combination of symbolic search with physics-informed optimization offers a promising template for generating practically relevant formulations, particularly if the experimental lithography result generalizes.

major comments (3)

[Abstract] Abstract: The assertions of 'full validity under the adopted HSP-based physical constraints' and 'substantially improves exploration diversity' are presented without any quantitative metrics (e.g., validity rates, diversity scores such as unique formulation counts or entropy measures), baseline agent details, statistical tests, or error bars, leaving the central performance claims unsupported by verifiable evidence.
[Differentiable Physics Engine] Differentiable Physics Engine section (described in abstract): The hybrid normalized loss with sparsity regularization is claimed to enforce thermodynamic feasibility, yet the manuscript supplies no ablation on individual loss components, no comparison against independent thermodynamic simulators (e.g., COSMO-RS or molecular dynamics), and no quantification of how HSP empirical approximations (which omit temperature dependence, kinetics, and higher-order interactions) affect real lithography outcomes; this directly undermines the validity and novelty claims.
[Experimental results] Experimental results (implied in abstract): The report of a 'novel photoresist developer formulation' demonstrating 'competitive or superior performance' relative to a commercial benchmark lacks any details on the experimental protocol, performance metrics (e.g., dissolution rates, contrast curves), number of trials, or statistical comparison, rendering the practical-utility claim impossible to evaluate.

minor comments (2)

[Abstract] Abstract: The notation 'Global--Local Search Strategy' uses an en-dash that may be rendered inconsistently; consider standardizing to 'Global-Local' or defining the terms explicitly in the main text.
[Method] The description of 'Sparse State Storage mechanism with Dynamic Path Reconstruction' would benefit from a concise pseudocode or diagram to clarify how reasoning history is decoupled from context length.

Simulated Author's Rebuttal

3 responses · 1 unresolved

We thank the referee for their constructive and detailed feedback. We address each major comment point by point below, indicating where revisions have been made to strengthen the manuscript.

read point-by-point responses

Referee: [Abstract] Abstract: The assertions of 'full validity under the adopted HSP-based physical constraints' and 'substantially improves exploration diversity' are presented without any quantitative metrics (e.g., validity rates, diversity scores such as unique formulation counts or entropy measures), baseline agent details, statistical tests, or error bars, leaving the central performance claims unsupported by verifiable evidence.

Authors: We agree that the abstract would be strengthened by explicit quantitative support for these claims. In the revised manuscript we have updated the abstract to reference the achieved validity rate of 100% under the HSP constraints, the measured diversity gains (via unique formulation counts and entropy), the specific baseline agents employed, and the statistical tests with error bars. These supporting details and tables remain in the main experimental section for full context. revision: yes
Referee: [Differentiable Physics Engine] Differentiable Physics Engine section (described in abstract): The hybrid normalized loss with sparsity regularization is claimed to enforce thermodynamic feasibility, yet the manuscript supplies no ablation on individual loss components, no comparison against independent thermodynamic simulators (e.g., COSMO-RS or molecular dynamics), and no quantification of how HSP empirical approximations (which omit temperature dependence, kinetics, and higher-order interactions) affect real lithography outcomes; this directly undermines the validity and novelty claims.

Authors: We partially concur. We have added an ablation study in the revised Section 3.2 that isolates the contribution of the normalized loss and the sparsity-inducing regularization terms to overall validity and mixture quality. Direct head-to-head comparisons against COSMO-RS or molecular dynamics were not performed, as the framework prioritizes efficient HSP-based constraints for closed-loop iteration; we now explicitly discuss this design choice and the known limitations of HSP approximations (temperature dependence, kinetics) in the updated discussion section. Full quantification of their downstream effect on lithography performance would require a separate, resource-intensive experimental campaign that lies beyond the scope of the present preliminary study. revision: partial
Referee: [Experimental results] Experimental results (implied in abstract): The report of a 'novel photoresist developer formulation' demonstrating 'competitive or superior performance' relative to a commercial benchmark lacks any details on the experimental protocol, performance metrics (e.g., dissolution rates, contrast curves), number of trials, or statistical comparison, rendering the practical-utility claim impossible to evaluate.

Authors: We thank the referee for highlighting this gap. The revised manuscript expands Section 5 to provide the complete lithography experimental protocol, including dissolution-rate and contrast-curve measurement procedures, the number of independent trials performed, and the statistical comparisons (including p-values) against the commercial benchmark. These details were previously only summarized; they are now presented in the main text to allow proper evaluation of the practical-utility claim. revision: yes

standing simulated objections not resolved

Direct comparisons against COSMO-RS or molecular-dynamics simulators and exhaustive quantification of HSP-approximation effects on real lithography outcomes, both of which would require substantial additional computational and experimental resources outside the current study.

Circularity Check

0 steps flagged

No circularity detected in derivation chain

full rationale

The paper presents a neuro-symbolic framework combining MCTS with a Differentiable Physics Engine that enforces HSP-based constraints via a hybrid loss. The reported full validity is presented as an outcome of this enforcement mechanism rather than an independent prediction derived from external data. No self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations appear in the provided abstract or described components. The central claims rest on the design of the engine and empirical runs against baselines, which are self-contained once the constraints and loss are accepted as modeling choices. No reduction of results to inputs by construction is exhibited.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that HSP constraints accurately capture thermodynamic feasibility and that the loss function produces practically useful mixtures; no free parameters or invented entities are explicitly quantified in the abstract.

axioms (1)

domain assumption HSP-based physical constraints accurately represent thermodynamic feasibility for solvent mixtures
Invoked to enforce validity of all generated formulations via the differentiable physics engine.

pith-pipeline@v0.9.0 · 5562 in / 1199 out tokens · 58057 ms · 2026-05-15T17:30:05.685940+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

hybrid normalized loss with sparsity-inducing regularization to optimize continuous mixing ratios under thermodynamic constraints... Ltotal = Lthermo + Lkinetics + Lentropy
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Hansen Solubility Parameters (HSP) theory... Ra distance... linear mixing rule

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

18 extracted references · 18 canonical work pages · 2 internal anchors

[1]

Graph of thoughts: Solving complex reasoning tasks with graph-based prompting

Maciej Besta, Nils Blach, Ales Kubicek, et al. Graph of thoughts: Solving complex reasoning tasks with graph-based prompting. InProceedings of the AAAI Conference on Artificial Intelligence, volume 38, pages 17682–17690, 2024

work page 2024
[2]

Autonomous chemical research with large language models.Nature, 624(7992):570–578, 2024

Daniil A Boiko, Robert MacKnight, Ben Kline, and Gabe Gomes. Autonomous chemical research with large language models.Nature, 624(7992):570–578, 2024

work page 2024
[3]

Chemcrow: Augmenting large-language models with chemistry tools.Nature Machine Intelligence, 6:525–531, 2024

Andres M Bran, Sam Cox, Philippe Schwaller, Teodoro Laino, et al. Chemcrow: Augmenting large-language models with chemistry tools.Nature Machine Intelligence, 6:525–531, 2024

work page 2024
[4]

A Tutorial on Bayesian Optimization

Peter I Frazier. A tutorial on bayesian optimization.arXiv preprint arXiv:1807.02811, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[5]

Automatic chemical design using a data-driven continuous representation of molecules.ACS central science, 4(2):268–276, 2018

Rafael Gómez-Bombarelli, Jennifer N Wei, David Duvenaud, José Miguel Hernández-Lobato, Benjamín Sánchez-Lengeling, Dennis Sheberla, Jorge Aguilera-Iparraguirre, Timothy D Hirzel, Ryan P Adams, and Alán Aspuru-Guzik. Automatic chemical design using a data-driven continuous representation of molecules.ACS central science, 4(2):268–276, 2018

work page 2018
[6]

CRC press, 2007

Charles M Hansen.Hansen solubility parameters: a user’s handbook. CRC press, 2007

work page 2007
[7]

Physics-informed machine learning.Nature Reviews Physics, 3(6):422–440, 2021

George Em Karniadakis, Ioannis G Kevrekidis, Lu Lu, Paris Perdikaris, Sifan Wang, and Liu Yang. Physics-informed machine learning.Nature Reviews Physics, 3(6):422–440, 2021

work page 2021
[8]

Reasoning via planning: Integrating MCTS with language models

Honglu Lu, Zhen Liu, Liang Wang, et al. Reasoning via planning: Integrating MCTS with language models. InProceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 8146–8165, 2023

work page 2023
[9]

Molecular de-novo design through deep reinforcement learning.Journal of cheminformatics, 9(1):48, 2017

Marcus Olivecrona, Thomas Blaschke, Ola Engkvist, and Hongming Chen. Molecular de-novo design through deep reinforcement learning.Journal of cheminformatics, 9(1):48, 2017

work page 2017
[10]

MemGPT: Towards LLMs as Operating Systems

Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, and Joseph E. Gonzalez. MemGPT: Towards llms as operating systems.arXiv preprint arXiv:2310.08560, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[11]

Deep reinforcement learning for de novo drug design.Science advances, 4(7):eaap7885, 2018

Mariya Popova, Olexandr Isayev, and Alexander Tropsha. Deep reinforcement learning for de novo drug design.Science advances, 4(7):eaap7885, 2018. 12

work page 2018
[12]

Planning chemical syntheses with deep neural networks and symbolic ai.Nature, 555(7698):604–610, 2018

Marwin HS Segler, Mike Preuss, and Mark P Waller. Planning chemical syntheses with deep neural networks and symbolic ai.Nature, 555(7698):604–610, 2018

work page 2018
[13]

Applications of genetic algorithms in chemistry and chemoinformatics.Wiley Interdisciplinary Reviews: Computational Molecular Science, 1 (3):317–324, 2011

Robert P Sheridan and Stephen K Kearsley. Applications of genetic algorithms in chemistry and chemoinformatics.Wiley Interdisciplinary Reviews: Computational Molecular Science, 1 (3):317–324, 2011

work page 2011
[14]

Reflexion: Language agents with verbal reinforcement learning.Advances in Neural Information Processing Systems, 36:8634–8652, 2023

Noah Shinn, Federico Cassano, Ashwin Gopinath, Karthik Narasimhan, and Shunyu Yao. Reflexion: Language agents with verbal reinforcement learning.Advances in Neural Information Processing Systems, 36:8634–8652, 2023

work page 2023
[15]

Large language models as optimizers

Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V Le, Denny Zhou, and Xinyun Chen. Large language models as optimizers. InInternational Conference on Learning Repre- sentations (ICLR), 2024

work page 2024
[16]

React: Synergizing reasoning and acting in language models

Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik R Narasimhan, and Yuan Cao. React: Synergizing reasoning and acting in language models. InThe eleventh international conference on learning representations, 2022

work page 2022
[17]

Tree of thoughts: Deliberate problem solving with large language models

Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L Griffiths, Yuan Cao, and Karthik Narasimhan. Tree of thoughts: Deliberate problem solving with large language models. In Advances in Neural Information Processing Systems (NeurIPS), volume 36, 2023

work page 2023
[18]

Chemllm: A chemical large language model

Di Zhang, Wei Wei, Yang Liu, et al. Chemllm: A chemical large language model.arXiv preprint arXiv:2402.06852, 2024. A Solvent library B Formulation Advisor Prompt B.1 Role Definition Role You are theFormulation Advisor, appointed by the Materials Science Review Committee. Your responsibility is not to “veto” solutions, but to help the Generator identify r...

work page arXiv 2024

[1] [1]

Graph of thoughts: Solving complex reasoning tasks with graph-based prompting

Maciej Besta, Nils Blach, Ales Kubicek, et al. Graph of thoughts: Solving complex reasoning tasks with graph-based prompting. InProceedings of the AAAI Conference on Artificial Intelligence, volume 38, pages 17682–17690, 2024

work page 2024

[2] [2]

Autonomous chemical research with large language models.Nature, 624(7992):570–578, 2024

Daniil A Boiko, Robert MacKnight, Ben Kline, and Gabe Gomes. Autonomous chemical research with large language models.Nature, 624(7992):570–578, 2024

work page 2024

[3] [3]

Chemcrow: Augmenting large-language models with chemistry tools.Nature Machine Intelligence, 6:525–531, 2024

Andres M Bran, Sam Cox, Philippe Schwaller, Teodoro Laino, et al. Chemcrow: Augmenting large-language models with chemistry tools.Nature Machine Intelligence, 6:525–531, 2024

work page 2024

[4] [4]

A Tutorial on Bayesian Optimization

Peter I Frazier. A tutorial on bayesian optimization.arXiv preprint arXiv:1807.02811, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[5] [5]

Automatic chemical design using a data-driven continuous representation of molecules.ACS central science, 4(2):268–276, 2018

Rafael Gómez-Bombarelli, Jennifer N Wei, David Duvenaud, José Miguel Hernández-Lobato, Benjamín Sánchez-Lengeling, Dennis Sheberla, Jorge Aguilera-Iparraguirre, Timothy D Hirzel, Ryan P Adams, and Alán Aspuru-Guzik. Automatic chemical design using a data-driven continuous representation of molecules.ACS central science, 4(2):268–276, 2018

work page 2018

[6] [6]

CRC press, 2007

Charles M Hansen.Hansen solubility parameters: a user’s handbook. CRC press, 2007

work page 2007

[7] [7]

Physics-informed machine learning.Nature Reviews Physics, 3(6):422–440, 2021

George Em Karniadakis, Ioannis G Kevrekidis, Lu Lu, Paris Perdikaris, Sifan Wang, and Liu Yang. Physics-informed machine learning.Nature Reviews Physics, 3(6):422–440, 2021

work page 2021

[8] [8]

Reasoning via planning: Integrating MCTS with language models

Honglu Lu, Zhen Liu, Liang Wang, et al. Reasoning via planning: Integrating MCTS with language models. InProceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 8146–8165, 2023

work page 2023

[9] [9]

Molecular de-novo design through deep reinforcement learning.Journal of cheminformatics, 9(1):48, 2017

Marcus Olivecrona, Thomas Blaschke, Ola Engkvist, and Hongming Chen. Molecular de-novo design through deep reinforcement learning.Journal of cheminformatics, 9(1):48, 2017

work page 2017

[10] [10]

MemGPT: Towards LLMs as Operating Systems

Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, and Joseph E. Gonzalez. MemGPT: Towards llms as operating systems.arXiv preprint arXiv:2310.08560, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[11] [11]

Deep reinforcement learning for de novo drug design.Science advances, 4(7):eaap7885, 2018

Mariya Popova, Olexandr Isayev, and Alexander Tropsha. Deep reinforcement learning for de novo drug design.Science advances, 4(7):eaap7885, 2018. 12

work page 2018

[12] [12]

Planning chemical syntheses with deep neural networks and symbolic ai.Nature, 555(7698):604–610, 2018

Marwin HS Segler, Mike Preuss, and Mark P Waller. Planning chemical syntheses with deep neural networks and symbolic ai.Nature, 555(7698):604–610, 2018

work page 2018

[13] [13]

Applications of genetic algorithms in chemistry and chemoinformatics.Wiley Interdisciplinary Reviews: Computational Molecular Science, 1 (3):317–324, 2011

Robert P Sheridan and Stephen K Kearsley. Applications of genetic algorithms in chemistry and chemoinformatics.Wiley Interdisciplinary Reviews: Computational Molecular Science, 1 (3):317–324, 2011

work page 2011

[14] [14]

Reflexion: Language agents with verbal reinforcement learning.Advances in Neural Information Processing Systems, 36:8634–8652, 2023

Noah Shinn, Federico Cassano, Ashwin Gopinath, Karthik Narasimhan, and Shunyu Yao. Reflexion: Language agents with verbal reinforcement learning.Advances in Neural Information Processing Systems, 36:8634–8652, 2023

work page 2023

[15] [15]

Large language models as optimizers

Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V Le, Denny Zhou, and Xinyun Chen. Large language models as optimizers. InInternational Conference on Learning Repre- sentations (ICLR), 2024

work page 2024

[16] [16]

React: Synergizing reasoning and acting in language models

Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik R Narasimhan, and Yuan Cao. React: Synergizing reasoning and acting in language models. InThe eleventh international conference on learning representations, 2022

work page 2022

[17] [17]

Tree of thoughts: Deliberate problem solving with large language models

Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L Griffiths, Yuan Cao, and Karthik Narasimhan. Tree of thoughts: Deliberate problem solving with large language models. In Advances in Neural Information Processing Systems (NeurIPS), volume 36, 2023

work page 2023

[18] [18]

Chemllm: A chemical large language model

Di Zhang, Wei Wei, Yang Liu, et al. Chemllm: A chemical large language model.arXiv preprint arXiv:2402.06852, 2024. A Solvent library B Formulation Advisor Prompt B.1 Role Definition Role You are theFormulation Advisor, appointed by the Materials Science Review Committee. Your responsibility is not to “veto” solutions, but to help the Generator identify r...

work page arXiv 2024