pith. sign in

arxiv: 2103.10432 · v1 · pith:WLD74LOEnew · submitted 2021-03-18 · 🧬 q-bio.BM · cs.CE· cs.LG

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery

classification 🧬 q-bio.BM cs.CEcs.LG
keywords marsmolecularchemicaldiscoverydrugmulti-objectivecandidatesgraphs
0
0 comments X
read the original abstract

Searching for novel molecules with desired chemical properties is crucial in drug discovery. Existing work focuses on developing neural models to generate either molecular sequences or chemical graphs. However, it remains a big challenge to find novel and diverse compounds satisfying several properties. In this paper, we propose MARS, a method for multi-objective drug molecule discovery. MARS is based on the idea of generating the chemical candidates by iteratively editing fragments of molecular graphs. To search for high-quality candidates, it employs Markov chain Monte Carlo sampling (MCMC) on molecules with an annealing scheme and an adaptive proposal. To further improve sample efficiency, MARS uses a graph neural network (GNN) to represent and select candidate edits, where the GNN is trained on-the-fly with samples from MCMC. Experiments show that MARS achieves state-of-the-art performance in various multi-objective settings where molecular bio-activity, drug-likeness, and synthesizability are considered. Remarkably, in the most challenging setting where all four objectives are simultaneously optimized, our approach outperforms previous methods significantly in comprehensive evaluations. The code is available at https://github.com/yutxie/mars.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. AtomComposer: Discovering Chemical Space from First Principles with Reinforcement Learning

    cs.LG 2026-05 unverdicted novelty 8.0

    AtomComposer uses online RL with multi-composition training to discover up to 10x more valid 3D isomers on unseen chemical formulas than single-composition baselines.

  2. Controllable Molecular Generative Foundation Models

    cs.LG 2026-05 unverdicted novelty 5.0

    CoMole uses a motif-aware graph diffusion pipeline with RL to rank first in controllability on nine targets across materials and drug benchmarks while keeping validity above 0.94 without post-processing.

  3. Controllable Molecular Generative Foundation Models

    cs.LG 2026-05 unverdicted novelty 5.0

    CoMole combines motif-aware graph diffusion with RL policy optimization to deliver controllable molecular generation that outperforms baselines on nine targets across materials and drug benchmarks while keeping high validity.