Objective-Reinforced Generative Adversarial Networks (ORGAN) for Sequence Generation Models

Gabriel Lima Guimaraes , Benjamin Sanchez-Lengeling , Carlos Outeiral , Pedro Luis Cunha Farias , Al\'an Aspuru-Guzik

Authors on Pith no claims yet

classification 📊 stat.ML cs.LG

keywords generationdatametricsmodelordertowardsadversarialbias

read the original abstract

In unsupervised data generation tasks, besides the generation of a sample based on previous observations, one would often like to give hints to the model in order to bias the generation towards desirable metrics. We propose a method that combines Generative Adversarial Networks (GANs) and reinforcement learning (RL) in order to accomplish exactly that. While RL biases the data generation process towards arbitrary metrics, the GAN component of the reward function ensures that the model still remembers information learned from data. We build upon previous results that incorporated GANs and RL in order to generate sequence data and test this model in several settings for the generation of molecules encoded as text sequences (SMILES) and in the context of music generation, showing for each case that we can effectively bias the generation process towards desired metrics.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

How Creative Are Large Language Models in Generating Molecules?
cs.CL 2026-04 unverdicted novelty 7.0

Large language models exhibit distinct creative patterns in molecule generation, including higher constraint satisfaction when more constraints are added, and this is the first work to reframe molecule generation abil...
MolWorld: Molecule World Models for Actionable Molecular Optimization
cs.LG 2026-05 unverdicted novelty 6.0

MolWorld expands a molecule-transfer graph using a world model to discover high-property molecules that maintain strong structural connectivity to known compounds for actionable optimization.