Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization

2024 · cs.LG · arXiv 2410.19471

2 Pith papers cite this work. Polarity classification is still indexing.


abstract

Inverse folding models play an important role in structure-based design by predicting amino acid sequences that fold into desired reference structures. Models like ProteinMPNN, a message-passing encoder-decoder model, are trained to reliably produce new sequences from a reference structure. However, when applied to peptides, these models are prone to generating repetitive sequences that do not fold into the reference structure. To address this, we fine-tune ProteinMPNN to produce diverse and structurally consistent peptide sequences via Direct Preference Optimization (DPO). We derive two enhancements to DPO: online diversity regularization and domain-specific priors. Additionally, we develop a new understanding of how to improve diversity in decoder models. When conditioned on OpenFold-generated structures, our fine-tuned models achieve state-of-the-art structural similarity scores, improving on base ProteinMPNN by at least 8%. Compared to standard DPO, our regularized method achieves up to 20% higher sequence diversity with no loss in structural similarity score.
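
For readers unfamiliar with the baseline the abstract compares against, the following is a minimal sketch of the standard DPO loss for a single preference pair. It covers only standard DPO. The paper's online diversity regularization and domain-specific priors are not described in this listing and are not reproduced here. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions, not taken from the paper or from ProteinMPNN.

```python
import math


def dpo_loss(policy_logp_chosen, policy_logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) sequence pair.

    Inputs are sequence-level log-probabilities under the fine-tuned
    policy and the frozen reference model. All names here are
    hypothetical; beta=0.1 is an arbitrary illustrative default.
    """
    # How much more the policy favors each sequence than the reference does
    chosen_logratio = policy_logp_chosen - ref_logp_chosen
    rejected_logratio = policy_logp_rejected - ref_logp_rejected
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)), written as a numerically stable softplus(-margin)
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy and the reference assign identical log-probabilities, the margin is zero and the loss equals log 2. The loss falls as the policy shifts probability toward the preferred sequence relative to the reference.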

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Derives a conditional-marginal entropy-rate objective for bridge-aware discretization that yields U-shaped schedules and improves low-NFE sample quality on 2D, CIFAR-10, and protein tasks.

Pushing Biomolecular Utility-Diversity Frontiers with Supergroup Relative Policy Optimization

cs.CE · 2026-05-09 · conditional · novelty 6.0 · 2 refs

SGRPO is a GRPO-style framework that constructs set-level diversity rewards via supergroup sampling and leave-one-out redistribution to expand the utility-diversity Pareto frontier in biomolecular design tasks.

citing papers explorer

Showing 2 of 2 citing papers.

Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schrödinger Samplers · cs.LG · 2026-05-15 · unverdicted · none · ref 39 · internal anchor
Pushing Biomolecular Utility-Diversity Frontiers with Supergroup Relative Policy Optimization · cs.CE · 2026-05-09 · conditional · none · ref 41 · 2 links · internal anchor
