PepALD: Macrocyclic Peptide Generation via Autoregressive Latent Diffusion

Junming Zhang; Siyu Yi; Wei Ju; Zhonghui Gu

arxiv: 2606.14510 · v1 · pith:HPK7NVMGnew · submitted 2026-06-12 · 💻 cs.LG · q-bio.BM

PepALD: Macrocyclic Peptide Generation via Autoregressive Latent Diffusion

Junming Zhang , Siyu Yi , Wei Ju , Zhonghui Gu This is my paper

Pith reviewed 2026-06-27 04:57 UTC · model grok-4.3

classification 💻 cs.LG q-bio.BM

keywords macrocyclic peptideslatent diffusionautoregressive generationpeptide designde novo generationpreference optimizationchemical embeddingsring closure prediction

0 comments

The pith

PepALD generates macrocyclic peptides by diffusing residues in a chemically structured latent space while predicting ring closures and aligning to affinity rewards.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents PepALD as a way to design macrocyclic peptides that can reach intracellular targets by handling non-natural chemistry, ring shape, permeability, and binding at once. Existing string-based models struggle because they either work at the atom level or treat monomers as abstract symbols without chemical detail. PepALD instead embeds monomers with their chemical structure, uses diffusion to generate each residue in a latent space conditioned on context, adds ring closures that respect R-groups, and tunes the process toward better binding using a specialized preference method. If this works, it would produce peptide candidates that better match therapeutic needs than current generators. This matters for creating drugs that standard methods cannot easily reach.

Core claim

PepALD is an Autoregressive Latent Diffusion foundation model for de novo macrocyclic peptide generation. The model represents HELM monomers with structured chemical embeddings, generates each residue through context-conditioned diffusion in chemically informed latent space, predicts R-group-aware ring closures during autoregressive generation, and aligns the denoiser to affinity rewards using winner-protected diffusion-adapted preference optimization. In silico experiments demonstrate PepALD's generation quality and reward-optimization performance against representative peptide generation baselines.

What carries the argument

Autoregressive Latent Diffusion that performs context-conditioned diffusion over structured chemical embeddings of monomers and incorporates R-group-aware ring closure prediction plus winner-protected preference optimization to enforce topology and affinity.

If this is right

Generated peptides would more reliably include non-natural monomers while maintaining valid ring topology.
The preference optimization step would shift the output distribution toward higher measured binding affinity.
Context conditioning during diffusion would allow control over sequence properties like permeability without post-hoc filtering.
Ring closure prediction integrated in the autoregressive loop would reduce invalid cyclic structures compared to string-only models.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same latent diffusion setup could be tested on linear peptides or other oligomers to check if the ring-specific components are essential.
Pairing the generator with molecular dynamics simulations of permeability could create a closed-loop design process.
Extending the chemical embeddings to include explicit 3D conformer information might improve downstream docking accuracy.

Load-bearing premise

The structured chemical embeddings, latent diffusion process, ring closure prediction, and preference optimization together produce the claimed gains in quality and alignment without separate tests isolating each piece.

What would settle it

Running the same in silico benchmarks with an ablated version that removes the chemical embeddings or the latent diffusion step and finding no drop in validity, diversity, or affinity scores relative to the full model.

Figures

Figures reproduced from arXiv: 2606.14510 by Junming Zhang, Siyu Yi, Wei Ju, Zhonghui Gu.

**Figure 2.** Figure 2: Comparison of monomer chemical-space coverage between the PepALD and HELM [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗

**Figure 3.** Figure 3: Peptide-level chemical-space coverage after permeability-oriented generation, shown as an MDS projection of pairwise molecular-fingerprint distances for valid samples from PepTune, HELMGPTperm, and PepALDperm [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗

**Figure 4.** Figure 4: Solubility and permeability evaluation of PepALD [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗

**Figure 5.** Figure 5: Target-specific WP-DPO optimization redirects PepALD toward receptor-favorable [PITH_FULL_IMAGE:figures/full_fig_p015_5.png] view at source ↗

read the original abstract

Macrocyclic peptides are promising therapeutic candidates for intracellular targets, but their design requires simultaneous control over non-natural monomer chemistry, ring topology, membrane permeability, and target binding. Existing SMILES- or HELM-string generative models either operate in long atom-level sequence spaces or treat monomers as symbolic tokens with limited chemical grounding. We introduce PepALD, an Autoregressive Latent Diffusion (ALD) foundation model for \textit{de novo} macrocyclic peptide generation. The model represents HELM monomers with structured chemical embeddings, generates each residue through context-conditioned diffusion in chemically informed latent space, predicts R-group-aware ring closures during autoregressive generation, and aligns the denoiser to affinity rewards using winner-protected diffusion-adapted preference optimization. In silico experiments demonstrate PepALD's generation quality and reward-optimization performance against representative peptide generation baselines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

PepALD sketches a plausible autoregressive latent diffusion setup for macrocyclic peptides but the abstract gives no data, baselines, or ablations to show the four claimed components actually help.

read the letter

The one thing to know is that this paper puts forward PepALD, which fuses autoregressive generation, latent diffusion, HELM chemical embeddings, R-group ring closure prediction, and a winner-protected preference optimization step for de novo macrocyclic peptide design. The problem it targets is legitimate: existing string models struggle with non-natural monomers, topology, and binding simultaneously.

What the work does is lay out a concrete architecture that tries to keep chemical structure in the loop rather than treating monomers as pure symbols. That framing is reasonable and the four listed pieces (embeddings, context-conditioned latent diffusion, autoregressive closure, and adapted preference opt) are presented as the distinguishing moves.

The soft spot is exactly what the stress-test note flags. The abstract asserts that in silico experiments beat representative baselines on generation quality and reward alignment, yet supplies zero methods, metrics, tables, or ablation results. Without those, there is no way to check whether any of the four components drove the gains or whether the whole thing reduces to standard diffusion plus some domain glue. The full text is referenced but not supplied here, so the evaluation stays provisional.

This is aimed at groups already working on generative models for therapeutic peptides. A reader hunting for new architecture sketches might skim it for the embedding and closure ideas. Anyone needing reproducible evidence or validated improvements will find nothing to cite or build on yet.

I would not send it to peer review in its current form; the central performance claim is unsupported. If the full manuscript adds proper experiments, controls, and ablations, then yes.

Referee Report

2 major / 0 minor

Summary. The paper introduces PepALD, an autoregressive latent diffusion foundation model for de novo macrocyclic peptide generation. It represents HELM monomers via structured chemical embeddings, performs context-conditioned diffusion in latent space, predicts R-group-aware ring closures autoregressively, and aligns the denoiser to affinity rewards via winner-protected diffusion-adapted preference optimization. The central claim is that in silico experiments demonstrate superior generation quality and reward-optimization performance relative to representative peptide generation baselines.

Significance. If the claimed performance gains are substantiated with detailed methods, metrics, baselines, and ablations, the work could advance chemically grounded generative modeling for macrocyclic peptides by combining latent diffusion with autoregressive structure prediction and preference optimization. The approach addresses limitations of SMILES/HELM string models through monomer-level chemical embeddings and topology handling. No machine-checked proofs, open code, or parameter-free derivations are described.

major comments (2)

[Abstract] Abstract: The central claim that 'in silico experiments demonstrate PepALD's generation quality and reward-optimization performance' is unsupported because the abstract (and visible text) supplies no methods, datasets, metrics, baselines, error analysis, or quantitative results. This prevents any assessment of whether the four listed components produce measurable improvements.
[Abstract] Abstract: The four technical contributions (structured chemical embeddings, context-conditioned latent diffusion, autoregressive ring-closure prediction, winner-protected preference optimization) are presented as distinguishing features, yet no ablation studies, incremental-result tables, or implementation equations are referenced to isolate their individual effects versus baselines.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on the abstract. We agree that the abstract should provide sufficient context on methods, metrics, and results to support the central claims, and we will revise it accordingly in the next version. The full paper contains the requested details in the methods, experiments, and results sections.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim that 'in silico experiments demonstrate PepALD's generation quality and reward-optimization performance' is unsupported because the abstract (and visible text) supplies no methods, datasets, metrics, baselines, error analysis, or quantitative results. This prevents any assessment of whether the four listed components produce measurable improvements.

Authors: We agree that the abstract as currently written does not include the requested details. In the revised manuscript we will expand the abstract to concisely reference the evaluation datasets (e.g., macrocyclic peptide libraries with affinity labels), key metrics (validity, diversity, ring-closure accuracy, reward alignment scores), representative baselines (SMILES-based and HELM-based generative models), and quantitative gains (e.g., relative improvements on affinity optimization). Full methods, error analysis, and tables remain in Sections 4 and 5. revision: yes
Referee: [Abstract] Abstract: The four technical contributions (structured chemical embeddings, context-conditioned latent diffusion, autoregressive ring-closure prediction, winner-protected preference optimization) are presented as distinguishing features, yet no ablation studies, incremental-result tables, or implementation equations are referenced to isolate their individual effects versus baselines.

Authors: We acknowledge the abstract does not cite ablations or equations. The revised abstract will include a brief statement that component-wise ablations (detailed in Section 5.3) isolate the contribution of each module, with incremental tables showing performance deltas relative to baselines. Implementation equations for the chemical embeddings, latent diffusion process, autoregressive ring closure, and winner-protected preference optimization are already provided in Sections 3.1–3.4 and will be cross-referenced. revision: yes

Circularity Check

0 steps flagged

No circularity detected; claims rest on empirical results without self-referential derivations

full rationale

The paper describes an autoregressive latent diffusion model for peptide generation with components including chemical embeddings, context-conditioned diffusion, ring closure prediction, and preference optimization. No equations, fitting procedures, or derivation chains are presented in the abstract or summary that reduce any claimed prediction or result to its own inputs by construction. The central claims concern in silico experimental performance against baselines, which are presented as independent empirical outcomes rather than tautological renamings or self-citations. No load-bearing self-citation chains, ansatzes smuggled via citation, or uniqueness theorems imported from prior author work are visible. This is the expected outcome for an applied ML architecture paper whose validation is external to any internal derivation.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only input supplies no information on free parameters, background axioms, or new postulated entities.

pith-pipeline@v0.9.1-grok · 5676 in / 1109 out tokens · 39236 ms · 2026-06-27T04:57:36.962347+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

26 extracted references · 1 canonical work pages · 1 internal anchor

[1]

and Buchwald, P

Bojadzic, D. and Buchwald, P. (2018). Toward small-molecule inhibition of protein--protein interactions: General aspects and recent progress in targeting costimulatory and coinhibitory (immune checkpoint) interactions. Current Topics in Medicinal Chemistry\/ , 18 (8), 674--699

2018
[2]

and Goldstein, J

Carbonell, J. and Goldstein, J. (1998). The use of MMR , diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval\/ , pages 335--336

1998
[3]

Gare, C. L. et al. (2025). From lead to market: chemical approaches to transform peptides into therapeutics. Trends in Biochemical Sciences\/

2025
[4]

Geylan, G. et al. (2025). Pepinvent: generative peptide design beyond natural amino acids. Chemical Science\/ , 16 (20), 8682--8696

2025
[5]

Hickey, J. L. et al. (2023). Beyond 20 in the 21st century: prospects and challenges of non-canonical amino acids in peptide drug discovery. ACS Medicinal Chemistry Letters\/ , 14 (5), 557--565

2023
[6]

Ho, J. et al. (2020). Denoising diffusion probabilistic models. In Advances in Neural Information Processing Systems\/ , volume 33, pages 6840--6851

2020
[7]

Li, J. et al. (2023). CycPeptMPDB : A comprehensive database of membrane permeability of cyclic peptides. Journal of Chemical Information and Modeling\/ , 63 (7), 2240--2250

2023
[8]

Mendez, D. et al. (2019). ChEMBL : towards direct deposition of bioassay data. Nucleic Acids Research\/ , 47 (D1), D930--D940

2019
[9]

Pal, A. et al. (2024). Smaug: Fixing failure modes of preference optimisation with dpo-positive, 2024. URL https://arxiv. org/abs/2402.13228\/

work page internal anchor Pith review Pith/arXiv arXiv 2024
[10]

Polykovskiy, D. et al. (2020). Molecular Sets (MOSES) : A benchmarking platform for molecular generation models. Frontiers in Pharmacology\/ , 11 , 565644

2020
[11]

Rafailov, R. et al. (2023). Direct preference optimization: Your language model is secretly a reward model. In Advances in Neural Information Processing Systems\/ , volume 36, pages 53728--53741

2023
[12]

Rettie, S. A. et al. (2025a). Accurate de novo design of high-affinity protein-binding macrocycles using deep learning. Nature Chemical Biology\/ , pages 1--9
[13]

Rettie, S. A. et al. (2025b). Cyclic peptide structure prediction and design using alphafold2. Nature Communications\/ , 16 (1), 4730
[14]

Sadek, M. M. et al. (2018). A cyclic peptide inhibitor of the iNOS -- SPSB protein--protein interaction as a potential anti-infective agent. ACS Chemical Biology\/ , 13 (10), 2930--2938

2018
[15]

Song, J. et al. (2021). Denoising diffusion implicit models. In International Conference on Learning Representations\/

2021
[16]

Tang, S. et al. (2025). PepTune : De novo generation of therapeutic peptides with multi-objective-guided discrete diffusion. In Proceedings of the 42nd International Conference on Machine Learning\/ , volume 267 of Proceedings of Machine Learning Research\/ , pages 59017--59065. PMLR

2025
[17]

van Neer, R. H. P. et al. (2025). Active- and allosteric-site cyclic peptide inhibitors of secreted M. tuberculosis chorismate mutase. ACS Infectious Diseases\/ , 11 (3), 703--714

2025
[18]

Vaswani, A. et al. (2017). Attention is all you need. In Advances in Neural Information Processing Systems\/ , volume 30

2017
[19]

Wallace, B. et al. (2024). Diffusion model alignment using direct preference optimization. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition\/ , pages 8228--8238

2024
[20]

Wang, R. et al. (2026). DrugBLIP : exploring the protein--molecule interaction mechanisms with a multi-task learning graph transformer. Bioinformatics\/ , 42 (4), btag069

2026
[21]

Weininger, D. (1988). SMILES , a chemical language and information system. 1. introduction to methodology and encoding rules. Journal of Chemical Information and Computer Sciences\/ , 28 (1), 31--36

1988
[22]

Xu, X. et al. (2024). HELM-GPT : de novo macrocyclic peptide design using generative pre-trained transformer. Bioinformatics\/ , 40 (6), btae364

2024
[23]

Yu, Y. et al. (2023). Uni-Dock : GPU -accelerated docking enables ultralarge virtual screening. Journal of Chemical Theory and Computation\/ , 19 (11), 3336--3345

2023
[24]

Zhang, T. et al. (2012). HELM : a hierarchical notation language for complex biomolecule structure representation. Journal of Chemical Information and Modeling\/ , 52 (10), 2796--2806

2012
[25]

Zhou, G. et al. (2023). Uni-Mol : a universal 3d molecular representation learning framework. In International Conference on Learning Representations\/

2023
[26]

Zorzi, A. et al. (2017). Cyclic peptide therapeutics: past, present and future. Current Opinion in Chemical Biology\/ , 38 , 24--29

2017

[1] [1]

and Buchwald, P

Bojadzic, D. and Buchwald, P. (2018). Toward small-molecule inhibition of protein--protein interactions: General aspects and recent progress in targeting costimulatory and coinhibitory (immune checkpoint) interactions. Current Topics in Medicinal Chemistry\/ , 18 (8), 674--699

2018

[2] [2]

and Goldstein, J

Carbonell, J. and Goldstein, J. (1998). The use of MMR , diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval\/ , pages 335--336

1998

[3] [3]

Gare, C. L. et al. (2025). From lead to market: chemical approaches to transform peptides into therapeutics. Trends in Biochemical Sciences\/

2025

[4] [4]

Geylan, G. et al. (2025). Pepinvent: generative peptide design beyond natural amino acids. Chemical Science\/ , 16 (20), 8682--8696

2025

[5] [5]

Hickey, J. L. et al. (2023). Beyond 20 in the 21st century: prospects and challenges of non-canonical amino acids in peptide drug discovery. ACS Medicinal Chemistry Letters\/ , 14 (5), 557--565

2023

[6] [6]

Ho, J. et al. (2020). Denoising diffusion probabilistic models. In Advances in Neural Information Processing Systems\/ , volume 33, pages 6840--6851

2020

[7] [7]

Li, J. et al. (2023). CycPeptMPDB : A comprehensive database of membrane permeability of cyclic peptides. Journal of Chemical Information and Modeling\/ , 63 (7), 2240--2250

2023

[8] [8]

Mendez, D. et al. (2019). ChEMBL : towards direct deposition of bioassay data. Nucleic Acids Research\/ , 47 (D1), D930--D940

2019

[9] [9]

Pal, A. et al. (2024). Smaug: Fixing failure modes of preference optimisation with dpo-positive, 2024. URL https://arxiv. org/abs/2402.13228\/

work page internal anchor Pith review Pith/arXiv arXiv 2024

[10] [10]

Polykovskiy, D. et al. (2020). Molecular Sets (MOSES) : A benchmarking platform for molecular generation models. Frontiers in Pharmacology\/ , 11 , 565644

2020

[11] [11]

Rafailov, R. et al. (2023). Direct preference optimization: Your language model is secretly a reward model. In Advances in Neural Information Processing Systems\/ , volume 36, pages 53728--53741

2023

[12] [12]

Rettie, S. A. et al. (2025a). Accurate de novo design of high-affinity protein-binding macrocycles using deep learning. Nature Chemical Biology\/ , pages 1--9

[13] [13]

Rettie, S. A. et al. (2025b). Cyclic peptide structure prediction and design using alphafold2. Nature Communications\/ , 16 (1), 4730

[14] [14]

Sadek, M. M. et al. (2018). A cyclic peptide inhibitor of the iNOS -- SPSB protein--protein interaction as a potential anti-infective agent. ACS Chemical Biology\/ , 13 (10), 2930--2938

2018

[15] [15]

Song, J. et al. (2021). Denoising diffusion implicit models. In International Conference on Learning Representations\/

2021

[16] [16]

Tang, S. et al. (2025). PepTune : De novo generation of therapeutic peptides with multi-objective-guided discrete diffusion. In Proceedings of the 42nd International Conference on Machine Learning\/ , volume 267 of Proceedings of Machine Learning Research\/ , pages 59017--59065. PMLR

2025

[17] [17]

van Neer, R. H. P. et al. (2025). Active- and allosteric-site cyclic peptide inhibitors of secreted M. tuberculosis chorismate mutase. ACS Infectious Diseases\/ , 11 (3), 703--714

2025

[18] [18]

Vaswani, A. et al. (2017). Attention is all you need. In Advances in Neural Information Processing Systems\/ , volume 30

2017

[19] [19]

Wallace, B. et al. (2024). Diffusion model alignment using direct preference optimization. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition\/ , pages 8228--8238

2024

[20] [20]

Wang, R. et al. (2026). DrugBLIP : exploring the protein--molecule interaction mechanisms with a multi-task learning graph transformer. Bioinformatics\/ , 42 (4), btag069

2026

[21] [21]

Weininger, D. (1988). SMILES , a chemical language and information system. 1. introduction to methodology and encoding rules. Journal of Chemical Information and Computer Sciences\/ , 28 (1), 31--36

1988

[22] [22]

Xu, X. et al. (2024). HELM-GPT : de novo macrocyclic peptide design using generative pre-trained transformer. Bioinformatics\/ , 40 (6), btae364

2024

[23] [23]

Yu, Y. et al. (2023). Uni-Dock : GPU -accelerated docking enables ultralarge virtual screening. Journal of Chemical Theory and Computation\/ , 19 (11), 3336--3345

2023

[24] [24]

Zhang, T. et al. (2012). HELM : a hierarchical notation language for complex biomolecule structure representation. Journal of Chemical Information and Modeling\/ , 52 (10), 2796--2806

2012

[25] [25]

Zhou, G. et al. (2023). Uni-Mol : a universal 3d molecular representation learning framework. In International Conference on Learning Representations\/

2023

[26] [26]

Zorzi, A. et al. (2017). Cyclic peptide therapeutics: past, present and future. Current Opinion in Chemical Biology\/ , 38 , 24--29

2017