pith. sign in

arxiv: 2604.20886 · v1 · submitted 2026-04-17 · ⚛️ physics.chem-ph · cs.LG

KinetiDiff: Docking-Guided Diffusion for De Novo ACVR1 Inhibitor Design in Fibrodysplasia Ossificans Progressiva

Pith reviewed 2026-05-10 06:43 UTC · model grok-4.3

classification ⚛️ physics.chem-ph cs.LG
keywords de novo drug designdiffusion modelskinase inhibitorsACVR1Fibrodysplasia Ossificans Progressivamolecular dockinggradient guidance
0
0 comments X

The pith

KinetiDiff steers diffusion models with real-time docking gradients to generate ACVR1 inhibitors exceeding the reference by 19 percent.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents KinetiDiff as a method that merges a geometry-complete diffusion model with ongoing AutoDock Vina guidance to design new molecules binding to ACVR1 kinase. This matters for Fibrodysplasia Ossificans Progressiva because the kinase drives the disease and stronger binders could support treatment efforts. The approach injects physics-based docking gradients directly into the denoising steps so generated molecules are pushed toward better predicted affinities. From 10,000 samples the model yielded 9,997 valid molecules, the strongest of which improved binding energy by 19.2 percent over the known reference structure. Ablation experiments across guidance types establish that the docking-based steering outperforms neural alternatives and unguided runs while preserving drug-like properties.

Core claim

By injecting physics-based docking gradients into the diffusion denoising loop, KinetiDiff steers molecule generation toward high-affinity conformations for ACVR1. From 10,000 diffusion samples, the framework produced 9,997 valid molecules. The best candidate achieved -11.05 kcal/mol (pKd = 8.10), a 19.2% improvement over the crystallographic reference. The top 100 candidates all exceed the reference, with 100% Lipinski compliance, median synthetic accessibility of 2.67, and internal diversity of 0.790. Systematic ablation across four guidance strategies demonstrates that real-time docking guidance dominates on all metrics.

What carries the argument

The injection of real-time AutoDock Vina docking gradients into the denoising loop of a Geometry-Complete Diffusion Model, which continuously steers each step of molecule generation toward higher predicted binding affinity to the ACVR1 target.

If this is right

  • Nearly all diffusion samples produce chemically valid molecules ready for further filtering.
  • The strongest generated candidate improves predicted binding energy by 19.2 percent over the reference.
  • Every member of the top 100 molecules exceeds the reference affinity while meeting Lipinski rules and showing favorable synthetic accessibility.
  • Real-time docking guidance outperforms neural-proxy guidance and unguided diffusion across affinity, validity, and diversity measures.
  • The method supplies a concrete route for creating inhibitors against rare-disease kinase targets such as ACVR1.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same gradient-injection strategy could be applied to other kinases simply by providing their crystal structures as guidance targets.
  • The high rate of valid molecules suggests guided diffusion can serve as an efficient sampler for exploring drug-like chemical space beyond this single target.
  • Replacing full docking with faster approximations at each step might preserve most of the guidance benefit while lowering compute cost.
  • Experimental testing in cellular models of Fibrodysplasia Ossificans Progressiva would reveal whether the predicted affinities translate into functional inhibition.

Load-bearing premise

AutoDock Vina docking scores accurately predict true biological binding affinity, synthetic feasibility, and in vivo activity of the generated molecules.

What would settle it

Synthesize the top-scoring molecule and measure its experimental binding affinity to ACVR1 via surface plasmon resonance or enzymatic assay; if the measured affinity does not exceed that of the crystallographic reference ligand, the performance claim is falsified.

Figures

Figures reproduced from arXiv: 2604.20886 by Aaryan Patel.

Figure 1
Figure 1. Figure 1: Architecture overview of the KinetiDiff framework. [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Box-and-whisker comparison of molecular generation quality across four guidance strategies. Vina-Direct [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: 2D structures of the top 10 generated ACVR1 inhibitor candidates with docking scores and drug-likeness [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: 2D chemical structure of the 3MTF crystallographic reference inhibitor (A3F). This baseline ligand established [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Property distributions by guidance type. Lipinski reference thresholds are shown as red dashed lines. Vina [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: pKd vs QED across all 19,021 molecules. The Pareto front (dashed line) traces the non-dominated trade-off [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Chemical diversity of the top 100 candidates. Left: pairwise Tanimoto similarity heatmap (hierarchically [PITH_FULL_IMAGE:figures/full_fig_p011_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Top two generated molecules docked in the ACVR1 binding pocket (PDB: 3MTF), rendered in PyMOL. [PITH_FULL_IMAGE:figures/full_fig_p016_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: ACVR1/ALK2 signaling pathway and mechanism of heterotopic ossification in FOP. [PITH_FULL_IMAGE:figures/full_fig_p018_9.png] view at source ↗
read the original abstract

We present KinetiDiff, a structure-based framework for de novo kinase inhibitor design that integrates a Geometry-Complete Diffusion Model with real-time AutoDock Vina gradient guidance. By injecting physics-based docking gradients into the diffusion denoising loop, KinetiDiff steers molecule generation toward high-affinity conformations for ACVR1 (ALK2), the causative kinase in Fibrodysplasia Ossificans Progressiva. From 10,000 diffusion samples, the framework produced 9,997 valid molecules. The best candidate achieved $-11.05$ kcal/mol (pKd = 8.10), a 19.2% improvement over the crystallographic reference. The top 100 candidates all exceed the reference, with 100% Lipinski compliance, median synthetic accessibility of 2.67, and internal diversity of 0.790. Systematic ablation across four guidance strategies--Vina-Direct (physics), HNN-Denovo (neural proxy), multi-objective, and unguided--demonstrates that real-time docking guidance dominates on all metrics. We evaluate HNN-Denovo as a computationally efficient alternative (60-fold speedup per step), revealing a domain-mismatch limitation (r = 0.224 correlation with Vina) that explains its inferior performance. These results establish gradient-guided geometric diffusion as a practical approach for generating potent, synthetically accessible inhibitors against rare-disease kinase targets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript presents KinetiDiff, a structure-based de novo design framework that integrates a Geometry-Complete Diffusion Model with real-time AutoDock Vina gradient guidance during the denoising process. Applied to ACVR1 (ALK2) for Fibrodysplasia Ossificans Progressiva, the method generates 9,997 valid molecules from 10,000 samples. The best candidate achieves a Vina score of -11.05 kcal/mol, representing a 19.2% improvement over the crystallographic reference, with all top-100 candidates exceeding the reference score. Ablation studies compare Vina-Direct guidance against neural proxy (HNN-Denovo), multi-objective, and unguided approaches, showing superiority of physics-based guidance, while noting the neural proxy's limitations due to low correlation (r=0.224) with Vina.

Significance. If the generated molecules can be shown to bind ACVR1 with affinities consistent with the reported Vina scores through independent means, the integration of physics-based docking gradients into geometric diffusion could advance computational methods for designing inhibitors against rare-disease targets. The high molecular validity rate, Lipinski compliance, and synthetic accessibility metrics are positive features, and the ablation provides useful comparison between direct physics guidance and learned proxies.

major comments (2)
  1. [Abstract and Results] Abstract and Results sections: The reported affinity improvements (e.g., best Vina score of -11.05 kcal/mol and top-100 ranking) are evaluated exclusively using the same AutoDock Vina docking scores that supply the gradients for guidance. This renders the gains expected by construction of the optimization loop rather than an external test of binding quality. No orthogonal scoring (alternative docking engine, MM-GBSA, MD free energies) or comparison to experimentally measured affinities of known ACVR1 inhibitors is provided.
  2. [Ablation study] Ablation study (Vina-Direct vs. HNN-Denovo): While the r=0.224 correlation correctly identifies domain mismatch for the neural proxy, the comparison still relies on Vina as the sole ground truth. This does not establish whether Vina itself is predictive of true biological binding for the generated ACVR1 ligands or whether the steered poses are physically realistic.
minor comments (2)
  1. [Abstract] Abstract: The reported 'internal diversity of 0.790' and 'median synthetic accessibility of 2.67' should specify the exact diversity metric (e.g., Tanimoto on ECFP) and reference scale for SA score to allow direct comparison.
  2. [Methods] Methods: Provide more detail on the precise mechanism of 'real-time' Vina gradient injection, including gradient computation frequency, scaling factor, and how it is combined with the diffusion denoising update.

Simulated Author's Rebuttal

2 responses · 2 unresolved

We thank the referee for their constructive feedback and for acknowledging the potential of integrating physics-based guidance into geometric diffusion models. We address each major comment below with clarifications and indicate where revisions will be made to the manuscript.

read point-by-point responses
  1. Referee: [Abstract and Results] Abstract and Results sections: The reported affinity improvements (e.g., best Vina score of -11.05 kcal/mol and top-100 ranking) are evaluated exclusively using the same AutoDock Vina docking scores that supply the gradients for guidance. This renders the gains expected by construction of the optimization loop rather than an external test of binding quality. No orthogonal scoring (alternative docking engine, MM-GBSA, MD free energies) or comparison to experimentally measured affinities of known ACVR1 inhibitors is provided.

    Authors: We agree that the reported improvements are assessed using AutoDock Vina scores, which also provide the guidance gradients. This design choice allows direct demonstration that the diffusion process, when steered by real-time Vina gradients, produces molecules with better scores than the crystallographic reference and outperforms unguided or alternative guidance baselines in the ablation studies. The 99.97% validity rate and drug-likeness metrics provide supporting evidence of molecular quality within the Vina optimization landscape. However, we acknowledge that this constitutes an internal evaluation rather than fully independent validation. In the revised manuscript, we will add a limitations subsection in the Discussion explicitly noting this point and recommending future orthogonal assessments (e.g., alternative docking engines or MD simulations). We will also include a comparison of top candidates' Vina scores against literature-reported values for known ACVR1 inhibitors. revision: partial

  2. Referee: [Ablation study] Ablation study (Vina-Direct vs. HNN-Denovo): While the r=0.224 correlation correctly identifies domain mismatch for the neural proxy, the comparison still relies on Vina as the sole ground truth. This does not establish whether Vina itself is predictive of true biological binding for the generated ACVR1 ligands or whether the steered poses are physically realistic.

    Authors: The ablation study isolates the effect of guidance type by holding the evaluation metric (Vina) constant across conditions, thereby showing that direct physics-based guidance outperforms the neural proxy and other strategies. The low correlation for HNN-Denovo is presented precisely to illustrate the domain mismatch issue. We do not assert that Vina scores predict experimental affinities for these specific de novo ligands, nor do we claim to have validated pose realism beyond the geometric constraints enforced by the diffusion model and docking engine. In the revised manuscript, we will expand the Discussion to address the known limitations of docking scores as proxies for binding and to emphasize the need for experimental confirmation of activity. revision: partial

standing simulated objections not resolved
  • Experimental measurement of binding affinities for the generated de novo molecules
  • Independent orthogonal scoring such as MM-GBSA or MD free-energy calculations on the generated ligands

Circularity Check

1 steps flagged

Vina docking scores used for both real-time gradient guidance and post-hoc evaluation create circular assessment of 'high-affinity' claim

specific steps
  1. fitted input called prediction [Abstract]
    "By injecting physics-based docking gradients into the diffusion denoising loop, KinetiDiff steers molecule generation toward high-affinity conformations for ACVR1 (ALK2)... The best candidate achieved $-11.05$ kcal/mol (pKd = 8.10), a 19.2% improvement over the crystallographic reference. The top 100 candidates all exceed the reference"

    The guidance mechanism explicitly optimizes the AutoDock Vina score via gradients; success is then declared by the same Vina scores exceeding the reference ligand. The 19.2% improvement and 100% top-100 superiority are therefore enforced by construction of the objective, not validated externally.

full rationale

The paper's core claim—that gradient-guided diffusion produces superior ACVR1 binders—is supported only by reporting improved AutoDock Vina scores. Because the identical Vina function supplies both the denoising gradients and the final ranking metric (best candidate -11.05 kcal/mol, top-100 all exceed reference), the reported gains are the direct, expected output of the optimization loop rather than an independent test. Ablations confirm Vina-Direct beats alternatives on Vina metrics, but this does not escape the circularity; no orthogonal scoring or experimental affinities are provided.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the domain assumption that Vina docking scores are reliable proxies for affinity and on the generative capabilities of the diffusion model under guidance.

axioms (2)
  • domain assumption AutoDock Vina docking scores are reliable proxies for binding affinity
    Used both to guide generation and to claim success without orthogonal validation.
  • domain assumption The geometry-complete diffusion model produces chemically valid and diverse molecules when guided by docking gradients
    Supported only by the count of 9,997 valid molecules out of 10,000 samples.

pith-pipeline@v0.9.0 · 5563 in / 1549 out tokens · 69843 ms · 2026-05-10T06:43:47.245935+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

16 extracted references · 16 canonical work pages

  1. [1]

    Dushyanth Srinivasan, Martin Arostegui, Erich J

    doi:10.1016/j.berh.2007.11.007. Dushyanth Srinivasan, Martin Arostegui, Erich J. Goebel, Kaitlin N. Hart, Senem Aykul, John B. Lees-Shepard, Vincent Idone, Sarah J. Hatsell, and Aris N. Economides. How activin a became a therapeutic target in fibrodysplasia ossificans progressiva.Biomolecules, 14(1):101,

  2. [2]

    Eleanor Williams, Jana Bagarova, Georgina Kerr, Dong-Dong Xia, Elsie S

    doi:10.3390/biom14010101. Eleanor Williams, Jana Bagarova, Georgina Kerr, Dong-Dong Xia, Elsie S. Place, Devaveena Dey, Yue Shen, Geoffrey A. Bocobo, Agustin H. Mohedas, Xiuli Huang, Philip E. Sanderson, Arthur Lee, Wei Zheng, Aris N. Economides, James C. Smith, Paul B. Yu, and Alex N. Bullock. Saracatinib is an efficacious clinical candidate for fibrodys...

  3. [3]

    doi:10.1172/jci.insight.95042. Thomas Ullrich, Sabine Guth, Luca Arista, Sven Weiler, Nikolaus Stiefl, Sylvie Teixeira-Fouchard, Carien Dekker, Alexandra Hinniger, Victoria Head, Michaela Kneissel, and Ina Kramer. Discovery and characterization of zilur- gisertib, a potent and selective inhibitor of activin receptor-like kinase-2 (alk2) for the treatment ...

  4. [4]

    Alex Zhavoronkov, Yan A

    doi:10.1021/acsmedchemlett.5c00516. Alex Zhavoronkov, Yan A. Ivanenkov, Alex Aliper, Mark S. Veselov, Vladimir A. Aladinskiy, Anastasiya V . Aladinskaya, Victor A. Terentiev, Daniil A. Polykovskiy, Maksim D. Kuznetsov, Arip Asadulaev, Yury V olkov, Artem Zholus, Rim R. Shayakhmetov, Alexander Zhebrak, Lidiya I. Minaeva, Bogdan A. Zagribelnyy, Lennart H. L...

  5. [5]

    Deep learning enables rapid identification of potent DDR1 kinase inhibitors.Nature Biotechnology, 37(9): 1038–1040, 2019

    doi:10.1038/s41587-019-0224-x. 11 Yueshan Li, Liting Zhang, Yifei Wang, Jun Zou, Ruicheng Yang, Xinling Luo, Chengyong Wu, Wei Yang, Chenyu Tian, Haixing Xu, Falu Wang, Xin Yang, Linli Li, and Shengyong Yang. Generative deep learning enables the discovery of a potent and selective ripk1 inhibitor.Nature Communications, 13(1):6891,

  6. [6]

    Uddalak Das

    doi:10.1038/s41467-022-34692-w. Uddalak Das. Generative ai for drug discovery and protein design: the next frontier in ai-driven molecular science. Medicine in Drug Discovery, 27:100213,

  7. [7]

    Alex Morehead and Jianlin Cheng

    doi:10.1016/j.medidd.2025.100213. Alex Morehead and Jianlin Cheng. Geometry-complete diffusion for 3d molecule generation and optimization. Communications Chemistry, 7(1):150,

  8. [8]

    Song, Qianru Zhao, Xiaodong Wu, Daniel Bhowmik, and Hugh A

    Xinzhe Zheng, Yifan Ren, Tianrui Lai, Xian Chen, Luke P. Song, Qianru Zhao, Xiaodong Wu, Daniel Bhowmik, and Hugh A. Carlson. Apo2mol: 3d molecule generation via dynamic pocket-aware diffusion models.arXiv preprint arXiv:2404.08777,

  9. [9]

    Yue Jian, Curtis Wu, Danny Reidenbach, and Aditi S

    doi:10.1093/bioinformatics/btac781. Yue Jian, Curtis Wu, Danny Reidenbach, and Aditi S. Krishnapriyan. General binding affinity guidance for diffusion models in structure-based drug design.arXiv preprint arXiv:2406.16821,

  10. [10]

    doi:10.1038/s41467-025-63245-0. Jesse A. Weller, Remo Rohs, Brandon A. Chapman, Mason Meyer, Michael De Montigny, Lee Kuo, Justin L. Wichman, Max D. Thomas, Matthew O’Meara, John McPhail, Li Cheng, and Alán Aspuru-Guzik. Structure-based drug design with a deep hierarchical generative model.Journal of Chemical Information and Modeling, 64(16):6450–6463,

  11. [11]

    Alex Nichol and Prafulla Dhariwal

    doi:10.1021/acs.jcim.4c01193. Alex Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. InInternational Conference on Machine Learning, pages 8162–8171,

  12. [12]

    and Olson, A

    doi:10.1002/jcc.21334. Tong Liu, Youjun Lin, Xin Wen, Robyn N. Jorissen, and Michael K. Gilson. Bindingdb: a web-accessible database of experimentally determined protein–ligand binding affinities.Nucleic Acids Research, 35(Database issue):D198–D201,

  13. [13]

    doi:10.1093/nar/gkl899. Google. Google tpu research cloud (trc),

  14. [14]

    Table 3 summarizes the full Vina guidance configuration

    3D conformers were embedded using RDKit’s ETKDGv3 (with v2 and random-coordinate fallbacks), minimized with MMFF94 (UFF fallback), and converted to PDBQT format via Meeko’sMoleculePreparation and PDBQTWriterLegacy. Table 3 summarizes the full Vina guidance configuration. A.4 Training Data and HNN-Denovo Architecture The HNN-Denovo affinity predictor was t...

  15. [15]

    Table 4: Multi-objective scoring targets and thresholds. Objective Target Threshold Direction Affinity (pKd) 7.0≥6.0Maximize Docking (kcal/mol)−11.0≤ −10.0Minimize SA score 3.0≤3.5Minimize Adaptive threshold weights were updated with rate α= 0.1 and bounded to [0.1,0.6] . The SA objective enforced a minimum weight floor of 0.25 and an exponential penalty ...

  16. [16]

    17 D ACVR1 Biology and Inhibition Mechanism Figure 9: ACVR1/ALK2 signaling pathway and mechanism of heterotopic ossification in FOP.Left:Normal BMP signaling through wild-type ALK2, which is activated only by BMP ligands and remains unresponsive to activin A. Center:The R206H gain-of-function mutation renders ALK2 constitutively active and responsive to a...