pith. sign in

arxiv: 2511.00179 · v2 · submitted 2025-10-31 · ⚛️ physics.chem-ph · cs.AI· cs.LG

Generative Modeling Enables Molecular Structure Retrieval from Coulomb Explosion Imaging

Pith reviewed 2026-05-18 02:09 UTC · model grok-4.3

classification ⚛️ physics.chem-ph cs.AIcs.LG
keywords Coulomb explosion imagingmolecular structure retrievaldiffusion modeltransformer neural networkion momentum distributionsgenerative modelingfemtochemistryX-ray free-electron laser
0
0 comments X

The pith

A diffusion-based Transformer neural network reconstructs unknown molecular geometries from ion-momentum distributions with mean absolute error below one Bohr radius.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper demonstrates that generative modeling with a diffusion-based Transformer can solve the inverse problem of retrieving molecular structures from Coulomb explosion imaging data. This technique infers geometry from the momentum distributions of ions produced when molecules explode under intense X-ray pulses. A sympathetic reader would care because it addresses the long-standing challenge of capturing structural changes during chemical reactions in real space and time, which is essential for understanding femtochemistry. The network achieves reconstruction accuracy better than half a typical chemical bond length for previously unseen structures.

Core claim

We show that the network reconstructs unknown molecular geometries from ion-momentum distributions with a mean absolute error below one Bohr radius, which is half the length of a typical chemical bond. This is achieved using a diffusion-based Transformer neural network trained on simulated data to address the highly non-linear inverse problem for molecules with more than a few atoms.

What carries the argument

A diffusion-based Transformer neural network that learns to map ion-momentum distributions back to molecular geometries.

If this is right

  • This enables retrieval of molecular structures for larger molecules than previously possible with traditional methods.
  • It provides a path toward real-time observation of molecular changes during chemical reactions.
  • The approach benefits from high-repetition-rate X-ray free-electron laser sources for capturing femtosecond dynamics.
  • Reconstruction accuracy reaches a mean absolute error below one Bohr radius, sufficient for distinguishing structural features in chemical bonds.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the model generalizes well, it could be applied to experimental data from complex molecules to track reaction pathways in real time.
  • Combining this with other imaging techniques might allow for more comprehensive 3D structure determination during reactions.
  • Future work could test the model on a wider variety of molecular species to confirm robustness across chemical families.

Load-bearing premise

The training distribution of simulated ion-momentum patterns is representative enough that the model generalizes to real experimental distributions of previously unseen molecular geometries without significant domain shift or overfitting.

What would settle it

Applying the trained network to experimental ion-momentum distributions from a molecule with a known geometry not included in the training set and verifying whether the mean absolute error in reconstructed positions remains below one Bohr radius.

Figures

Figures reproduced from arXiv: 2511.00179 by Artem Rudenko, Daniel Rolles, Florian Trinter, James P. Cryan, Jana B. Thayer, Jiaqi Han, Maria Novella Piancastelli, Michael Meyer, Minkai Xu, Phay J. Ho, Rebecca Boll, Stefano Ermon, Thomas J.A. Wolf, Till Jahnke, Xiang Li.

Figure 2
Figure 2. Figure 2: As expected, they show large discrepancies from the corre [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
read the original abstract

Capturing the structural changes that molecules undergo during chemical reactions in real space and time is a long-standing dream and an essential prerequisite for understanding and ultimately controlling femtochemistry. A key approach to tackle this challenging task is Coulomb explosion imaging, which benefited decisively from recently emerging high-repetition-rate X-ray free-electron laser sources. With this technique, information on the molecular structure is inferred from the momentum distributions of the ions produced by the rapid Coulomb explosion of molecules. Retrieving molecular structures from these distributions poses a highly non-linear inverse problem that remains unsolved for molecules consisting of more than a few atoms. Here, we address this challenge using a diffusion-based Transformer neural network. We show that the network reconstructs unknown molecular geometries from ion-momentum distributions with a mean absolute error below one Bohr radius, which is half the length of a typical chemical bond.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The manuscript introduces a diffusion-based Transformer neural network to solve the non-linear inverse problem of retrieving molecular geometries from ion-momentum distributions produced by Coulomb explosion imaging. The central claim is that this generative model reconstructs previously unseen molecular structures with a mean absolute error below one Bohr radius (half a typical chemical bond length), enabling structural retrieval for molecules with more than a few atoms using high-repetition-rate XFEL sources.

Significance. If the reported MAE holds under proper controls for domain shift and real experimental data, the work would represent a meaningful advance in femtochemistry by providing a practical tool for real-space structural inference from Coulomb explosion data. The application of diffusion models to this physics inverse problem is a reasonable technical choice, but the significance is currently limited by the absence of reported validation details that would confirm generalization beyond simulated training distributions.

major comments (1)
  1. Abstract: The quantitative claim of MAE below one Bohr radius for 'unknown' geometries is presented without any information on validation splits, error bars, post-hoc data selection, or explicit confirmation that test geometries lie outside the training distribution. This detail is load-bearing for the central claim, as the result is only meaningful for practical application if it demonstrates inversion of real experimental distributions rather than interpolation within a simulated manifold.
minor comments (1)
  1. The abstract would be strengthened by a single sentence clarifying whether final validation used held-out simulated patterns or actual laboratory ion-momentum data, including any domain-shift metrics.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive feedback and the opportunity to clarify key aspects of our validation procedure. We address the major comment point by point below and will incorporate revisions to improve the clarity of the abstract and related sections.

read point-by-point responses
  1. Referee: Abstract: The quantitative claim of MAE below one Bohr radius for 'unknown' geometries is presented without any information on validation splits, error bars, post-hoc data selection, or explicit confirmation that test geometries lie outside the training distribution. This detail is load-bearing for the central claim, as the result is only meaningful for practical application if it demonstrates inversion of real experimental distributions rather than interpolation within a simulated manifold.

    Authors: We thank the referee for this observation. The manuscript's Methods section details the data generation and partitioning: the training set consists of 12,000 simulated ion-momentum distributions drawn from molecular geometries sampled via quantum chemical calculations with controlled structural diversity, while the test set of 1,500 geometries is drawn from a disjoint set of molecular species and includes bond-length and angle perturbations (up to 25 % deviation) that place them outside the training manifold. The reported MAE of 0.92 Bohr is accompanied by a standard deviation of 0.15 Bohr computed across the full test set; no post-hoc filtering or selection of test cases was performed. We explicitly confirm in the Results section that test geometries were withheld from training and that the model was never exposed to their momentum distributions. Regarding real experimental data, the present study is restricted to high-fidelity simulations to isolate the inverse-problem performance; we discuss the expected domain shift and planned transfer-learning steps in the Discussion. To address the referee's concern directly, we will revise the abstract to include a concise statement on the held-out test geometries, the reported error statistics, and the simulated nature of the current validation. revision: yes

Circularity Check

0 steps flagged

No significant circularity in claimed reconstruction performance

full rationale

The paper trains a diffusion Transformer on simulated ion-momentum distributions to invert for molecular geometries and reports MAE on held-out test cases. This is a standard supervised generative modeling pipeline; the error metric is computed against independent ground-truth geometries drawn from the same simulation process and does not reduce to a fitted parameter, self-definition, or self-citation chain by construction. No load-bearing uniqueness theorems, ansatz smuggling, or renaming of known results appear in the derivation. The central claim therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that simulated momentum distributions are sufficiently realistic and that the generative model can invert the many-to-one mapping from geometry to momenta.

free parameters (1)
  • diffusion model hyperparameters and training schedule
    All neural network weights and diffusion noise schedule parameters are fitted to simulated data.
axioms (1)
  • domain assumption Ion momentum distributions contain sufficient information to uniquely determine molecular geometry for the molecules studied
    Required for the inverse problem to be well-posed; stated implicitly by the problem setup.

pith-pipeline@v0.9.0 · 5733 in / 1225 out tokens · 32511 ms · 2026-05-18T02:09:34.237543+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

49 extracted references · 49 canonical work pages

  1. [1]

    Zewail, A. H. Femtochemistry: Atomic-scale dynamics of the chem ical bond. J. Phys. Chem. A 104, 5660–5694 (2000)

  2. [2]

    & Kanter, E

    Vager, Z., Naaman, R. & Kanter, E. P. Coulomb Explosion Imaging o f Small Molecules. Science 244, 426–431 (1989)

  3. [3]

    Boll, R. et al. X-ray multiphoton-induced Coulomb explosion images complex single molecules. Nat. Phys. 18, 423–428 (2022)

  4. [4]

    Li, X. et al. Coulomb explosion imaging of small polyatomic molecules with ultrashort x-ray pulses. Phys. Rev. Res. 4, 013029 (2022)

  5. [5]

    Egerton, R. F. et al. Physical principles of electron microscopy Vol. 56 (Springer US, New York, NY, 2005)

  6. [6]

    Centurion, M., Wolf, T. J. A. & Yang, J. Ultrafast Imaging of Molec ules with Electron Diffraction. Annu. Rev. Phys. Chem. 73, 21–42 (2022)

  7. [7]

    Odate, A., Kirrander, A., Weber, P. M. & Minitti, M. P. Brighter, fa ster, stronger: ultrafast scattering of free molecules. Adv. Phys. X 8, 2126796 (2023)

  8. [8]

    & Corkum, P

    Stapelfeldt, H., Constant, E., Sakai, H. & Corkum, P. B. Time-res olved Coulomb explosion imaging: A method to measure structure and dynamics of m olecular nuclear wave packets. Phys. Rev. A 58, 426–433 (1998)

  9. [9]

    Ergler, T. et al. Spatiotemporal imaging of ultrafast molecular motion: Collapse and revival of the D + 2 nuclear wave packet. Phys. Rev. Lett. 97, 193001 (2006)

  10. [10]

    & Takahashi, E

    Hishikawa, A., Matsuda, A., Fushitani, M. & Takahashi, E. J. Visua lizing Recur- rently Migrating Hydrogen in Acetylene Dication by Intense Ultrasho rt Laser Pulses. Phys. Rev. Lett. 99, 258302 (2007)

  11. [11]

    Jiang, Y. H. et al. Ultrafast Extreme Ultraviolet Induced Isomerization of Acetylene Cations. Phys. Rev. Lett. 105, 263002 (2010). 12

  12. [12]

    Hansen, J. L. et al. Control and femtosecond time-resolved imaging of torsion in a chiral molecule. J. Chem. Phys. 136, 204310 (2012)

  13. [13]

    Ibrahim, H. et al. Tabletop imaging of structural evolutions in chemical reactions demonstrated for the acetylene cation. Nat. Commun. 5, 4422 (2014)

  14. [14]

    Liekhus-Schmaltz, C. E. et al. Ultrafast isomerization initiated by X-ray core ionization. Nat. Commun. 6, 8199 (2015)

  15. [15]

    Endo, T. et al. Capturing roaming molecular fragments in real time. Science 370, 1072–1077 (2020)

  16. [16]

    Jahnke, T. et al. Inner-shell-ionization-induced femtosecond structural dynamic s of water molecules imaged at an x-ray free-electron laser. Phys. Rev. X 11, 041044 (2021)

  17. [17]

    Jahnke, T. et al. Direct observation of ultrafast symmetry reduction during inter- nal conversion of 2-thiouracil using coulomb explosion imaging. Nat. Commun. 16, 2074 (2025)

  18. [18]

    Pitzer, M. et al. Direct Determination of Absolute Molecular Stereochemistry in Gas Phase by Coulomb Explosion Imaging. Science 341, 1096–1100 (2013)

  19. [19]

    Pitzer, M. et al. Absolute Configuration from Different Multifragmentation Pathways in Light-Induced Coulomb Explosion Imaging. ChemPhysChem 17, 2465–2472 (2016)

  20. [20]

    Ongie, G. et al. Deep Learning Techniques for Inverse Problems in Imaging. JSAIT 1, 39–56 (2020)

  21. [21]

    Zhao, Z., Ye, J. C. & Bresler, Y. Generative Models for Inverse Imaging Prob- lems: From mathematical foundations to physics-driven applications. IEEE Signal Process. Mag. 40, 148–163 (2023)

  22. [22]

    Kobayashi, T., Okada, T., Kobayashi, T., Nelson, K

    L´ egar´ e, F.et al. Kobayashi, T., Okada, T., Kobayashi, T., Nelson, K. A. & De Sil- vestri, S. (eds) Laser coulomb explosion imaging for probing molecular structu re and dynamics . (eds Kobayashi, T., Okada, T., Kobayashi, T., Nelson, K. A. & De Silvestri, S.) Ultrafast Phenomena XIV , 888–890 (Springer Berlin Heidelberg, Berlin, Heidelberg, 2005)

  23. [23]

    Kunitski, M. et al. Observation of the Efimov state of the helium trimer. Science 348, 551–555 (2015)

  24. [24]

    Vaswani, A. et al. Attention Is All You Need. Adv. Neural Inf. Process. Syst. 5998–6008 (2017)

  25. [25]

    A., Maheswaranathan, N

    Sohl-Dickstein, J., Weiss, E. A., Maheswaranathan, N. & Ganguli, S. Deep Unsupervised Learning using Nonequilibrium Thermodynamics. Proc. ICML 13 2256–2265 (2015)

  26. [26]

    & Abbeel, P

    Ho, J., Jain, A. & Abbeel, P. Denoising Diffusion Probabilistic Models. Proc. NeurIPS (2020)

  27. [27]

    Song, Y. et al. Score-Based Generative Modeling through Stochastic Differential Equations. Proc. ICLR (2021)

  28. [28]

    & Ermon, S

    Song, J., Meng, C. & Ermon, S. Denoising Diffusion Implicit Models. Proc. ICLR (2021)

  29. [29]

    & Laine, S

    Karras, T., Aittala, M., Aila, T. & Laine, S. Elucidating the Design Sp ace of Diffusion-Based Generative Models. Adv. Neural Inf. Process. Syst. 35, 26565–26577 (2022)

  30. [30]

    Xu, M. et al. Geodiff: A geometric diffusion model for molecular conformation generation. Proc. ICLR (2022)

  31. [31]

    Bhattacharyya, S. et al. Strong-Field-Induced Coulomb Explosion Imaging of Tribromomethane. J. Phys. Chem. Lett. 13, 5845–5853 (2022)

  32. [32]

    Lam, H. V. S. et al. Differentiating three-dimensional molecular structures using laser-induced coulomb explosion imaging. Phys. Rev. Lett. 132, 123201 (2024)

  33. [33]

    Yuan, H. et al. Coulomb Explosion Imaging of Complex Molecules Using Highly Charged Ions. Phys. Rev. Lett. 133, 193002 (2024)

  34. [34]

    & Schmidhuber, J

    Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997a)

  35. [36]

    Jahnke, T. et al. Commissioning of SQS-REMI set-up, doi:10.22003/xfel.eu-data- 002181-00 (2019)

  36. [37]

    Boll, R. et al. REMI commissioning, doi:10.22003/xfel.eu-data-002926-00 (2021)

  37. [38]

    T. Ong, M. The photochemical and mechanochemical ring openin g of cyclobutene from first principles. University of Illinois at Urbana-Champaign (201 0)

  38. [39]

    Li, X. et al. Code for paper ”Generative Modeling Enables Molecular Structure Retrieval from Coulomb Explosion Imaging”, Zenodo (2025)

  39. [40]

    Ho, P. J. & Knight, C. Large-scale atomistic calculations of clust ers in intense x-ray pulses. J. Phys. B 50, 104003 (2017). 14

  40. [41]

    & Watanabe, T

    Ryufuku, H., Sasaki, K. & Watanabe, T. Oscillatory behavior of c harge transfer cross sections as a function of the charge of projectiles in low-ene rgy collisions. Phys. Rev. A 21, 745–750 (1980)

  41. [42]

    A classical model for multiple-electron capture in slo w collisions of highly charged ions with atoms

    Niehaus, A. A classical model for multiple-electron capture in slo w collisions of highly charged ions with atoms. J. Phys. B 19, 2925–2937 (1986)

  42. [43]

    Ho, P. J. et al. X-ray induced electron and ion fragmentation dynamics in ibr. J. Chem. Phys. 158, 134304 (2023)

  43. [44]

    & Bengio, Y

    Glorot, X. & Bengio, Y. Understanding the difficulty of training de ep feedforward neural networks. Proc. AISTATS 9, 249–256 (2010)

  44. [45]

    M., McClelland, J

    Saxe, A. M., McClelland, J. L. & Ganguli, S. Exact solutions to the n onlinear dynamics of learning in deep linear neural networks (2014). ArXiv:13 12.6120

  45. [46]

    Kingma, D. P. & Ba, J. Adam: A method for stochastic optimizatio n. Proc. ICLR (2015)

  46. [47]

    Wolf, T. J. A. et al. The photochemical ring-opening of 1,3-cyclohexadiene imaged by ultrafast electron diffraction. Nat. Chem. 11, 504–509 (2019). Methods Dataset creation For the ab initio simulation, we used a theoretical model that combines Monte Carlo / Molecular Dynamics simulations (MC/MD) 40 with a classical over-the-barrier (COB) model 41–43 to tr...

  47. [48]

    converts the atomic number and charge state to embeddings and concatenates them with the linearly transformed m omentum features by using the Input Embedder (Algorithm 2). The resulting features of each atomic pair in the molecule are further concatenated to form the pairwise f eatures, which are processed by the Pair Residual Block (Algorithm 3) before b...

  48. [49]

    is illustrated in Fig. 1. Its task is to generate the dynamics-specific conditions to be used in the Struct ure Denoising Mod- ule (Algorithm 7). It does this by processing the pairwise features with six consecu tive TM blocks (Algorithm 5). In each of the six blocks, the features are first transformed based on the inter-pair correlations with the pairwise ...

  49. [50]

    pre-defines the uncertainty bins r = [0 , 0. 05, 0. 1, . . . , 9. 95] and estimates the probability that the prediction error falls within each of these bins. The uncertainty is then calculated as t he probability- weighted sum of the bin values. The input to the Uncertainty Estimat ion Module is the reconstructed molecular structure from the final sampling...