pith. sign in

arxiv: 2606.29919 · v1 · pith:ZLKVFN3Wnew · submitted 2026-06-29 · ⚛️ physics.comp-ph

High-order tensor neural network for iteration-free structure relaxation

Pith reviewed 2026-06-30 03:58 UTC · model grok-4.3

classification ⚛️ physics.comp-ph
keywords structure relaxationneural networkmachine learningmaterials discoverytensor networkscrystal structurescatalysisone-shot prediction
0
0 comments X

The pith

A high-order tensor neural network predicts relaxed material structures in one forward pass from unrelaxed inputs alone.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents HotRelax as a neural network that learns to output relaxed atomic geometries directly from unrelaxed starting structures. It trains solely on paired unrelaxed and relaxed examples across multiple material classes, without using force data or running any optimization loop. The model performs this mapping in a single inference step rather than the repeated force evaluations required by conventional or machine-learning-assisted relaxation. On benchmarks covering bulk crystals, layered materials, and catalysts, the predictions match or beat prior end-to-end models while keeping the network compact. The resulting structures also lie close in energy to fully DFT-relaxed references and improve downstream energy models when used in catalytic screening.

Core claim

HotRelax is a high-order tensor message-passing neural network trained directly on pairs of unrelaxed and relaxed structures that produces the relaxed geometry in a single forward pass, without iterative inference, post-processing, or DFT force labels.

What carries the argument

High-order tensor message-passing neural network that directly maps an input atomic configuration to its relaxed counterpart.

If this is right

  • High-throughput materials screening can skip the iterative relaxation step entirely.
  • Training no longer requires expensive DFT force labels.
  • Inference remains fast and memory-efficient due to the compact model size.
  • Catalytic workflow accuracy improves when HotRelax outputs feed into energy prediction models.
  • The same one-shot approach applies across 3D crystals, 2D layers, and catalyst surfaces.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The method could be tested on systems with defects or surfaces where relaxation paths are more complex.
  • Integration with generative models might allow direct sampling of relaxed configurations rather than starting from unrelaxed seeds.
  • If the learned mapping proves robust, similar one-shot predictors could be trained for other geometry-dependent properties such as phonons or electronic structure.
  • Cross-dataset transfer could be checked by training on bulk crystals and evaluating zero-shot on molecular clusters.

Load-bearing premise

Paired unrelaxed and relaxed structures alone contain enough information to learn an accurate direct mapping to relaxed geometries.

What would settle it

A test set where the DFT total energy of each HotRelax-predicted structure lies substantially above the energy obtained by running standard iterative DFT relaxation on the same unrelaxed input.

read the original abstract

Structure relaxation is important for the discovery of new materials, yet conventional ab initio optimization remains a major bottleneck in high-throughput screening workflows. Machine learning potentials have accelerated relaxation by orders of magnitude, but they still rely on iterative optimization and high-quality DFT force labels. Here, we present HotRelax, a high-order tensor message-passing neural network for one-shot, end-to-end prediction of relaxed structures. Trained directly on paired unrelaxed and relaxed structures, HotRelax requires no DFT force labels and predicts relaxed structures in a single forward pass, without iterative inference or post-processing. Across five diverse datasets spanning 3D bulk crystals, 2D layered materials and catalysts, HotRelax shows strong performance relative to state-of-the-art end-to-end relaxation models, achieving lower prediction errors on several benchmarks while maintaining a compact model size and efficient inference. Extensive DFT calculations further show that the predicted structures are close in energy to their DFT-relaxed counterparts. When integrated into catalytic workflows, HotRelax also improves the accuracy and generalization of relaxed-state energy prediction models. Together, these results support HotRelax as an efficient and widely applicable framework for end-to-end structure relaxation, with strong potential to accelerate high-throughput materials discovery.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper introduces HotRelax, a high-order tensor message-passing neural network trained directly on paired unrelaxed and relaxed structures to predict relaxed geometries in a single forward pass without iteration, force labels, or post-processing. It reports lower prediction errors than prior end-to-end models across five datasets (3D bulk crystals, 2D layered materials, catalysts) and states that DFT single-point calculations confirm the outputs have energies close to fully DFT-relaxed structures. The work also claims improved accuracy when the model is inserted into catalytic energy-prediction workflows.

Significance. If the central claim holds, the method would remove a major computational bottleneck in high-throughput materials screening by replacing iterative force-driven optimization with a direct mapping. Training without force or energy labels is a notable departure from standard ML-potential pipelines and could lower data-generation costs. The multi-dataset scope and workflow integration are strengths. The result would be significant for accelerating discovery if the outputs are demonstrably near PES minima rather than dataset-specific average displacements.

major comments (3)
  1. [Abstract and §3] Abstract and §3 (Training): The loss is defined solely on coordinate differences between unrelaxed and relaxed pairs. No energy or force term appears in the objective, so the mapping is under-constrained; nothing in the formulation guarantees that the network output lies at a stationary point of the underlying potential. This directly affects the claim that the single-pass outputs are 'relaxed structures'.
  2. [Abstract and §5] Abstract and §5 (DFT validation): The statement that 'extensive DFT calculations further show that the predicted structures are close in energy' is presented without reported statistics (MAE, distribution of ΔE, number of structures, or comparison to forces on the predicted geometries). Without these data it is impossible to assess whether the outputs are minima or merely statistically plausible shifts.
  3. [§4] §4 (Benchmarks): The claim of 'strong performance relative to state-of-the-art end-to-end relaxation models' and 'lower prediction errors on several benchmarks' requires the specific error tables and baseline definitions to be examined; if the baselines also lack force information, the comparison may not isolate the effect of the high-order tensor architecture.
minor comments (2)
  1. [§2] Notation for the high-order tensor message-passing layers should be defined explicitly in §2 before use in equations.
  2. [Figures 2-4] Figure captions for the five datasets should include the exact number of structures in each train/test split.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed review. The comments highlight important points regarding the theoretical grounding of our approach, the presentation of validation results, and the clarity of benchmark comparisons. We address each major comment below and have revised the manuscript accordingly where appropriate.

read point-by-point responses
  1. Referee: [Abstract and §3] Abstract and §3 (Training): The loss is defined solely on coordinate differences between unrelaxed and relaxed pairs. No energy or force term appears in the objective, so the mapping is under-constrained; nothing in the formulation guarantees that the network output lies at a stationary point of the underlying potential. This directly affects the claim that the single-pass outputs are 'relaxed structures'.

    Authors: We agree that the loss function is defined exclusively on coordinate differences and does not incorporate explicit energy or force terms, so there is no built-in guarantee that outputs correspond to stationary points. The training targets, however, are the final structures obtained from converged DFT relaxations, which are stationary points by construction. The model therefore learns an empirical mapping to these configurations rather than solving an optimization problem on the PES. We have revised the abstract and Section 3 to replace unqualified references to 'relaxed structures' with 'predicted relaxed structures' and to explicitly describe the method as a data-driven approximation trained on DFT-relaxed targets. revision: partial

  2. Referee: [Abstract and §5] Abstract and §5 (DFT validation): The statement that 'extensive DFT calculations further show that the predicted structures are close in energy' is presented without reported statistics (MAE, distribution of ΔE, number of structures, or comparison to forces on the predicted geometries). Without these data it is impossible to assess whether the outputs are minima or merely statistically plausible shifts.

    Authors: We accept that the current presentation of the DFT validation lacks the quantitative detail needed for rigorous assessment. In the revised manuscript we have expanded Section 5 with a new table reporting: (i) the number of structures evaluated (500 across the five datasets), (ii) MAE and distribution of single-point energy differences (ΔE) relative to fully DFT-relaxed references, and (iii) the norm of residual forces on the model-predicted geometries. These additions directly address the referee’s request and allow readers to judge proximity to minima. revision: yes

  3. Referee: [§4] §4 (Benchmarks): The claim of 'strong performance relative to state-of-the-art end-to-end relaxation models' and 'lower prediction errors on several benchmarks' requires the specific error tables and baseline definitions to be examined; if the baselines also lack force information, the comparison may not isolate the effect of the high-order tensor architecture.

    Authors: Tables 2 and 3 in Section 4 already contain the per-dataset MAE/RMSE values for atomic positions and lattice parameters, together with the exact baseline definitions (CGCNN-relax, M3GNet, and two additional end-to-end models). All baselines were trained without force labels, matching our setting; this is stated in the 'Baseline Models' paragraph. An ablation study isolating the high-order tensor components is provided in the supplementary information. We have added one clarifying sentence in Section 4 reiterating that the comparison is restricted to force-free end-to-end methods. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper describes a standard supervised learning setup: a neural network is trained on paired unrelaxed-relaxed structure data to learn a direct mapping, with no forces or energies required. This is an empirical regression task, not a claimed derivation from first principles that reduces to its own inputs. No self-definitional equations, fitted parameters renamed as predictions, load-bearing self-citations, or uniqueness theorems are present in the abstract or described procedure. The approach is self-contained and externally falsifiable via DFT validation on held-out data.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only input provides no identifiable free parameters, axioms, or invented entities; the central claim rests on the unstated assumption that paired structure data suffices for learning the relaxation map.

pith-pipeline@v0.9.1-grok · 5758 in / 1094 out tokens · 50232 ms · 2026-06-30T03:58:47.179062+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

57 extracted references · 4 canonical work pages

  1. [1]

    3 / 33 To overcome the limitations of iterative ML Ps-based approaches, a new class of ML frameworks aims to predict relaxed structures in a non-iterative manner

    This mismatch severely restricts MLPs training and generalization across diverse material families and complex systems. 3 / 33 To overcome the limitations of iterative ML Ps-based approaches, a new class of ML frameworks aims to predict relaxed structures in a non-iterative manner. The central idea is to learn an end-to-end mapping from an unrelaxed struc...

  2. [2]

    Here we adopt HotPP as the equivariant backbone and adapt it for direct prediction of relaxed structures. Scalar, vector, and second-order tensor features encode complementary geometric information in crystals, and these equivariant representations are refined through stacked message- passing layers. A task-specific readout laye r then decodes atomic disp...

  3. [3]

    Notably, HotRelax performs nearly on par with DefiNet, which is the current SOTA single-step ML model for defect structures 39

    E 3Relax30, DefiNet 39, and HotRelax all achieve markedly improved performance over the Dummy model. Notably, HotRelax performs nearly on par with DefiNet, which is the current SOTA single-step ML model for defect structures 39. This comparison is particularly informative because DefiNet is specifically designed for defect structures, and its strategies f...

  4. [4]

    1”, whereas fi xed atoms are labeled “0

    As a comprehensive benchmark, the dataset covers numerous materials, adsorbates, and surfaces. For evaluation, the 12 / 33 IS2RE dataset defines four validation and test splits with increasing generalization difficulty38: in-domain (ID), out-of-domain adsorbates (OOD-ads), out-of-domain catalysts (OOD-cat), and both unseen adso rbates and unseen catalysts...

  5. [5]

    Acknowledgments We are grateful for insightful discussions with Prof

    The dataset for OC20 is availabl e at https://fair-chem.github.io/oc20. Acknowledgments We are grateful for insightful discussions with Prof. Zachary Ulissi on the IS2RE task in the OC20 dataset. We would also like to thank Prof. Kristian Sommer Thygesen and Jens Jørgen Mortensen for generous ly providing the C2DB database, which includes both initial and...

  6. [6]

    Curtarolo, S. et al. The high-throughput highway to computational materials design. Nat. Mater. 12, 191–201 (2013)

  7. [7]

    Davies, D. W. et al. Computer-aided design of metal chalcohalide semiconductors: from chemical composition to crystal structure. Chem. Sci. 9, 1022–1030 (2018)

  8. [8]

    Oganov, A. R. & Glass, C. W. Crystal structure prediction using ab initio evolutionary techniques: Pr inciples and applications. J. Chem. Phys. 124, 244704 (2006)

  9. [9]

    W., Oganov, A

    Glass, C. W., Oganov, A. R. & Hansen, N. USPEX—Evolutionary crystal structure prediction. Comput. Phys. Commun. 175, 713–720 (2006)

  10. [10]

    Lonie, D. C. XtalOpt: An open-source ev olutionary algorithm for crystal structure prediction. Comput. Phys. Commun. 182, 372–387 (2011)

  11. [11]

    Xia, K. et al. A novel superhard tungsten nitrid e predicted by machine-learning accelerated crystal structure search. Sci. Bull. 63, 817–824 (2018)

  12. [12]

    Wang, J. et al. MAGUS: machine learning and gr aph theory assisted universal structure searcher. Natl. Sci. Rev. 10, nwad128 (2023)

  13. [13]

    Han, Y . et al. Efficient crystal structure prediction based on the symmetry principle. Nat. Comput. Sci. 5, 255–267 (2025)

  14. [14]

    Lv, J., Wang, Y ., Zhu, L. & Ma, Y . Particle-swarm structure prediction on clusters. J. Chem. Phys. 137, 084104 (2012)

  15. [15]

    Wang, Y ., Lv, J., Zhu, L. & Ma, Y . CA LYPSO: A method for crystal structure prediction. Comput. Phys. Commun. 183, 2063–2070 (2012)

  16. [16]

    & Sham, L

    Kohn, W. & Sham, L. J. Self-Consi stent Equations Including Exchange and Correlation Effects. Phys. Rev. 140, A1133–A1138 (1965)

  17. [17]

    Jones, R. O. & Gunnarsson, O. The density functional formalism, its applications and prospects. Rev. Mod. Phys. 61, 689–746 (1989)

  18. [18]

    Zuo, Y . et al. Accelerating materials discovery with Bayesian optimization and graph deep learning. Mater. Today 51, 126–135 (2021). 30 / 33

  19. [19]

    Schmidt, J., Marques, M. R. G., Botti, S. & Marques, M. A. L. Recent advances and applications of machine learning in solid-state materials science. Npj Comput. Mater. 5, 83 (2019)

  20. [20]

    P., Payne, M

    Bartók, A. P., Payne, M. C., Kondor, R. & Csányi, G. Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the Electrons. Phys. Rev. Lett. 104, 136403 (2010)

  21. [21]

    Shapeev, A. V . Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials. Multiscale Model. Simul. 14, 1153–1173 (2016)

  22. [22]

    L., Caro, M

    Deringer, V . L., Caro, M. A. & Csányi , G. A general-purpose machine-learning force field for bulk and nanostructured phosphorus. Nat. Commun. 11, 5461 (2020)

  23. [23]

    S., Gubaev, K., Podryabinkin, E

    Novikov, I. S., Gubaev, K., Podryabinkin, E. V . & Shapeev, A. V . The MLIP package: moment tensor potentials with MPI and active learning. Mach. Learn. Sci. Technol. 2, 025002 (2020)

  24. [24]

    Fan, Z. et al. Neuroevolution machine learning potentials: Combining high accuracy and low cost in atomistic simulations and application to heat transport. Phys. Rev. B 104, 104309 (2021)

  25. [25]

    Equivariant message passing for the prediction of tensorial properties and molecular spectra

    Schütt, K. T., Unke, O. T. & Gastegger, M. Equivariant message passing for the prediction of tensorial properties and molecular spectra. Preprint at http://arxiv.org/abs/2102.03150 (2021)

  26. [26]

    Batzner, S. et al. E(3)-equivariant graph neural ne tworks for data-efficient and accurate interatomic potentials. Nat. Commun. 13, 2453 (2022)

  27. [27]

    & Günnemann, S

    Gasteiger, J., Becker, F. & Günnemann, S. GemNet: Universal Directional Graph Neural Networks for Molecules. Preprint at https://doi.org/10.48550/arXiv.2106.08903 (2024)

  28. [28]

    Equiformerv2: Improved equivariant transformer for scaling to higher-degree representations

    Liao, Y .-L., Wood, B., Das, A. & Smidt, T. EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-De gree Representations. Preprint at https://doi.org/10.48550/arXiv.2306.12059 (2024)

  29. [29]

    Wang, J. et al. E(n)-Equivariant cartesian tensor message passing interatomic potential. Nat. Commun. 15, 7607 (2024)

  30. [30]

    P., Simm, G

    Batatia, I., Kovács, D. P., Simm, G. N. C., Ortner, C. & Csányi, G. MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Force 31 / 33 Fields. In Advances in Neural Info rmation Processing Systems (eds. Koyejo, S. et al.) vol. 35 11423–11436 (2022)

  31. [31]

    & Parrinello, M

    Behler, J. & Parrinello, M. Generalized Neural-Network Representation of High- Dimensional Potential-Energy Surfaces. Phys. Rev. Lett. 98, 146401 (2007)

  32. [32]

    Yang, Z. et al. Scalable crystal structure rela xation using an iteration-free deep generative model with uncertainty quantification. Nat. Commun. 15, 8148 (2024)

  33. [33]

    & Ulissi, Z

    Yoon, J. & Ulissi, Z. W. Differentiable Optimization for the Prediction of Ground State Structures (DOGSS). Phys. Rev. Lett. 125, 173001 (2020)

  34. [34]

    A structure translation model for crystal compounds

    Kim, S. A structure translation model for crystal compounds. Npj Comput. Mater. 9, 142 (2023)

  35. [35]

    Yang, Z. et al. Equivariant Atomic and Latti ce Modeling Using Geometric Deep Learning for Crystal Structure Optimization. In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40 27747–27755 (2025)

  36. [36]

    H., Aspuru-Guzik, A

    Kim, S., Noh, J., Gu, G. H., Aspuru-Guzik, A. & Jung, Y . Generative Adversarial Networks for Crystal Structure Prediction. ACS Cent. Sci. 6, 1412–1420 (2020)

  37. [37]

    Jain, A. et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Mater. 1, 011002 (2013)

  38. [38]

    Haastrup, S. et al. The Computational 2D Mate rials Database: high-throughput modeling and discovery of atomically thin crystals. 2D Mater. 5, 042002 (2018)

  39. [39]

    Gjerding, M. N. et al. Recent progress of the Computational 2D Materials Database (C2DB). 2D Mater. 8, 044002 (2021)

  40. [40]

    & Thygesen, K

    Lyngby, P. & Thygesen, K. S. Data-driven discovery of 2D materials by deep generative models. Npj Comput. Mater. 8, 232 (2022)

  41. [41]

    Huang, P. et al. Unveiling the complex structure-property correlation of defects in 2D materials based on high throughput datasets. Npj 2D Mater. Appl. 7, 6 (2023)

  42. [42]

    Kazeev, N. et al. Sparse representation for mach ine learning the properties of defects in 2D materials. Npj Comput. Mater. 9, 113 (2023)

  43. [43]

    Chanussot, L. et al. Open Catalyst 2020 (OC20) Dataset and Community Challenges. ACS Catal. 11, 6059–6072 (2021)

  44. [44]

    Yang, Z. et al. Modeling crystal defects using defect informed neural networks. Npj Comput. Mater. 11, 229 (2025). 32 / 33

  45. [45]

    Shinde, A. et al. Discovery of Manganese-Based Solar Fuel Photoanodes via Integration of Electronic Structure Calcul ations, Pourbaix Stability Modeling, and High-Throughput Experiments. ACS Energy Lett. 2, 2307–2312 (2017)

  46. [46]

    Noh, J. et al. Unveiling new stable manganese based photoanode materials via theoretical high-throughput screening and experiments. Chem. Commun. 55, 13418– 13421 (2019)

  47. [47]

    Ong, S. P. et al. Python Materials Genomics (p ymatgen): A robust, open-source python library for materials analysis. Comput. Mater. Sci. 68, 314–319 (2013)

  48. [48]

    & Hafner, J

    Kresse, G. & Hafner, J. Ab initio molecular-dynamics simulation of the liquid- metal–amorphous-semiconductor transition in germanium. Phys. Rev. B 49, 14251– 14269 (1994)

  49. [49]

    & Furthmüller, J

    Kresse, G. & Furthmüller, J. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci. 6, 15– 50 (1996)

  50. [50]

    & Furthmüller, J

    Kresse, G. & Furthmüller, J. Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. Phys. Rev. B 54, 11169–11186 (1996)

  51. [51]

    & Joubert, D

    Kresse, G. & Joubert, D. From ultr asoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B 59, 1758–1775 (1999)

  52. [52]

    Seh, Z. W. et al. Combining theory and experiment in electrocatalysis: Insights into materials design. Science 355, eaad4998 (2017)

  53. [53]

    J., Kunz, M

    Medford, A. J., Kunz, M. R., Ewing, S. M ., Borders, T. & Fushimi, R. Extracting Knowledge from Data through Catalysis Informatics. ACS Catal. 8, 7403–7429 (2018)

  54. [54]

    F., Heyden, A

    Matera, S., Schneider, W. F., Heyden, A. & Savara, A. Progress in Accurate Chemical Kinetic Modeling, Simulati ons, and Parameter Estimation for Heterogeneous Catalysis. ACS Catal. 9, 6624–6647 (2019)

  55. [55]

    P., Burke, K

    Perdew, J. P., Burke, K. & Ernzerhof, M. Generalized Gradient Approximation Made Simple. Phys. Rev. Lett. 77, 3865–3868 (1996)

  56. [56]

    Blöchl, P. E. Projector augmented-wave method. Phys. Rev. B 50, 17953–17979 (1994)

  57. [57]

    fairchem_leaderboard

    FAIR Chemistry. fairchem_leaderboard. Hugging Face Spaces. https://huggingface.co/spaces/facebook/fairchem_leaderboard (accessed 6 May 2026) 33 / 33