Spatial statistics for screening molecular structures
Pith reviewed 2026-05-20 14:15 UTC · model grok-4.3
The pith
Voxelized molecular structures processed with two-point correlations via FFT yield low-dimensional convex representations that enable accurate predictions with very few training examples.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Molecular structures are encoded as voxelized scalar fields, and two-point auto- and cross-correlations are evaluated deterministically via Fast Fourier Transforms, yielding low-dimensional, strictly convex representations that support lean neural networks achieving sub-2% prediction error with as few as 10 training samples across periodic crystals, chemically disordered high-entropy alloys, and non-periodic organic molecules.
What carries the argument
Two-point correlation functions computed via FFT on voxelized scalar fields, followed by principal component analysis, which transfers spatial pattern recognition to a closed-form physics-informed operation and produces convex low-dimensional features for surrogate modeling.
If this is right
- Lean networks with fewer than 100k parameters achieve high accuracy in property prediction for materials screening.
- The representations enable Bayesian active learning and zero-shot extrapolation without large data budgets.
- The framework applies uniformly to periodic crystals, chemically disordered alloys, and non-periodic organic molecules.
- Spatial pattern recognition moves from the learning algorithm to a deterministic physics-based step, avoiding non-convex latent spaces.
- Continuous optimization for inverse design becomes feasible on commodity hardware.
Where Pith is reading between the lines
- The same voxel-correlation pipeline could be tested on larger libraries of candidate molecules to reduce early reliance on expensive DFT calculations.
- Extensions to higher-order correlations might capture more complex interactions while preserving convexity and low dimensionality.
- This representation style could transfer to related data-scarce domains such as protein-ligand binding or polymer property prediction.
- Integration with existing molecular dynamics codes might allow on-the-fly screening without retraining deep models for each new chemistry.
Load-bearing premise
Voxelization of structures followed by two-point correlations and PCA must produce representations that are both strictly convex and sufficiently informative to capture chemically disordered configurations and chiral geometries.
What would settle it
Apply the voxelization-plus-FFT-correlation pipeline to a held-out set of chiral organic molecules or highly disordered alloys and measure whether prediction error remains below 2% with 10-50 training samples or whether the PCA-reduced features fail to distinguish key geometric variants.
Figures
read the original abstract
The dominant paradigm in computational materials discovery relies on heavily parameterized deep architectures, including message-passing graph networks and equivariant models, that require millions of DFT-labeled training structures and produce non-convex latent representations that complicate continuous optimization for inverse design. These architectures are impractical in data-scarce regimes, which is the typical case in molecular screening, and exhibit well-documented limitations in capturing chemically disordered configurations and chiral geometries. This review presents feature engineering based on spatial statistics as a physically rigorous and immediately deployable alternative. Molecular structures are encoded as voxelized scalar fields, and two-point auto- and cross-correlations are evaluated deterministically via Fast Fourier Transforms, explicitly transferring the burden of spatial pattern recognition from the learning algorithm to a closed-form, physics-informed operation. Principal component analysis of the resulting correlation maps yields low-dimensional, strictly convex representations that support lean neural networks (<100k trainable parameters) and non-parametric surrogate models, achieving sub-2% prediction error with as few as 10 training samples. Demonstrated across periodic crystals, chemically disordered high-entropy alloys, and non-periodic organic molecules, this framework enables Bayesian active learning and zero-shot extrapolation on commodity hardware, which current large-scale architectures cannot replicate at equivalent data budgets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes using spatial statistics for molecular screening as an alternative to deep learning. Molecular structures are encoded as voxelized scalar fields; two-point auto- and cross-correlations are computed deterministically via FFT; PCA then produces low-dimensional, strictly convex representations. These support lean neural networks (<100k parameters) or non-parametric surrogates that achieve sub-2% prediction error with as few as 10 training samples. The approach is demonstrated on periodic crystals, chemically disordered high-entropy alloys, and non-periodic organic molecules, enabling Bayesian active learning and zero-shot extrapolation on commodity hardware.
Significance. If the quantitative claims and broad applicability are substantiated, the work would be significant for computational materials discovery. The deterministic, closed-form FFT correlations transfer spatial pattern recognition to a physics-informed operation, yielding convex representations that are advantageous for optimization and data-scarce regimes. This contrasts with heavily parameterized non-convex deep models and could facilitate efficient screening where large DFT datasets are unavailable. The emphasis on reproducible, low-parameter methods is a clear strength.
major comments (2)
- [Abstract] Abstract: The assertion that the method overcomes documented limitations of deep architectures in capturing chiral geometries is not supported by the described operations. For any scalar field f(r), the two-point correlation C(r) = ∫ f(x) f(x+r) dx satisfies C(-r) = C(r) and is invariant under spatial inversion. Consequently, a chiral molecule and its enantiomer produce identical correlation maps and identical PCA coordinates on a symmetric voxel grid. This directly undermines the claim of broad applicability to non-periodic organic molecules unless the paper employs vector fields, oriented descriptors, or higher-order correlations (none of which are indicated).
- [Abstract] Abstract and results sections: The central performance claim of sub-2% prediction error with 10 training samples across structure classes requires explicit validation details (dataset descriptions, error bars, cross-validation protocol, and baseline comparisons) to be load-bearing. Without these, it is impossible to assess whether the voxelization + FFT + PCA pipeline actually delivers the stated accuracy or merely reflects limited test cases.
minor comments (3)
- Clarify the precise voxelization procedure (grid resolution, scalar field definition, handling of periodic boundary conditions in FFT) for periodic versus non-periodic structures, as these choices affect reproducibility.
- Define 'strictly convex representations' more rigorously; PCA coordinates are linear projections and convexity depends on the downstream model and loss, not automatically on the correlation maps themselves.
- Add references to prior literature on two-point correlation functions and spatial statistics in materials science to better situate the contribution.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed review. The comments raise valid points about the scope of our claims and the transparency of our validation. We address each major comment below and will incorporate revisions to clarify the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract: The assertion that the method overcomes documented limitations of deep architectures in capturing chiral geometries is not supported by the described operations. For any scalar field f(r), the two-point correlation C(r) = ∫ f(x) f(x+r) dx satisfies C(-r) = C(r) and is invariant under spatial inversion. Consequently, a chiral molecule and its enantiomer produce identical correlation maps and identical PCA coordinates on a symmetric voxel grid. This directly undermines the claim of broad applicability to non-periodic organic molecules unless the paper employs vector fields, oriented descriptors, or higher-order correlations (none of which are indicated).
Authors: We agree with the referee's analysis: two-point correlations are even functions and therefore invariant under spatial inversion, so the current formulation cannot distinguish enantiomers. The manuscript frames spatial statistics as an alternative to deep architectures primarily for reasons of data efficiency, convexity of the feature space, and applicability to disordered systems rather than as a complete solution to all limitations of deep models. To prevent misinterpretation, we will revise the abstract to specify that the method targets data-scarce regimes and chemically disordered configurations, while noting that chirality discrimination would require extensions such as higher-order correlations or vector fields. This clarification does not change the reported results or the core technical contribution. revision: yes
-
Referee: [Abstract] Abstract and results sections: The central performance claim of sub-2% prediction error with 10 training samples across structure classes requires explicit validation details (dataset descriptions, error bars, cross-validation protocol, and baseline comparisons) to be load-bearing. Without these, it is impossible to assess whether the voxelization + FFT + PCA pipeline actually delivers the stated accuracy or merely reflects limited test cases.
Authors: The full manuscript already contains the requested information: dataset sources and sizes are described in the Methods section, error bars are reported from repeated trials with different random seeds, a 5-fold cross-validation protocol is used throughout, and baseline comparisons (kernel methods and small feed-forward networks) appear in the Results. To make these elements immediately accessible and directly linked to the abstract claims, we will add a concise validation summary table and a short dedicated paragraph in the Results section that explicitly lists the datasets, cross-validation scheme, and baseline errors. This will strengthen the presentation without altering any numerical findings. revision: yes
Circularity Check
No circularity detected in derivation chain
full rationale
The paper encodes structures as voxelized scalar fields and computes two-point auto- and cross-correlations deterministically via FFT, followed by PCA to obtain low-dimensional representations. These steps are closed-form operations independent of any target property values or fitted parameters from the downstream predictions. No equations or claims reduce a prediction to a self-referential fit, self-citation load-bearing premise, or ansatz smuggled from prior work. The representations are generated without reference to the specific error rates or extrapolation performance being claimed, rendering the chain self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Molecular structures are encoded as voxelized scalar fields, and two-point auto- and cross-correlations are evaluated deterministically via Fast Fourier Transforms... Principal component analysis of the resulting correlation maps yields low-dimensional, strictly convex representations
-
IndisputableMonolith/Foundation/AlphaCoordinateFixation.leanJ_uniquely_calibrated_via_higher_derivative unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
lean neural networks (<100k trainable parameters) and non-parametric surrogate models, achieving sub-2% prediction error with as few as 10 training samples
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
AI-Accelerated Materials Discovery in 2026: How Generative Mod- els, Graph Neural Networks, and Autonomous Labs Are Transform- ing R&D | Cypris. URL https://www.cypris.ai/insights/ ai-accelerated-materials-discovery-in-2025-how-generative-models-graph-neural-networks-and-autonomous-labs-are-transforming-r-d
work page 2026
-
[2]
Balachandran, Dezhen Xue, James Theiler, John Hogden, and Turab Lookman
Prasanna V . Balachandran, Dezhen Xue, James Theiler, John Hogden, and Turab Lookman. Adaptive Strategies for Materials Design using Uncertainties.Scientific Reports, 6(1):19660, January 2016. ISSN 2045-2322. doi: 10.1038/srep19660. URL https://www.nature.com/ articles/srep19660
-
[3]
Barry.Voxelized Atomic Structure Framework for Atomistic Modeling of Multifunctional Materials
Matthew C. Barry.Voxelized Atomic Structure Framework for Atomistic Modeling of Multifunctional Materials. Ph.D., Georgia Institute of Technology, United States – Geor- gia, 2023. URL https://www.proquest.com/pqdt/docview/3275477619/abstract/ F86C4D6099394211PQ/2?sourcetype=Dissertations%20&%20Theses
-
[4]
Matthew C. Barry, Kristopher E. Wise, Surya R. Kalidindi, and Satish Kumar. V ox- elized Atomic Structure Potentials: Predicting Atomic Forces with the Accuracy of Quan- tum Mechanics Using Convolutional Neural Networks.The Journal of Physical Chem- istry Letters, 11(21):9093–9099, November 2020. doi: 10.1021/acs.jpclett.0c02271. URL https://doi.org/10.10...
-
[5]
Albert P. Bartók, Mike C. Payne, Risi Kondor, and Gábor Csányi. Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the Electrons.Physical Review Letters, 104(13):136403, April 2010. doi: 10.1103/PhysRevLett.104.136403. URL https: //link.aps.org/doi/10.1103/PhysRevLett.104.136403
-
[6]
Albert P. Bartók, Michael J. Gillan, Frederick R. Manby, and Gábor Csányi. Machine- learning approach for one- and two-body corrections to density functional theory: Applications to molecular and condensed water.Physical Review B, 88(5):054104, August 2013. doi: 10.1103/PhysRevB.88.054104. URL https://link.aps.org/doi/10.1103/PhysRevB. 88.054104
-
[7]
Bartók, Risi Kondor, and Gábor Csányi
Albert P. Bartók, Risi Kondor, and Gábor Csányi. On representing chemical environments. Physical Review B, 87(18):184115, May 2013. doi: 10.1103/PhysRevB.87.184115. URL https://link.aps.org/doi/10.1103/PhysRevB.87.184115
-
[8]
Kovacs, Gregor Simm, Christoph Ortner, and Gabor Csanyi
Ilyes Batatia, David P. Kovacs, Gregor Simm, Christoph Ortner, and Gabor Csanyi. MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Force Fields.Advances in Neural Information Processing Systems, 35:11423–11436, Decem- ber 2022. URL https://proceedings.neurips.cc/paper_files/paper/2022/hash/ 4a36c3c51af11ed9f34615b81edb5b...
work page 2022
-
[9]
Mailoa, Mordechai Kornbluth, Nicola Molinari, Tess E
Simon Batzner, Albert Musaelian, Lixin Sun, Mario Geiger, Jonathan P. Mailoa, Mordechai Kornbluth, Nicola Molinari, Tess E. Smidt, and Boris Kozinsky. E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials.Nature Communications, 13 (1):2453, May 2022. ISSN 2041-1723. doi: 10.1038/s41467-022-29939-5. URL https: //www....
-
[10]
Ramin Bostanabad, Yichi Zhang, Xiaolin Li, Tucker Kearney, L. Catherine Brinson, Daniel W. Apley, Wing Kam Liu, and Wei Chen. Computational microstructure characterization and reconstruction: Review of the state-of-the-art techniques.Progress in Materials Science, 95:1–41, June 2018. ISSN 00796425. doi: 10.1016/j.pmatsci.2018.01.005. URL https: //linkingh...
-
[11]
Brough, Daniel Wheeler, and Surya R
David B. Brough, Daniel Wheeler, and Surya R. Kalidindi. Materials Knowledge Sys- tems in Python—a Data Science Framework for Accelerated Development of Hierarchical Materials.Integrating Materials and Manufacturing Innovation, 6(1):36–53, March 2017. ISSN 2193-9772. doi: 10.1007/s40192-017-0089-0. URL https://doi.org/10.1007/ s40192-017-0089-0
-
[12]
PolyMicros: Boot- strapping a Foundation Model for Polycrystalline Material Structure, May 2025
Michael Buzzy, Andreas Robertson, Peng Chen, and Surya Kalidindi. PolyMicros: Boot- strapping a Foundation Model for Polycrystalline Material Structure, May 2025. URL http://arxiv.org/abs/2506.11055. arXiv:2506.11055 [cs]
-
[13]
Michael O. Buzzy, Andreas E. Robertson, and Surya R. Kalidindi. Statistically conditioned polycrystal generation using denoising diffusion models.Acta Materialia, 267:119746, April
-
[14]
doi: 10.1016/j.actamat.2024.119746
ISSN 13596454. doi: 10.1016/j.actamat.2024.119746. URL https://linkinghub. elsevier.com/retrieve/pii/S1359645424000995
-
[15]
Buzzy, David Montes de Oca Zapiain, Adam P
Michael O. Buzzy, David Montes de Oca Zapiain, Adam P. Generale, Surya R. Kalidindi, and Hojun Lim. Active learning for the design of polycrystalline textures using condi- tional normalizing flows.Acta Materialia, 284:120537, January 2025. ISSN 1359-6454. doi: 10.1016/j.actamat.2024.120537. URL https://www.sciencedirect.com/science/ article/pii/S1359645424008863
-
[16]
Alex D. Casey, Steven F. Son, Ilias Bilionis, and Brian C. Barnes. Prediction of Energetic Material Properties from Electronic Structure Using 3D Convolutional Neural Networks. Journal of Chemical Information and Modeling, 60(10):4457–4473, October 2020. ISSN 1549-9596. doi: 10.1021/acs.jcim.0c00259. URLhttps://doi.org/10.1021/acs.jcim. 0c00259
-
[17]
Ahmet Cecen, Tony Fast, and Surya R. Kalidindi. Versatile algorithms for the computation of 2-point spatial correlations in quantifying material structure.Integrating Materials and Manufacturing Innovation, 5(1):1–15, December 2016. ISSN 2193-9772. doi: 10.1186/ s40192-015-0044-x. URLhttps://doi.org/10.1186/s40192-015-0044-x. 13
-
[18]
Brahamananda Chakraborty, Pranoy Ray, Nandini Garg, and Srikumar Banerjee. High capacity reversible hydrogen storage in titanium doped 2D carbon allotrope Ψ-graphene: Density Functional Theory investigations.International Journal of Hydrogen Energy, 46(5):4154– 4167, January 2021. ISSN 0360-3199. doi: 10.1016/j.ijhydene.2020.10.161. URL https: //www.scien...
-
[19]
Chi Chen, Weike Ye, Yunxing Zuo, Chen Zheng, and Shyue Ping Ong. Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals.Chemistry of Materials, 31(9):3564–3572, May 2019. ISSN 0897-4756. doi: 10.1021/acs.chemmater.9b01294. URL https://doi.org/10.1021/acs.chemmater.9b01294
-
[20]
Kamal Choudhary. Machine learning with force-field-inspired descriptors for materials: Fast screening and mapping energy landscape.Physical Review Materials, 2(8), 2018. doi: 10.1103/PhysRevMaterials.2.083801
-
[21]
Kamal Choudhary and Brian DeCost. Atomistic Line Graph Neural Network for improved materials property predictions.npj Computational Materials, 7(1):1–8, November 2021. ISSN 2057-3960. doi: 10.1038/s41524-021-00650-1. URL https://www.nature.com/ articles/s41524-021-00650-1. Number: 1
-
[22]
Stefano Curtarolo, Gus L. W. Hart, Marco Buongiorno Nardelli, Natalio Mingo, Stefano Sanvito, and Ohad Levy. The high-throughput highway to computational materials design. Nature Materials, 12(3):191–201, March 2013. ISSN 1476-4660. doi: 10.1038/nmat3568. URLhttps://www.nature.com/articles/nmat3568
-
[23]
Jyotirmoy Deb, Ajit Kundu, Nandini Garg, Utpal Sarkar, and Brahmananda Chakraborty. Copper decorated graphyne as a promising nanocarrier for cisplatin anti-cancer drug: A DFT study.Applied Surface Science, 622:156885, 2023
work page 2023
-
[24]
Deneault, Jorge Chang, Jay Myung, Daylond Hooper, Andrew Armstrong, Mark Pitt, and Benji Maruyama
James R. Deneault, Jorge Chang, Jay Myung, Daylond Hooper, Andrew Armstrong, Mark Pitt, and Benji Maruyama. Toward autonomous additive manufacturing: Bayesian optimiza- tion on a 3D printer.MRS Bulletin, 46(7):566–575, July 2021. ISSN 0883-7694, 1938-
work page 2021
-
[25]
URL https://link.springer.com/10.1557/ s43577-021-00051-1
doi: 10.1557/s43577-021-00051-1. URL https://link.springer.com/10.1557/ s43577-021-00051-1
-
[26]
Bowen Deng, Peichen Zhong, KyuJung Jun, Janosh Riebesell, Kevin Han, Christopher J. Bartel, and Gerbrand Ceder. CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling.Nature Machine Intelligence, 5(9):1031–1041, September 2023. ISSN 2522-5839. doi: 10.1038/s42256-023-00716-3. URL https://www. nature.com/articles/...
-
[27]
E(3)-equivariant models cannot learn chirality: Field- based molecular generation, April 2025
Alexandru Dumitrescu, Dani Korpela, Markus Heinonen, Yogesh Verma, Valerii Iakovlev, Vikas Garg, and Harri Lähdesmäki. E(3)-equivariant models cannot learn chirality: Field- based molecular generation, April 2025. URL http://arxiv.org/abs/2402.15864. arXiv:2402.15864 [cs]
-
[28]
David Duvenaud, Dougal Maclaurin, Jorge Aguilera-Iparraguirre, Rafael Gómez-Bombarelli, Timothy Hirzel, Alán Aspuru-Guzik, and Ryan P. Adams. Convolutional Networks on Graphs for Learning Molecular Fingerprints, November 2015. URL http://arxiv.org/abs/1509. 09292. arXiv:1509.09292 [cs]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[29]
ELEC- TRA: A Cartesian Network for 3D Charge Density Prediction with Floating Orbitals, January
Jonas Elsborg, Luca Thiede, Alán Aspuru-Guzik, Tejs Vegge, and Arghya Bhowmik. ELEC- TRA: A Cartesian Network for 3D Charge Density Prediction with Floating Orbitals, January
- [30]
-
[31]
Anatole von Lilienfeld, and Rickard Armiento
Felix Faber, Alexander Lindmaa, O. Anatole von Lilienfeld, and Rickard Armiento. Crystal structure representations for machine learning models of formation energies.International Journal of Quantum Chemistry, 115(16):1094–1101, 2015. ISSN 1097-461X. doi: 10.1002/ qua.24917. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/qua.24917. _eprint: https://on...
-
[32]
Tony Fast and Surya R. Kalidindi. Formulation and calibration of higher-order elas- tic localization relationships using the MKS approach.Acta Materialia, 59(11):4595– 4605, June 2011. ISSN 1359-6454. doi: 10.1016/j.actamat.2011.04.005. URL https: //www.sciencedirect.com/science/article/pii/S1359645411002473
-
[33]
D. T. Fullwood, S. R. Kalidindi, S. R. Niezgoda, A. Fast, and N. Hampson. Gradient-based microstructure reconstructions from distributions using fast Fourier transforms.Materials Science and Engineering: A, 494(1):68–72, October 2008. ISSN 0921-5093. doi: 10.1016/ j.msea.2007.10.087. URL https://www.sciencedirect.com/science/article/pii/ S0921509307019752
work page 2008
-
[34]
David T. Fullwood, Stephen R. Niezgoda, and Surya R. Kalidindi. Microstructure recon- structions from 2-point statistics using phase-recovery algorithms.Acta Materialia, 56 (5):942–948, March 2008. ISSN 13596454. doi: 10.1016/j.actamat.2007.10.044. URL https://linkinghub.elsevier.com/retrieve/pii/S1359645407007458
-
[35]
David T. Fullwood, Stephen R. Niezgoda, Brent L. Adams, and Surya R. Kalidindi. Mi- crostructure sensitive design for performance optimization.Progress in Materials Science, 55(6):477–562, August 2010. ISSN 0079-6425. doi: 10.1016/j.pmatsci.2009.08.002. URL https://www.sciencedirect.com/science/article/pii/S0079642509000760
-
[36]
GemNet: Universal Directional Graph Neural Networks for Molecules, June 2024
Johannes Gasteiger, Florian Becker, and Stephan Günnemann. GemNet: Universal Directional Graph Neural Networks for Molecules, June 2024. URL http://arxiv.org/abs/2106. 08903. arXiv:2106.08903 [physics]
-
[37]
Adam P. Generale, Andreas E. Robertson, Conlain Kelly, and Surya R. Kalidindi. In- verse stochastic microstructure design.Acta Materialia, 271:119877, June 2024. ISSN 13596454. doi: 10.1016/j.actamat.2024.119877. URL https://linkinghub.elsevier. com/retrieve/pii/S1359645424002301
-
[38]
Joshua A. Gomberg, Andrew J. Medford, and Surya R. Kalidindi. Extracting knowledge from molecular mechanics simulations of grain boundaries using machine learning.Acta Materialia, 133:100–108, July 2017. ISSN 1359-6454. doi: 10.1016/j.actamat.2017.05.009. URL https://www.sciencedirect.com/science/article/pii/S1359645417303865
-
[39]
Weiyi Gong and Qimin Yan. Graph-based deep learning frameworks for molecules and solid-state materials.Computational Materials Science, 195:110332, July 2021. ISSN 09270256. doi: 10.1016/j.commatsci.2021.110332. URL https://linkinghub.elsevier. com/retrieve/pii/S0927025621000574
-
[40]
Rishi Gurnani, Deepak Kamal, Huan Tran, Harikrishna Sahu, Kenny Scharm, Usman Ashraf, and Rampi Ramprasad. polyG2G: A Novel Machine Learning Algorithm Applied to the Generative Design of Polymer Dielectrics.Chemistry of Materials, 33(17):7008– 7016, September 2021. ISSN 0897-4756. doi: 10.1021/acs.chemmater.1c02061. URL https://doi.org/10.1021/acs.chemmat...
-
[41]
D. R. Hamann. Generalized norm-conserving pseudopotentials.Physical Review B, 40(5): 2980–2987, August 1989. doi: 10.1103/PhysRevB.40.2980. URL https://link.aps.org/ doi/10.1103/PhysRevB.40.2980
-
[42]
D. R. Hamann. Optimized norm-conserving Vanderbilt pseudopotentials.Physical Review B, 88(8):085117, August 2013. doi: 10.1103/PhysRevB.88.085117. URL https://link.aps. org/doi/10.1103/PhysRevB.88.085117
-
[43]
D. R. Hamann, M. Schlüter, and C. Chiang. Norm-Conserving Pseudopotentials.Physical Review Letters, 43(20):1494–1497, November 1979. doi: 10.1103/PhysRevLett.43.1494. URL https://link.aps.org/doi/10.1103/PhysRevLett.43.1494
-
[44]
Harrington, Conlain Kelly, Vahid Attari, Raymundo Arroyave, and Surya R
Grayson H. Harrington, Conlain Kelly, Vahid Attari, Raymundo Arroyave, and Surya R. Kalidindi. Application of a Chained-ANN for Learning the Process–Structure Mapping in Mg2SixSn1−x Spinodal Decomposition.Integrating Materials and Manufacturing Innovation, 11(3):433–449, September 2022. ISSN 2193-9772. doi: 10.1007/s40192-022-00274-3. URL https://doi.org/...
-
[45]
Trevor Hastings, Mrinalini Mulukutla, Danial Khatamsaz, Daniel Salas, Wenle Xu, Daniel Lewis, Nicole Person, Matthew Skokan, Braden Miller, James Paramore, Brady Butler, Douglas Allaire, Vahid Attari, Ibrahim Karaman, George Pharr, Ankit Srivastava, and Ray- mundo Arróyave. Accelerated multi-objective alloy discovery through efficient bayesian methods: Ap...
-
[46]
Lauri Himanen, Amber Geurts, Adam Stuart Foster, and Patrick Rinke. Data- Driven Materials Science: Status, Challenges, and Perspectives.Advanced Science, 6(21):1900808, 2019. ISSN 2198-3844. doi: 10.1002/advs.201900808. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/advs.201900808. _eprint: https://advanced.onlinelibrary.wiley.com/doi/pdf/10.1002/ad...
-
[47]
P. Hohenberg and W. Kohn. Inhomogeneous Electron Gas.Physical Review, 136(3B):B864– B871, November 1964. doi: 10.1103/PhysRev.136.B864. URL https://link.aps.org/ doi/10.1103/PhysRev.136.B864
-
[48]
Bayesian Optimization in Materials Science
Zhufeng Hou and Koji Tsuda. Bayesian Optimization in Materials Science. In Kristof T. Schütt, Stefan Chmiela, O. Anatole von Lilienfeld, Alexandre Tkatchenko, Koji Tsuda, and Klaus-Robert Müller, editors,Machine Learning Meets Quantum Physics, pages 413–426. Springer International Publishing, Cham, 2020. ISBN 978-3-030-40245-7. doi: 10.1007/ 978-3-030-402...
-
[49]
Bing Huang and O. Anatole von Lilienfeld. Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity.The Journal of Chemical Physics, 145(16):161102, October 2016. ISSN 0021-9606. doi: 10.1063/1. 4964627. URLhttps://doi.org/10.1063/1.4964627
work page doi:10.1063/1 2016
-
[50]
Olexandr Isayev, Corey Oses, Cormac Toher, Eric Gossett, Stefano Curtarolo, and Alexander Tropsha. Universal fragment descriptors for predicting properties of inorganic crystals.Nature Communications, 8(1):15679, June 2017. ISSN 2041-1723. doi: 10.1038/ncomms15679. URLhttps://www.nature.com/articles/ncomms15679
-
[51]
Kevin Maik Jablonka, Giriprasad Melpatti Jothiappan, Shefang Wang, Berend Smit, and Brian Yoo. Bias free multiobjective active learning for materials design and discovery.Nature Com- munications, 12(1):2312, April 2021. ISSN 2041-1723. doi: 10.1038/s41467-021-22437-0. URLhttps://www.nature.com/articles/s41467-021-22437-0
-
[52]
Dejun Jiang, Zhenxing Wu, Chang-Yu Hsieh, Guangyong Chen, Ben Liao, Zhe Wang, Chao Shen, Dongsheng Cao, Jian Wu, and Tingjun Hou. Could Graph Neural Networks Learn Better Molecular Representation for Drug Discovery? A Comparison Study of Descriptor-based and Graph-based Models, September 2020. URL https://www.researchsquare.com/ article/rs-79416/v1. ISSN:...
work page 2020
-
[53]
Seiji Kajita, Nobuko Ohba, Ryosuke Jinnouchi, and Ryoji Asahi. A Universal 3D V oxel Descriptor for Solid-State Material Informatics with Deep Convolutional Neural Networks.Scientific Reports, 7(1):16991, December 2017. ISSN 2045-
work page 2017
-
[54]
URL https://www.nature.com/articles/ s41598-017-17299-w
doi: 10.1038/s41598-017-17299-w. URL https://www.nature.com/articles/ s41598-017-17299-w. Number: 1
-
[55]
Surya Kalidindi, Stephen Niezgoda, I Giacomo L, and Tony Fast. A Novel Framework for Building Materials Knowledge Systems.Computers, Materials & Continua, 17(2):103– 126, 1970. ISSN 1546-2218, 1546-2226. doi: 10.3970/cmc.2010.017.103. URL https: //www.techscience.com/cmc/v17n2/22557
-
[56]
Kalidindi.Hierarchical Materials Informatics
Surya R. Kalidindi.Hierarchical Materials Informatics. Elsevier, 2015. ISBN 978-0-12- 410394-8. doi: 10.1016/C2012-0-07337-1. URL https://linkinghub.elsevier.com/ retrieve/pii/C20120073371
-
[57]
Surya R. Kalidindi. Feature engineering of material structure for AI-based materials knowledge systems.Journal of Applied Physics, 128(4):041103, July 2020. ISSN 0021-8979. doi: 10.1063/5.0011258. URLhttps://doi.org/10.1063/5.0011258. 16
-
[58]
Surya R. Kalidindi, Joshua A. Gomberg, Zachary T. Trautt, and Chandler A. Becker. Ap- plication of data science tools to quantify and distinguish between structures and models in molecular dynamics datasets.Nanotechnology, 26(34):344006, August 2015. ISSN 0957-4484. doi: 10.1088/0957-4484/26/34/344006. URL https://dx.doi.org/10.1088/ 0957-4484/26/34/344006
-
[59]
Kaundinya, Kamal Choudhary, and Surya R
Prathik R. Kaundinya, Kamal Choudhary, and Surya R. Kalidindi. Machine learning ap- proaches for feature engineering of the crystal structure: Application to the prediction of the formation energy of cubic compounds.Physical Review Materials, 5(6):063802, June 2021. doi: 10.1103/PhysRevMaterials.5.063802. URL https://link.aps.org/doi/10.1103/ PhysRevMater...
-
[60]
Kaundinya, Kamal Choudhary, and Surya R
Prathik R. Kaundinya, Kamal Choudhary, and Surya R. Kalidindi. Prediction of the Electron Density of States for Crystalline Compounds with Atomistic Line Graph Neu- ral Networks (ALIGNN).JOM, 74(4):1395–1405, April 2022. ISSN 1543-1851. doi: 10.1007/s11837-022-05199-y. URL https://doi.org/10.1007/s11837-022-05199-y
-
[61]
Steven Kearnes, Kevin McCloskey, Marc Berndl, Vijay Pande, and Patrick Riley. Molecular graph convolutions: moving beyond fingerprints.Journal of Computer-Aided Molecular Design, 30(8):595–608, August 2016. ISSN 1573-4951. doi: 10.1007/s10822-016-9938-8. URLhttps://doi.org/10.1007/s10822-016-9938-8
- [62]
-
[63]
doi: 10.1016/j.commatsci.2021.110356
ISSN 0927-0256. doi: 10.1016/j.commatsci.2021.110356. URL https://www. sciencedirect.com/science/article/pii/S0927025621000811
-
[64]
Conlain Kelly and Surya R. Kalidindi. Thermodynamically-Informed Iterative Neural Oper- ators for heterogeneous elastic localization.Computer Methods in Applied Mechanics and Engineering, 441:117939, June 2025. ISSN 00457825. doi: 10.1016/j.cma.2025.117939. URL https://linkinghub.elsevier.com/retrieve/pii/S0045782525002117
-
[65]
Johnson, Douglas Allaire, and Raymundo Arróyave
Danial Khatamsaz, Brent Vela, Prashant Singh, Duane D. Johnson, Douglas Allaire, and Raymundo Arróyave. Bayesian optimization with active learning of design constraints using an entropy-based approach.npj Computational Materials, 9(1):49, April 2023. ISSN 2057-
work page 2023
-
[66]
URL https://www.nature.com/articles/ s41524-023-01006-7
doi: 10.1038/s41524-023-01006-7. URL https://www.nature.com/articles/ s41524-023-01006-7
-
[67]
W. Kohn and L. J. Sham. Self-Consistent Equations Including Exchange and Correlation Effects.Physical Review, 140(4A):A1133–A1138, November 1965. doi: 10.1103/PhysRev. 140.A1133. URLhttps://link.aps.org/doi/10.1103/PhysRev.140.A1133
-
[68]
G. Kresse and J. Furthmüller. Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set.Computational Materials Science, 6(1): 15–50, July 1996. ISSN 09270256. doi: 10.1016/0927-0256(96)00008-0. URL https: //linkinghub.elsevier.com/retrieve/pii/0927025696000080
-
[69]
G. Kresse and J. Furthmüller. Efficient iterative schemes forab initiototal-energy calculations using a plane-wave basis set.Physical Review B, 54(16):11169–11186, October 1996. ISSN 0163-1829, 1095-3795. doi: 10.1103/PhysRevB.54.11169. URL https://link.aps.org/ doi/10.1103/PhysRevB.54.11169
-
[70]
G. Kresse and D. Joubert. From ultrasoft pseudopotentials to the projector augmented-wave method.Physical Review B, 59(3):1758–1775, January 1999. ISSN 0163-1829, 1095-3795. doi: 10.1103/PhysRevB.59.1758. URL https://link.aps.org/doi/10.1103/PhysRevB.59. 1758
-
[71]
Ajit Kundu, Ankita Jaiswal, Pranoy Ray, Sridhar Sahu, and Brahmananda Chakraborty. Zr doped C 24 fullerene as efficient hydrogen storage material: insights from DFT simula- tions.Journal of Physics D: Applied Physics, 57(49):495502, December 2024. ISSN 0022- 3727, 1361-6463. doi: 10.1088/1361-6463/ad75a1. URL https://iopscience.iop.org/ article/10.1088/13...
-
[72]
Xiangyun Lei and Andrew J. Medford. A Universal Framework for Featurization of Atomistic Systems.The Journal of Physical Chemistry Letters, 13(34):7911–7919, September 2022. doi: 10.1021/acs.jpclett.2c02100. URLhttps://doi.org/10.1021/acs.jpclett.2c02100
-
[73]
Levine, Muhammed Shuaibi, Evan Walter Clark Spotte-Smith, Michael G
Daniel S. Levine, Muhammed Shuaibi, Evan Walter Clark Spotte-Smith, Michael G. Taylor, Muhammad R. Hasyim, Kyle Michel, Ilyes Batatia, Gábor Csányi, Misko Dzamba, Peter Eastman, Nathan C. Frey, Xiang Fu, Vahe Gharakhanyan, Aditi S. Krishnapriyan, Joshua A. Rackers, Sanjeev Raja, Ammar Rizvi, Andrew S. Rosen, Zachary Ulissi, Santiago Vargas, C. Lawrence Zi...
-
[74]
Balachandran, Dezhen Xue, and Ruihao Yuan
Turab Lookman, Prasanna V . Balachandran, Dezhen Xue, and Ruihao Yuan. Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design.npj Computational Materials, 5(1):21, February 2019. ISSN 2057-3960. doi: 10.1038/ s41524-019-0153-8. URLhttps://www.nature.com/articles/s41524-019-0153-8
work page 2019
-
[75]
Silke Maes, Frederik De Ceuster, Marie Van de Sande, and Leen Decin. MACE: a Machine- learning Approach to Chemistry Emulation.Journal of Open Source Software, 10(108):7148, April 2025. ISSN 2475-9066. doi: 10.21105/joss.07148. URL https://joss.theoj.org/ papers/10.21105/joss.07148
- [76]
-
[77]
Andrew Mann and Surya R. Kalidindi. Development of a Robust CNN Model for Cap- turing Microstructure-Property Linkages and Building Property Closures Supporting Ma- terial Design.Frontiers in Materials, 9(851085):1–14, 2022. ISSN 2296-8016. URL https://www.frontiersin.org/articles/10.3389/fmats.2022.851085
-
[78]
Sergei Manzhos and Tucker Jr. Carrington. Neural Network Potential Energy Surfaces for Small Molecules and Reactions.Chemical Reviews, 121(16):10187–10217, August 2021. ISSN 0009-2665. doi: 10.1021/acs.chemrev.0c00665. URL https://doi.org/10.1021/ acs.chemrev.0c00665
-
[79]
Principal components analysis (PCA).Com- puters & Geosciences, 19(3):303–342, March 1993
Andrzej Ma´ckiewicz and Waldemar Ratajczak. Principal components analysis (PCA).Com- puters & Geosciences, 19(3):303–342, March 1993. ISSN 0098-3004. doi: 10.1016/ 0098-3004(93)90090-R. URL https://www.sciencedirect.com/science/article/ pii/009830049390090R
-
[80]
Schoenholz, Muratahan Aykol, Gowoon Cheon, and Ekin Dogus Cubuk
Amil Merchant, Simon Batzner, Samuel S. Schoenholz, Muratahan Aykol, Gowoon Cheon, and Ekin Dogus Cubuk. Scaling deep learning for materials discovery.Nature, 624(7990): 80–85, December 2023. ISSN 1476-4687. doi: 10.1038/s41586-023-06735-9. URL https: //www.nature.com/articles/s41586-023-06735-9
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.