Interpolation and extrapolation of global potential energy surfaces for polyatomic systems by Gaussian processes with composite kernels
Pith reviewed 2026-05-24 18:38 UTC · model grok-4.3
The pith
Gaussian process models with composite kernels build accurate global six-dimensional PES for H3O+ from 500 ab initio points.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Gaussian process regression models of potential energy surfaces trained with composite kernels, selected via the Bayesian information criterion, achieve a root mean square error of 65.8 cm^{-1} for the six-dimensional PES of H3O+ over 0 to 21,000 cm^{-1} using 500 ab initio points. The models can also extrapolate the PES to 21,000 cm^{-1} from 1500 points below 10,000 cm^{-1}. The accuracy is maximized by iteratively increasing kernel complexity and optimizing the distribution of training points.
What carries the argument
Composite kernels for Gaussian process regression, formed by combining multiple kernel functions to capture complex features of the potential energy surface.
If this is right
- GP models with composite kernels produce global PES with high accuracy from small numbers of ab initio points.
- The Bayesian information criterion automates selection of optimal kernel compositions for PES modeling.
- Varying the distribution of training points further improves model accuracy for a fixed number of points.
- GP models with composite kernels enable physical extrapolation of PES to higher energy regions from low-energy training data.
Where Pith is reading between the lines
- The method could reduce the number of expensive ab initio calculations needed for PES of larger polyatomic systems.
- Extrapolation performance could be tested on other molecular systems to establish how far the low-to-high energy transfer generalizes.
- Pairing composite-kernel selection with active learning for point placement might lower the required training set size even further.
Load-bearing premise
Kernel compositions chosen by the Bayesian information criterion generalize to the full PES without overfitting to the particular distribution of the selected training points.
What would settle it
An independent test set of points above 10,000 cm^{-1} yields RMSE substantially larger than 65.8 cm^{-1} for a composite-kernel model trained only on points below 10,000 cm^{-1}.
Figures
read the original abstract
Gaussian process regression has recently emerged as a powerful, system-agnostic tool for building global potential energy surfaces (PES) of polyatomic molecules. While the accuracy of GP models of PES increases with the number of potential energy points, so does the numerical difficulty of training and evaluating GP models. Here, we demonstrate an approach to improve the accuracy of global PES without increasing the number of energy points. The present work reports four important results. First, we show that the selection of the best kernel function for GP models of PES can be automated using the Bayesian information criterion as a model selection metric. Second, we demonstrate that GP models of PES trained by a small number of energy points can be significantly improved by iteratively increasing the complexity of GP kernels. The composite kernels thus obtained maximize the accuracy of GP models for a given distribution of potential energy points. Third, we show that the accuracy of the GP models of PES with composite kernels can be further improved by varying the training point distributions. Fourth, we show that GP models with composite kernels can be used for physical extrapolation of PES. We illustrate the approach by constructing the six-dimensional PES for H$_3$O$^+$. For the interpolation problem, we show that this algorithm produces a global six-dimensional PES for H$_3$O$^+$ in the energy range between zero and $21,000$ cm$^{-1}$ with the root mean square error $65.8$ cm$^{-1}$ using only 500 randomly selected {\it ab initio} points as input. To illustrate extrapolation, we produce the PES at high energies using the energy points at low energies as input. We show that one can obtain an accurate global fit of the PES extending to $21,000$ cm$^{-1}$ based on 1500 potential energy points at energies below $10,000$ cm$^{-1}$.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a Gaussian process regression approach using composite kernels selected by the Bayesian information criterion (BIC) to construct global potential energy surfaces (PES) for polyatomic systems. It reports four results: automated kernel selection, iterative complexity increase for better accuracy, improvement by varying training distributions, and physical extrapolation. For H3O+, it achieves RMSE of 65.8 cm^{-1} for interpolation up to 21,000 cm^{-1} with 500 ab initio points, and demonstrates extrapolation to the same range using 1500 low-energy points below 10,000 cm^{-1}.
Significance. If validated with proper hold-out testing, the method could reduce the number of ab initio points required for accurate global PES while automating kernel composition via BIC, offering a practical advance for computational chemistry of polyatomics. The concrete numerical demonstration on a six-dimensional H3O+ surface and the focus on extrapolation are strengths, though the absence of baseline comparisons and error estimates weakens immediate applicability.
major comments (3)
- [Abstract] Abstract: The reported interpolation RMSE of 65.8 cm^{-1} (500 randomly selected points) is presented without error bars, cross-validation statistics, or direct comparison to non-composite kernels, so it is impossible to determine whether the composite-kernel improvement is statistically meaningful or robust to point-distribution variation.
- [Abstract] Abstract (extrapolation paragraph): The claim that 1500 points below 10,000 cm^{-1} suffice for an accurate global fit to 21,000 cm^{-1} provides no RMSE, hold-out error, or other quantitative metric on the high-energy regime; BIC selection occurs exclusively on the low-energy training distribution, leaving the generalization assumption untested.
- [Abstract] Abstract (four results): No quantitative evidence is supplied for the second and third results (iterative kernel-complexity increase and varying training distributions), so the central assertion that composite kernels maximize accuracy for a given point set rests on the single interpolation number without supporting ablation data.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our manuscript. We address each of the major comments below and indicate the revisions we will make to improve the clarity and completeness of the abstract and supporting evidence.
read point-by-point responses
-
Referee: [Abstract] Abstract: The reported interpolation RMSE of 65.8 cm^{-1} (500 randomly selected points) is presented without error bars, cross-validation statistics, or direct comparison to non-composite kernels, so it is impossible to determine whether the composite-kernel improvement is statistically meaningful or robust to point-distribution variation.
Authors: The abstract provides a summary of the key numerical result. The full manuscript describes the use of BIC for automated kernel selection and includes comparisons through the iterative process. To address the referee's concern, we will revise the abstract to mention that the reported RMSE is obtained after cross-validation and that composite kernels were selected over simpler alternatives based on BIC scores. revision: yes
-
Referee: [Abstract] Abstract (extrapolation paragraph): The claim that 1500 points below 10,000 cm^{-1} suffice for an accurate global fit to 21,000 cm^{-1} provides no RMSE, hold-out error, or other quantitative metric on the high-energy regime; BIC selection occurs exclusively on the low-energy training distribution, leaving the generalization assumption untested.
Authors: We agree that a specific quantitative metric for the extrapolation performance would strengthen the abstract. The manuscript shows the extrapolation by training exclusively on low-energy data and evaluating the model on the full range up to 21,000 cm^{-1}. In the revised version, we will include the RMSE achieved in the high-energy regime to quantify the extrapolation accuracy. Regarding BIC selection, it is performed on the training set as is standard, and the hold-out on high energies tests the generalization. revision: yes
-
Referee: [Abstract] Abstract (four results): No quantitative evidence is supplied for the second and third results (iterative kernel-complexity increase and varying training distributions), so the central assertion that composite kernels maximize accuracy for a given point set rests on the single interpolation number without supporting ablation data.
Authors: The manuscript provides demonstrations and supporting data for the iterative kernel complexity increase and the effect of training distributions in dedicated sections, including accuracy improvements shown in figures. The abstract condenses these into qualitative statements. We will update the abstract to include brief quantitative indications of the improvements from these steps, such as the reduction in error from iterative composition. revision: yes
Circularity Check
No significant circularity; results benchmarked against external ab initio data
full rationale
The paper trains GP models (with BIC-selected composite kernels) on subsets of ab initio points for H3O+ and reports RMSE on other ab initio points for interpolation (500 training points) or on high-energy points for extrapolation (1500 low-energy training points). These are standard held-out evaluations against independent quantum-chemistry calculations, not reductions to the training inputs by construction. No self-citations, uniqueness theorems, or ansatzes are invoked as load-bearing steps in the provided text. The derivation chain consists of data-driven model selection followed by direct numerical comparison to external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- kernel hyperparameters
axioms (2)
- domain assumption PES are sufficiently smooth for GP regression with standard kernels to be applicable
- domain assumption BIC is a reliable metric for selecting kernels that generalize beyond the training set
Reference graph
Works this paper leans on
-
[1]
Specifically, Figure 5 includes 26383 points above 5000 cm−1; 22386 points above 7000 cm−1; 16252 points above 10000 cm−1; and 6950 points above 15000 cm−1. These points thus sample the regions both within and outside the range of coordinates of the training points. The RMSE of the PES thus obtained are shown in Figure 6. As is clear from the numerical val...
-
[2]
The set of ab initio points does not describe this part of the configuration space. There is thus absolutely no information about this part of the 14 configuration space in the training set of energy points. The different curves in Figure 8 correspond to predictions of five different GP models obtained, as before, by GP regression of the training points at ene...
work page 2000
-
[3]
B. J. Braams and J. M. Bowman, Permutationally invariant potential energy surfaces in high dimensionality,Int. Rev. Phys. Chem.28, 577 (2009)
work page 2009
-
[4]
J. N. Murrell, S. Carter, S. C. Farantos, P. Huxley, and A. J. C. Varandas,Molecular Potential Energy Functions(Wiley, Chichester, England, 1984)
work page 1984
-
[5]
T. Hollebeek, T.-S. Ho, and H. Rabitz, Constructing multidimensional molecular potential energy surfaces fromab initiodata, Annu. Rev. Phys. Chem.50, 537 (1999)
work page 1999
-
[6]
M. A. Collins, Molecular potential-energy surfaces for chemical reaction dynamics, Theor. Chem. Acc.108, 313 (2002)
work page 2002
-
[7]
C. M. Handley and P. L. A. Popelier, Potential Energy Surfaces Fitted by Artificial Neural Networks, J. Phys. Chem.A 114, 3371 (2010)
work page 2010
-
[8]
R. V. Krems, Bayesian Machine Learning for Quantum Molecular Dynamics,Phys. Chem. Chem. Phys. 21, 13392 (2019)
work page 2019
-
[9]
Behler, Perspective: Machine learning potentials for atomistic simulations, J
J. Behler, Perspective: Machine learning potentials for atomistic simulations, J. Chem. Phys.145, 170901 (2016)
work page 2016
-
[10]
S. Manzhos and T. Carrington, Jr., A random-sampling high dimensional model representation neural network for building potential energy surfacesJ. Chem. Phys.125, 084109 (2006)
work page 2006
-
[11]
S. Manzhos, X. Wang, R. Dawes, and T. Carrington, Jr., A nested molecule-independent neural network approach for high-quality potential fits,J. Phys. Chem.A 110, 5295 (2006)
work page 2006
-
[12]
J Behler and M Parrinello, Generalized neural-network representation of high-dimensional potential-energy surfaces,Phys. Rev. Lett.98, 146401 (2007)
work page 2007
-
[13]
J. Behler, Neural network potential-energy surfaces in chemistry: a tool for large-scale simu- lations, Phys. Chem. Chem. Phys.13, 17930 (2011)
work page 2011
-
[14]
Behler, Constructing high-dimensional neural network potentials: A tutorial review,Int
J. Behler, Constructing high-dimensional neural network potentials: A tutorial review,Int. J. Quant. Chem. 115, 1032 (2015)
work page 2015
-
[15]
E. Pradesh and A. Brown, A ground state potential energy surface for HONO based on a neural network with exponential fitting functions,Phys. Chem. Chem. Phys.19, 22272 (2017)
work page 2017
-
[16]
A. Leclerc and T. Carrington, Jr., Calculating vibrational spectra with sum of product basis functions without storing full-dimensional vectors or matrices,J. Chem. Phys. 140, 174111 (2014). 19
work page 2014
-
[17]
S. Manzhos, R. Dawes, and T. Carrington, Neural network-based approaches for building high dimensional and quantum dynamics-friendly potential energy surfaces,Int. J. Quant. Chem. 115, 1012 (2015)
work page 2015
-
[18]
J. Chen, X. Xu, X. Xu, and D. H. Zhang, A global potential energy surface for the H2 + OH ↔ H2O + H reaction using neural networks,J. Chem. Phys.138, 154301 (2013)
work page 2013
-
[19]
Q. Liu, X. Zhou, L. Zhou, Y. Zhang, X. Luo, H. Guo, and B. Jiang, Constructing High- Dimensional Neural Network Potential Energy Surfaces for Gas-Surface Scattering and Reac- tions, J. Phys. Chem.C 122, 1761 (2018)
work page 2018
-
[20]
C. M. Handley, G. I. Hawe, D. B. Kellab and P. L. A. Popelier, Optimal construction of a fast and accurate polarisable water potential based on multipole moments trained by machine learning, Phys. Chem. Chem. Phys.11, 6365 (2009)
work page 2009
-
[21]
A. P. Bartók, M. C. Payne, R. Kondor, and G. Csányi, Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the Electrons,Phys. Rev. Lett. 104, 136403 (2010)
work page 2010
-
[22]
A. P. Bartók and G. Csányi, Gaussian approximation potentials: A brief tutorial introduction, Int. J. Quant. Chem.115, 1051 (2015)
work page 2015
- [23]
-
[24]
P. O. Dral, A. Owens, S. N. Yurchenko, and W. Thiel, Structure-based sampling and self- correcting machine learning for accurate calculations of potential energy surfaces and vibra- tional levels,J. Chem. Phys.146, 244108 (2017)
work page 2017
-
[25]
B. Kolb, P. Marshall, B. Zhao, B. Jiang, and Hua Guo, Representing Global Reactive Potential Energy Surfaces Using Gaussian Processes,J. Phys. Chem.A 121, 2552 (2017)
work page 2017
-
[26]
A. Kamath, R. A. Vargas-Hernandez, R. V. Krems, T. Carrington Jr., and S. Manzhos, Neural networks vs Gaussian process regression for representing potential energy surfaces: A compar- ative study of fit quality and vibrational spectrum accuracy,J. Chem. Phys. 148, 241702 (2018)
work page 2018
-
[27]
G. Schmitz and O. Christiansen, Gaussian process regression to accelerate geometry optimiza- tions relying on numerical differentiationJ. Chem. Phys.148, 241704 (2018)
work page 2018
-
[28]
Y. Guan, S. Yang, and D. H. Zhang, Construction of reactive potential energy surfaces with Gaussian process regression: active data selection,Mol. Phys. 116, 823 (2018). 20
work page 2018
- [29]
-
[30]
Y. Guan, S. Yang, and D. H. Zhang, Application of Clustering Algorithms to Partitioning Configuration Space in Fitting Reactive Potential Energy Surfaces,J. Phys. Chem. A 122, 3140 (2018)
work page 2018
-
[31]
A. E. Wiens, A. V. Copan, H. F. Schaefer, Multi-Fidelity Gaussian Process Modeling for Chemical Energy Surfaces,Chem. Phys. Lett.X, in press (2019)
work page 2019
-
[32]
C. Qu, Q. Yu, B. L. Van Hoozen Jr, J. M. Bowman, and R. A. Vargas-Hernandez, Assessing Gaussian Process Regression and Permutationally Invariant Polynomial Approaches To Rep- resent High-Dimensional Potential Energy Surfaces,J. Chem. Theor. Comp.14, 3381 (2018)
work page 2018
-
[33]
T. S. Ho and H. Rabitz, A general method for constructing multidimensional molecular po- tential energy surfaces fromab initiocalculations, J. Chem. Phys.104, 2584 (1996)
work page 1996
-
[34]
T. Hollebeek, T. S. Ho, and H. Rabitz, A fast algorithm for evaluating multidimensional potential energy surfaces,J. Chem. Phys.106, 7223 (1997)
work page 1997
-
[35]
T. S. Ho and H. Rabitz, Reproducing kernel Hilbert space interpolation methods as a paradigm of high dimensional model representations: Application to multidimensional potential energy surface construction,J. Chem. Phys.119, 6433 (2003)
work page 2003
-
[36]
A. Christianen, T. Karman, R. A. Vargas-Hernandez, G. C. Groenenboom and R. V. Krems, Six-dimensional potential energy surface for NaK-NaK collisions: Gaussian process represen- tation with correct asymptotic form,J. Chem. Phys.150, 064106 (2019)
work page 2019
-
[37]
R.V.Krems, Molecules in electromagnetic fields: from ultracold physics to controlled chemistry, Wiley (2018)
work page 2018
-
[38]
R.Vargas-Hernandez, Y. Guan, D. H. Zhang, and R. V. Krems, Bayesian optimization for the inverse scattering problem in quantum reaction dynamics,New J. Phys.(Fast Track Commu- nication) 21, 022001 (2019)
work page 2019
-
[39]
D. K. Duvenaud, H. Nickisch, and C. E. Rasmussen, Additive Gaussian Processes,Adv. Neur. Inf. Proc. Sys.24, 226 (2011)
work page 2011
-
[40]
D. K. Duvenaud, J. Lloyd, R. Grosse, J. B. Tenenbaum, and Z. Ghahramani, Structure Dis- covery in Nonparametric Regression through Compositional Kernel Search,Proceedings of the 30th International Conference on Machine Learning Research28, 1166 (2013)
work page 2013
-
[41]
R.Vargas-Hernandez, J.Sous, M.Berciu, andR.V.Krems, Extrapolatingquantumobservables 21 with machine learning: Inferring multiple phase transitions from properties of a single phase, Phys. Rev. Lett.121, 255702 (2018)
work page 2018
-
[42]
Yu, Q.; Bowman, J. M.ab initio potential for H3O+→ H+ + H2O: A step to a many-body representation of the hydrated proton?J. Chem. Theor. Comp.12, 5284 (2016)
work page 2016
-
[43]
C. E. Rasmussen, and C. K. I. Williams,Gaussian Processes for Machine Learning(The MIT Press, Cambridge, 2006)
work page 2006
-
[44]
Schwarz, Estimating the dimension of a model,The Annals of Statistics6(2), 461 (1978)
G. Schwarz, Estimating the dimension of a model,The Annals of Statistics6(2), 461 (1978)
work page 1978
- [45]
-
[46]
J. Hensman, N. Fusi, N. D. Lawrence,Gaussian processes for big data, Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013), Report number: UAI-P-2013-PG-282-290 (2013)
work page 2013
-
[47]
When Gaussian Process Meets Big Data: A Review of Scalable GPs
H. Liu, Y.-S. Ong, X. Shen, and J. Cai,When Gaussian Process Meets Big Data: A Review of Scalable GPs, arXiv:1807.01065. 22
work page internal anchor Pith review Pith/arXiv arXiv
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.