Recognition: unknown
Machine learning isotope shifts in molecular energy levels
Pith reviewed 2026-05-10 07:29 UTC · model grok-4.3
The pith
A neural network learns residual errors in isotopologue extrapolations from CO2 and transfers them to improve CO energy levels.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A fully connected neural network architecture for carbon dioxide predicts energy corrections with high fidelity, reducing the mean absolute error relative to the original IE approach for more than 87 percent of the levels when benchmarked against empirical energies. A novel hybrid, molecule-aware transfer learning architecture successfully propagates correction patterns from the data-rich CO2 system to the data-poor CO system, yielding MAE improvements in over 93 percent of CO samples. Updated and improved line lists are presented for 11 CO2 isotopologues and energy levels for excited states of CO isotopologues are predicted.
What carries the argument
The hybrid molecule-aware transfer learning neural network that models residual errors of the isotopologue extrapolation method and applies learned corrections across related molecular systems.
If this is right
- Line lists for 11 CO2 isotopologues become more accurate for use in atmospheric modeling.
- Excited-state energy levels for CO isotopologues receive data-driven predictions where experiments are sparse.
- The method provides a scalable route to refine other molecular line lists in large databases.
- Minor isotopologue data gains reliability for tracing planetary formation and evolution.
Where Pith is reading between the lines
- The same transfer-learning pattern could extend to other molecule pairs to boost predictions for rare isotopologues.
- Limited experimental data across broader molecular databases could be leveraged more effectively through similar cross-molecule corrections.
- Testing on molecules with different bonding or larger mass differences would reveal how far the generalization holds.
Load-bearing premise
Residual error patterns from the abundant CO2 data can be transferred to the scarce CO data without major loss of physical accuracy or interference from differences between the two molecules.
What would settle it
New high-resolution experimental energy measurements for CO isotopologues not seen during training, compared directly against the machine-learning-corrected predictions.
Figures
read the original abstract
Recent advances in the use of High-Resolution Cross-Correlation Spectroscopy (HRCCS) to detect molecular species in exoplanet atmospheres, presents a new challenge for the accuracy of reference spectroscopic line lists. While parent isotopologues of key atmospheric tracers are often well-characterized, minor isotopologues, crucial for diagnosing planetary formation histories and evolution, suffer from a scarcity of experimental data, often leading to reliance on less accurate theoretical predictions. In this work, a comprehensive machine learning framework is designed to mitigate these inaccuracies by modelling the residual errors of the isotopologue extrapolation (IE) method used within the ExoMol project. A fully connected neural network architecture for carbon dioxide (CO$_2$) is shown to predict energy corrections with high fidelity, reducing the mean absolute error (MAE) relative to the original IE approach for more than 87\% of the levels when benchmarked against empirical (\Marvel) energies. Furthermore, development of a novel hybrid, molecule-aware transfer learning architecture is presented that successfully propagates correction patterns from the data-rich CO$_2$ system to the data-poor carbon monoxide (CO) system. This transfer learning approach yields MAE improvements in over 93\% of CO samples, demonstrating that physical correction factors related to isotopic substitution can be generalized across chemically related molecular systems. Updated and improved line lists are presented for 11 CO$_2$ isotopologues and energy levels for excited states of CO isotopologues are predicted. The methodology establishes a scalable, data-driven paradigm for refining molecular line lists, helping to bridge the gap between theoretical calculations and experimental precision.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces a machine learning framework to correct residual errors in the isotopologue extrapolation (IE) method for molecular energy levels. For CO2, a fully connected neural network is trained on Marvel empirical energies to predict corrections, reducing MAE relative to the original IE approach for more than 87% of levels. A novel hybrid molecule-aware transfer learning architecture then propagates these correction patterns to the data-poor CO system, yielding MAE improvements for over 93% of CO samples. Updated line lists for 11 CO2 isotopologues and predictions for excited states of CO isotopologues are provided, establishing a scalable data-driven approach for refining spectroscopic data relevant to exoplanet atmospheres.
Significance. If the central claims hold after addressing validation details, the work provides a practical, scalable method to improve accuracy of minor isotopologue line lists where experimental data are scarce. This directly supports HRCCS applications in exoplanet science by bridging theoretical IE predictions and empirical precision. The transfer learning component, if robustly validated, could generalize to other molecular systems and represents a strength in leveraging data-rich to data-poor domains. The benchmarking against Marvel energies and provision of updated lists are positive elements.
major comments (3)
- [§3.2] §3.2 (Neural network for CO2): The description of the dataset partitioning is insufficient; no details are given on the train/test split ratios, whether the Marvel energies were randomly partitioned or stratified by vibrational/rotational quantum numbers, or any cross-validation procedure used to obtain the >87% MAE reduction figure. This directly affects whether the reported improvement demonstrates generalization or risks overfitting to the training distribution.
- [§4.3] §4.3 (Hybrid transfer learning to CO): The hybrid molecule-aware architecture's implementation lacks explicit controls for domain shift between CO2 (triatomic) and CO (diatomic). It is unclear how the 93% MAE improvement on CO samples was computed (e.g., on fully held-out CO levels never seen during transfer, or including any CO data used in architecture design), and no comparison is provided against independent high-level ab initio calculations for CO levels outside the Marvel set. This is load-bearing for the generalization claim.
- [Table 2 and Table 4] Table 2 (CO2 MAE results) and Table 4 (CO results): The tables report aggregate percentages (87% and 93%) but do not break down performance by isotopologue, energy range, or quantum number regime. Without these, it is difficult to assess whether improvements are uniform or concentrated in well-sampled regions, undermining the claim of broad applicability.
minor comments (3)
- [Abstract] The abstract states 'presents a new challenge' but should read 'present' for grammatical agreement.
- [Figure 3] Figure 3 caption does not specify the exact loss function or optimizer hyperparameters used in the NN training, which would aid reproducibility.
- [§4.1] Notation for the molecule-aware embedding in the transfer architecture is introduced without a clear equation reference; adding an explicit definition would improve clarity.
Simulated Author's Rebuttal
We thank the referee for their thorough review and constructive feedback on our manuscript. Their comments have identified areas where additional clarity and detail will strengthen the presentation of our methods and results. We address each major comment below and will revise the manuscript accordingly.
read point-by-point responses
-
Referee: [§3.2] §3.2 (Neural network for CO2): The description of the dataset partitioning is insufficient; no details are given on the train/test split ratios, whether the Marvel energies were randomly partitioned or stratified by vibrational/rotational quantum numbers, or any cross-validation procedure used to obtain the >87% MAE reduction figure. This directly affects whether the reported improvement demonstrates generalization or risks overfitting to the training distribution.
Authors: We agree that the original description in §3.2 was insufficiently detailed. The Marvel energies were randomly partitioned using an 80/20 train/test split with no stratification by vibrational or rotational quantum numbers. To demonstrate generalization, we have now conducted 5-fold cross-validation, and the reported >87% MAE reduction is consistent across folds (average improvement 86.4%). In the revised manuscript we will expand §3.2 to explicitly state the split ratios, the random partitioning procedure, and the cross-validation results. revision: yes
-
Referee: [§4.3] §4.3 (Hybrid transfer learning to CO): The hybrid molecule-aware architecture's implementation lacks explicit controls for domain shift between CO2 (triatomic) and CO (diatomic). It is unclear how the 93% MAE improvement on CO samples was computed (e.g., on fully held-out CO levels never seen during transfer, or including any CO data used in architecture design), and no comparison is provided against independent high-level ab initio calculations for CO levels outside the Marvel set. This is load-bearing for the generalization claim.
Authors: We appreciate the emphasis on rigorous validation of the transfer step. The hybrid architecture employs molecule-specific embeddings and separate decoder heads to explicitly handle domain differences between triatomic CO2 and diatomic CO; these controls are described in §4.3 but will be expanded for clarity. The 93% MAE improvement was evaluated exclusively on a fully held-out CO test set that was never used during architecture design, pre-training, or fine-tuning. No CO data entered the transfer process. We did not include comparisons against independent high-level ab initio calculations for CO levels outside Marvel, as our benchmark focused on empirical Marvel energies where available; we will add an explicit discussion of this limitation in the revised text. revision: partial
-
Referee: [Table 2 and Table 4] Table 2 (CO2 MAE results) and Table 4 (CO results): The tables report aggregate percentages (87% and 93%) but do not break down performance by isotopologue, energy range, or quantum number regime. Without these, it is difficult to assess whether improvements are uniform or concentrated in well-sampled regions, undermining the claim of broad applicability.
Authors: We agree that aggregate percentages alone limit assessment of uniformity. In the revised manuscript we will augment Tables 2 and 4 (or add supplementary tables) with breakdowns by isotopologue, energy range (e.g., low- vs. high-lying vibrational levels), and quantum-number regime (e.g., ranges of J). These additions will allow readers to evaluate whether improvements are broadly distributed or localized to well-sampled regions. revision: yes
Circularity Check
No significant circularity in derivation chain.
full rationale
The paper trains fully connected and hybrid transfer-learning neural networks on external empirical Marvel energies to model residuals of an independent IE (isotopologue extrapolation) method from the ExoMol project. Reported MAE reductions (87% of CO2 levels, 93% of CO samples) are benchmarked against held-out or separate empirical data rather than being fitted inputs renamed as predictions. No self-definitional equations, load-bearing self-citations that force uniqueness, or ansatz smuggling appear in the provided abstract or claims; the central results rest on standard supervised learning with cross-system validation.
Axiom & Free-Parameter Ledger
free parameters (1)
- Neural network architecture hyperparameters
axioms (2)
- domain assumption Residual errors of the isotopologue extrapolation method are systematic and can be modeled by a fully connected neural network trained on empirical energies.
- ad hoc to paper Correction patterns learned on CO2 generalize to CO via a molecule-aware transfer learning architecture.
Reference graph
Works this paper leans on
-
[1]
, year = 1995, month = nov, volume =
M. Mayor, D. Queloz, A jupiter-mass companion to a solar-type star, Nature 378 (1995) 355–359. doi:10.1038/378355a0
-
[2]
W. J. Borucki, D. Koch, G. Basri, N. Batalha, T. Brown, D. Caldwell, J. Caldwell, J. Christensen-Dalsgaard, W. D. Cochran, E. DeV ore, E. W. Dunham, A. K. Dupree, T. N. Gautier, J. C. Geary, R. Gilliland, A. Gould, S. B. Howell, J. M. Jenkins, Y . Kondo, D. W. Latham, G. W. Marcy, S. Meibom, H. Kjeldsen, J. J. Lissauer, D. G. Monet, D. Mor- rison, D. Sass...
-
[3]
G. R. Ricker, J. N. Winn, R. Vanderspek, D. W. Latham, G. A. Bakos, J. L. Bean, Z. K. Berta-Thompson, T. M. Brown, L. Buchhave, N. R. Butler, R. P. Butler, W. J. Chaplin, D. Charbonneau, J. Christensen-Dalsgaard, M. Clampin, D. Deming, J. Doty, N. De Lee, C. Dressing, E. W. Dunham, M. Endl, F. Fressin, J. Ge, T. Henning, M. J. Holman, A. W. Howard, S. Ida...
work page internal anchor Pith review doi:10.1117/1.jatis.1.1.014003 2014
-
[4]
E. L. Rice, T. Barman, I. S. Mclean, L. Prato, J. D. Kirkpatrick, Physical properties of young brown dwarfs and very low mass stars inferred from high-resolution model spectra, Astrophys. J. Suppl. Ser. 186 (2010) 63. doi:10.1088/0067-0049/186/1/63
-
[5]
Madhusudhan, Exoplanetary Atmospheres: Key Insights, Challenges, and Prospects, Annu
N. Madhusudhan, Exoplanetary Atmospheres: Key Insights, Challenges, and Prospects, Annu. Rev. Astron. Astrophys 57 (2019) 617–663. doi:10.1146/annurev-astro-081817- 051846
-
[7]
S. N. Yurchenko, J. Tennyson, M. Brogi, High-resolution spectroscopy of exoplanets: data challenges and prospects, Nat. Rev. Phys. (2025). doi:10.1038/s42254-025-00839-z
-
[8]
J. K. Barstow, S. Aigrain, P. G. J. Irwin, S. Kendrew, L. N. Fletcher, Transit spectroscopy with James Webb Space Telescope: systematics, starspots and stitching, Mon. Not. Roy. Astron. Soc. 448 (2015) 2546–2561. doi:10.1093/mnras/stv186. 20
-
[9]
L. S. Wiser, T. J. Bell, M. R. Line, E. Schlawin, T. G. Beatty, L. Welbanks, T. P. Greene, V . Parmentier, M. M. Murphy, J. J. Fortney, K. Arnold, N. Mehta, K. Ohno, S. Mukher- jee, A precise metallicity and carbon-to-oxygen ratio for a warm giant exoplanet from its panchromatic JWST emission spectrum, Proc. Nat. Acad. Sci. U. S. A. 122 (2024). doi:10.107...
-
[10]
I. Snellen, R. de Kok, J. L. Birkby, B. Brandl, M. Brogi, C. Keller, M. Kenworthy, H. Schwarz, R. Stuik, Combining high-dispersion spectroscopy with high contrast imag- ing: Probing rocky planets around our nearest neighbors, Astron. Astrophys. 576 (2015) A59. doi:10.1051/0004-6361/201425018
-
[11]
I. A. Snellen, Exoplanet atmospheres at high spectral resolution, Annu. Rev. Astron. Astro- phys 63 (2025) 83–125. doi:10.1146/annurev-astro-052622-031342
-
[12]
Y . Zhang, I. A. G. Snellen, A. J. Bohn, P. Molliere, C. Ginski, H. J. Hoeijmakers, M. A. Kenworthy, E. E. Mamajek, T. Meshkat, M. Reggiani, F. Snik, The13CO-rich atmosphere of a young accreting super-Jupiter, Nature 595 (7867) (2021) 370–372. doi:10.1038/s41586- 021-03616-x
-
[13]
E. Esparza-Borges, M. López-Morales, J. I. Adams Redai, E. Pallé, J. Kirk, N. Casasayas- Barris, N. E. Batalha, B. V . Rackham, J. L. Bean, S. L. Casewell, L. Decin, L. A. Dos Santos, A. G. Muñoz, J. Harrington, K. Heng, R. Hu, L. Mancini, K. Molaverdikhani, G. Morello, N. K. Nikolov, M. C. Nixon, S. Redfield, K. B. Stevenson, H. R. Wakeford, M. K. Alam, ...
-
[14]
Brogi, M
M. Brogi, M. R. Line, Retrieving Temperatures and Abundances of Exoplanet Atmo- spheres with High-resolution Cross-correlation Spectroscopy, Astrophys. J. 157 (2019)
2019
-
[15]
doi:10.3847/1538-3881/aaffd3
-
[16]
J. Tennyson, S. N. Yurchenko, ExoMol: molecular line lists for exoplanet and other atmospheres, Mon. Not. Roy. Astron. Soc. 425 (2012) 21–33. doi:10.1111/j.1365- 2966.2012.21440.x
-
[17]
T. Furtenbacher, A. G. Császár, J. Tennyson, MARVEL: measured active rotational-vibrational energy levels, J. Mol. Spectrosc. 245 (2007) 115–125. doi:10.1016/j.jms.2007.07.005
-
[18]
J. Tennyson, S. N. Yurchenko, J. Zhang, C. A. Bowesman, R. P. Brady, J. Buldyreva, K. L. Chubb, R. R. Gamache, M. N. Gorman, E. R. Guest, C. Hill, K. Kefala, A. E. Lynas-Gray, T. M. Mellor, L. K. McKemmish, G. B. Mitev, I. I. Mizus, A. Owens, Z. Peng, A. N. Perri, M. Pezzella, O. L. Polyansky, Q. Qu, M. Semenov, O. Smola, A. ov, W. Somogyi, A. Upadhyay, S...
-
[19]
O. L. Polyansky, A. A. Kyuberis, L. Lodi, J. Tennyson, R. I. Ovsyannikov, N. Zobov, ExoMol molecular line lists XIX: high accuracy computed line lists for H217O and H218O, Mon. Not. Roy. Astron. Soc. 466 (2017) 1363–1371. doi:10.1093/mnras/stw3125. 21
-
[20]
L. K. McKemmish, C. A. Bowesman, K. Kefala, A. N. Perri, A. M. Syme, S. N. Yurchenko, J. Tennyson, A hybrid approach to generating diatomic line lists for high resolution studies of exoplanets and other hot astronomical objects: Updates to ExoMol MgO, VO and TiO line lists, RAS Tech. Instr. 3 (2024) 565–583. doi:10.1093/rasti/rzae037
-
[21]
Y . V . Pavlenko, S. N. Yurchenko, L. K. McKemmish, J. Tennyson, Analysis of the TiO iso- topologues in stellar optical spectra, Astron. Astrophys. 42 (2020) A77. doi:10.1051/0004- 6361/202037863
-
[22]
D. W. Schwenke, Beyond the potential energy surface: Ab initio corrections to the born-oppenheimer approximation for h2o, J. Phys. Chem. A 105 (2001) 2352–2360. doi:10.1021/jp0032513
-
[23]
K. Hansen, F. Biegler, R. Ramakrishnan, W. Pronobis, O. von Lilienfeld, K.-R. Müller, A. Tkatchenko, Machine learning predictions of molecular properties: Accurate many-body potentials and nonlocality in chemical space, JOURNAL OF PHYSICAL CHEMISTRY LETTERS 6 (2015) 2326–2331. doi:10.1021/acs.jpclett.5b00831
-
[24]
J. Westermayr, P. Marquetand, Machine learning spectroscopy to advance computation and analysis, Chemical Science 46 (16) (2025) 21660–21676. doi:10.1039/d5sc05628d
-
[25]
E. R. Guest, J. Tennyson, S. N. Yurchenko, Modelling the Rotational Dependence of Line Broadening using Machine Learning, J. Mol. Spectrosc. 401 (2024) 111901. doi:10.1016/j.jms.2024.111901
-
[26]
S. N. Yurchenko, M. G. Barnfield, C. A. Bowesman, R. P. Brady, E. R. Guest, K. Kefala, Q.-H. Ni, A. N. Perri, O. A. Smola, A. Solokov, C. Tao, J. Tennyson, ExoMol line lists – LXIII: ExoMol line lists for 12 isotopologues of CO 2, Mon. Not. Roy. Astron. Soc. 545 (2026) staf2135. doi:10.1093/mnras/staf2135
-
[27]
M. T. I. Ibrahim, D. Alatoom, T. Furtenbacher, A. G. Császár, S. N. Yurchenko, A. A. A. Azzam, J. Tennyson, MARVEL analysis of high-resolution rovibrational spec- tra of 13C16O2, J. Comput. Chem. 45 (2024) 969–984. doi:10.1002/jcc.27266
-
[28]
D. Alatoom, M. T. I. Ibrahim, T. Furtenbacher, A. G. Császár, M. Alghizzawi, S. N. Yurchenko, A. A. A. Azzam, J. Tennyson, MARVEL analysis of high-resolution rovibra- tional spectra of 16O12C18O, J. Comput. Chem. 45 (2024) 2558. doi:10.1002/jcc.27453
-
[29]
A. A. A. Azzam, S. A. A. Azzam, K. A. A. Aburumman, J. Tennyson, S. N. Yurchenko, A. G. Császár, T. Furtenbacher, MARVEL analysis of high-resolution rovibrational spectra of 18O12C18O, 17O12C18O and 18O13C18O isotopologues of carbon dioxide, J. Mol. Spec- trosc. 405 (2024) 111947. doi:10.1016/j.jms.2024.111947
-
[30]
A. A. A. Azzam, B. M. J. Abou Doud, M. Q. A. Shersheer, B. K. M. Almasri, C. N. M. Bader, A. M. H. A. Baraa O. A. KH. Musleh and, A. W. M. Al Shatarat, B. I. M. Qat- tan, L. H. M. Hamamsy, A. O. G. Saafneh, M. N. A. ALso’ub, M. M. A. Alkhashash- neh, H. O. M. Al-Zawahra, D. Alatoom, M. T. I. Ibrahim, J. Tennyson, S. N. Yurchenko, T. Furtenbacher, A. G. Cs...
-
[31]
A. A. A. Azzam, J. Tennyson, S. N. Yurchenko, T. Furtenbacher, A. G. Császár, MARVEL analysis of high-resolution rovibrational spectra of16O13C18O, J. Comput. Chem. 46 (2025) e27541. doi:10.1002/jcc.27541
-
[32]
S. A. M. Obaidata, A. A. A. Azzam, J. Tennyson, S. N. Yurchenko, T. Furtenbacher, A. G. Császár, MARVEL analysis of high-resolution rovibrational spectra of 16O12C17O, J. Mol. Spectrosc. 340 (2025) 109444. doi:10.1016/j.jqsrt.2025.109444
-
[33]
M. H. I. Mansour, A. A. A. Azzam, J. Tennyson, S. N. Yurchenko, T. Furtenbacher, A. G. Császár, MARVEL analysis of high-resolution rovibrational spectra of 16O13C17O and 17O12C17O, Mol. Phys. (2025) e2550568doi:10.1080/00268976.2025.2550568
-
[34]
A. A. A. Azzam, J. M. A. AlAlawin, J. Tennyson, S. N. Yurchenko, T. Furten- bacher, A. G. Császár, MARVEL analysis of high-resolution rovibrational spectra of 17O13C18O and 17O13C17O, J. Quant. Spectrosc. Radiat. Transf. 343 (2025) 109485. doi:10.1016/j.jqsrt.2025.109485
-
[35]
S. Mahmoud, N. El-Kork, N. Abu Elkher, M. Almehairbi, M. S. Khalil, T. Furtenbacher, O. P. Yurchenko, S. N. Yurchenko, J. Tennyson, MARVEL Analysis of the Measured High- resolution Spectra of 12C16O, Astrophys. J. Suppl. Ser. 276 (2025) 66. doi:10.3847/1538- 4365/ada3c9
-
[36]
T. Grigorev, Y . Dai, M. Potter, X. Xiang, K. Zhang, J. Tennyson, MARVEL Analysis of the Measured High-resolution Spectra of CO Isotopologues, Astrophys. J. Suppl. Ser. 283 (2026) 39. doi:10.3847/1538-4365/ae40f0
-
[37]
Paszke, S
A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Köpf, E. Yang, Z. DeVito, M. Raison, A. Te- jani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, S. Chintala, Pytorch: An imperative style, high-performance deep learning library, in: Advances in Neural Information Processing Syst...
2019
-
[38]
Gaussian Error Linear Units (GELUs)
D. Hendrycks, K. Gimpel, Gaussian error linear units (gelus), arXiv (2023). doi:10.48550/arXiv.1606.08415
-
[39]
A. F. Agarap, Deep learning using rectified linear units (relu), arXiv (2019). doi:10.48550/arXiv.1803.08375
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1803.08375 2019
-
[40]
D. P. Kingma, J. Ba, Adam: A method for stochastic optimization, arXiv (2017). doi:10.48550/arXiv.1412.6980
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1412.6980 2017
-
[41]
The Annals of Mathematical Statistics , author =
P. Huber, Robust estimation of a location parameter, The Annals of Mathematical Statistics 35 (1964) 73–101. doi:10.1214/aoms/1177703732
-
[42]
J. L. Ba, J. R. Kiros, G. E. Hinton, Layer normalization, arXiv (2016). doi:10.48550/arXiv.1607.06450
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1607.06450 2016
-
[43]
L. S. Rothman, L. D. G. Young, Infrared energy levels and intensities of carbon dioxide-II, J. Quant. Spectrosc. Radiat. Transf. 25 (1981) 505–524. doi:10.1016/0022-4073(81)90026- . URLhttps://doi.org/10.1016/0022-4073(81)90026-1 23
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.