A Principal Component Analysis-based method to analyse high-resolution spectroscopic data
Pith reviewed 2026-05-25 15:07 UTC · model grok-4.3
The pith
A new pipeline uses Principal Component Analysis on CRIRES time-series spectra to isolate and detect CO and H2O signals in hot Jupiter atmospheres.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that a novel application of Principal Component Analysis to CRIRES time-series spectra, followed by cross-correlation with atmospheric models, successfully extracts the molecular signatures of CO and H2O from the atmospheres of HD209458b and HD189733b without manual intervention, yielding detections in agreement with previous studies.
What carries the argument
Principal Component Analysis applied to time-series spectra to isolate the planetary atmospheric signal before cross-correlation with model templates.
If this is right
- The pipeline automates detection of molecular species in CRIRES datasets of hot Jupiters.
- Results for CO and H2O match those obtained by other techniques on the same targets.
- The method can be run on additional CRIRES observations to search for the same or other molecules.
- Consistency with existing literature supports the pipeline for routine atmospheric characterization.
Where Pith is reading between the lines
- The approach may reduce observer-dependent choices in component selection across multiple datasets.
- Similar PCA isolation could be tested on spectra from other high-resolution instruments.
- Automation opens the possibility of applying the same steps to larger samples of targets observed under comparable conditions.
Load-bearing premise
That Principal Component Analysis removes only noise and systematics while leaving the planetary atmospheric signal undistorted for the cross-correlation step.
What would settle it
Absence of a significant cross-correlation peak at the expected planetary radial velocity when testing the pipeline on the well-studied CO and H2O signals in HD209458b or HD189733b.
Figures
read the original abstract
High-Resolution Spectroscopy (HRS) has been used to study the composition and dynamics of exoplanetary atmospheres. In particular, the spectrometer CRIRES installed on the ESO-VLT has been used to record high-resolution spectra in the Near-IR of gaseous exoplanets. Here we present a new automatic pipeline to analyze CRIRES data-sets. Said pipeline is based on a novel use of Principal Component Analysis (PCA) and Cross-Correlation Function (CCF). The exoplanetary atmosphere is modeled with the $\tau$-REx code using opacities at high temperature from the ExoMol project. In this work, we tested our analysis tools on the detection of CO and H$_2$O in the atmospheres of the hot-Jupiters HD209458b and HD189733b. The results of our pipeline are in agreement with previous results in the literature and other techniques.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an automatic pipeline for processing CRIRES time-series spectra of exoplanet atmospheres. It applies Principal Component Analysis (PCA) to remove telluric and stellar variance, followed by cross-correlation of the residuals against τ-REx atmospheric models that incorporate high-temperature ExoMol opacities. The pipeline is tested on the detection of CO and H2O in the hot Jupiters HD 209458b and HD 189733b, with the abstract stating that the results agree with previous literature detections.
Significance. An automated, reproducible PCA+CCF pipeline could reduce analysis time for high-resolution spectroscopy datasets if the planetary signal is demonstrably preserved. The current manuscript provides no quantitative metrics of agreement or validation of signal fidelity, so the work does not yet establish a clear methodological advance over existing techniques.
major comments (2)
- [Abstract] Abstract: the central claim that 'the results of our pipeline are in agreement with previous results in the literature' is stated without any reported detection significances, CCF peak amplitudes, or direct numerical comparisons to prior studies, rendering the agreement assertion unverifiable from the given information.
- [Methods] Methods (implied PCA step): the pipeline relies on the assumption that the time-varying planetary Doppler signal remains orthogonal to the removed principal components and is not partially projected out; no injection-recovery tests, component-selection diagnostics, or covariance analysis are described that would confirm this preservation, which is load-bearing for any detection claim.
minor comments (1)
- [Abstract] Abstract: the phrase 'novel use of Principal Component Analysis' is used without specifying the technical novelty relative to prior PCA applications in HRS (e.g., component ordering or masking strategy).
Simulated Author's Rebuttal
We thank the referee for their constructive comments. We address each major comment below and indicate the revisions planned.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim that 'the results of our pipeline are in agreement with previous results in the literature' is stated without any reported detection significances, CCF peak amplitudes, or direct numerical comparisons to prior studies, rendering the agreement assertion unverifiable from the given information.
Authors: We agree that the abstract's agreement claim would be strengthened by quantitative metrics. In the revised manuscript we will add the detection significances and CCF peak amplitudes from our pipeline together with direct numerical comparisons to the corresponding literature values for both planets. revision: yes
-
Referee: [Methods] Methods (implied PCA step): the pipeline relies on the assumption that the time-varying planetary Doppler signal remains orthogonal to the removed principal components and is not partially projected out; no injection-recovery tests, component-selection diagnostics, or covariance analysis are described that would confirm this preservation, which is load-bearing for any detection claim.
Authors: The assumption that the Doppler-shifted planetary signal is orthogonal to the dominant static components is standard in the PCA literature for high-resolution spectroscopy. Nevertheless, we acknowledge that explicit validation strengthens the work. We will add injection-recovery tests and component-selection diagnostics to the revised methods section. revision: yes
Circularity Check
No significant circularity; pipeline validated against external literature
full rationale
The paper introduces a PCA+CCF pipeline for CRIRES spectra and reports detections of CO/H2O on HD209458b and HD189733b that match prior independent results. No equations, fitted parameters, or self-citations are shown that reduce the reported signals to quantities defined from the same data by construction. The central claim rests on external agreement rather than internal re-derivation.
Axiom & Free-Parameter Ledger
free parameters (1)
- Number of PCA components retained
axioms (1)
- domain assumption The tau-REx atmospheric models with ExoMol opacities provide a sufficiently accurate template for cross-correlation with the observed spectra.
Reference graph
Works this paper leans on
-
[1]
Agol, E., Cowan, N. B., Knutson, H. A., et al. 2010, ApJ, 721, 1861, doi: 10.1088/0004-637X/721/2/1861 Artigau, ´E., Astudillo-Defru, N., Delfosse, X., et al. 2014, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 9149, Observatory Operations: Strategies, Processes, and Systems V, 914905
-
[2]
Birkby, J. L. 2018, arXiv e-prints, arXiv:1806.04617. https://arxiv.org/abs/1806.04617
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[3]
Birkby, J. L., de Kok, R. J., Brogi, M., et al. 2013, MNRAS, 436, L35, doi: 10.1093/mnrasl/slt107
-
[4]
Snellen, I. A. G. 2017, AJ, 153, 138, doi: 10.3847/1538-3881/aa5c87
-
[5]
2005, A&A, 444, L15, doi: 10.1051/0004-6361:200500201
Bouchy, F., Udry, S., Mayor, M., et al. 2005, A&A, 444, L15, doi: 10.1051/0004-6361:200500201
-
[6]
1965, The Fourier Transform and Its Applications
Bracewell, R. 1965, The Fourier Transform and Its Applications. (New York: McGraw-Hill), 46 and 243
work page 1965
-
[7]
Brogi, M., de Kok, R. J., Albrecht, S., et al. 2016, ApJ, 817, 106, doi: 10.3847/0004-637X/817/2/106
-
[8]
Snellen, I. A. G. 2014, A&A, 565, A124, doi: 10.1051/0004-6361/201423537
-
[9]
Brogi, M., Snellen, I. A. G., de Kok, R. J., et al. 2013, ApJ, 767, 27, doi: 10.1088/0004-637X/767/1/27
-
[10]
Charbonneau, D., Brown, T. M., Noyes, R. W., & Gilliland, R. L. 2002, ApJ, 568, 377, doi: 10.1086/338770
-
[11]
2017, AJ, 154, 39, doi: 10.3847/1538-3881/aa738b de Kok, R
Tinetti, G. 2017, AJ, 154, 39, doi: 10.3847/1538-3881/aa738b de Kok, R. J., Brogi, M., Snellen, I. A. G., et al. 2013, A&A, 554, A82, doi: 10.1051/0004-6361/201321381
-
[12]
Follert, R., Dorn, R. J., Oliva, E., et al. 2014, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 9147, Ground-based and Airborne Instrumentation for Astronomy V, 914719
work page 2014
-
[13]
2014, Nature, 513, 526, doi: 10.1038/nature13785
Fraine, J., Deming, D., Benneke, B., et al. 2014, Nature, 513, 526, doi: 10.1038/nature13785
-
[14]
J., Burrows, A., Charbonneau, D., et al
Grillmair, C. J., Burrows, A., Charbonneau, D., et al. 2008, Nature, 456, 767, doi: 10.1038/nature07574
-
[15]
Horne, K. 1986, Publications of the Astronomical Society of the Pacific, 98, 609, doi: 10.1086/131801 Jolliffe, I. T. 2002, Principal component analysis
-
[16]
Kipping, D. M. 2010, MNRAS, 407, 301, doi: 10.1111/j.1365-2966.2010.16894.x
-
[17]
Knutson, H. A., Charbonneau, D., Noyes, R. W., Brown, T. M., & Gilliland, R. L. 2007, ApJ, 655, 564, doi: 10.1086/510111
-
[18]
Kobayashi, N., Tokunaga, A. T., Terada, H., et al. 2000, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 4008, Optical and IR Telescope Instrumentation and Detectors, ed. M. Iye & A. F. Moorwood, 1056–1066
work page 2000
-
[19]
L., Yang, H., France, K., et al
Linsky, J. L., Yang, H., France, K., et al. 2010, ApJ, 717, 1291, doi: 10.1088/0004-637X/717/2/1291 14 Damiano et al
-
[20]
2000, ApJ, 532, L55, doi: 10.1086/312558
Mazeh, T., Naef, D., Torres, G., et al. 2000, ApJ, 532, L55, doi: 10.1086/312558
-
[21]
K., Kawahara, H., Masuda, K., et al
Nugroho, S. K., Kawahara, H., Masuda, K., et al. 2017, AJ, 154, 221, doi: 10.3847/1538-3881/aa9433
-
[22]
2012, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol
Oliva, E., Origlia, L., Maiolino, R., et al. 2012, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 8446, Ground-based and Airborne Instrumentation for Astronomy IV, 84463T
work page 2012
-
[23]
Piskorz, D., Benneke, B., Crockett, N. R., et al. 2016, ApJ, 832, 131, doi: 10.3847/0004-637X/832/2/131 —. 2017, AJ, 154, 78, doi: 10.3847/1538-3881/aa7dd8
-
[24]
Quirrenbach, A., Amado, P. J., Caballero, J. A., et al. 2014, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 9147, Ground-based and Airborne Instrumentation for Astronomy V, 91471F Redfield, S., Endl, M., Cochran, W. D., & Koesterke, L. 2008, ApJ, 673, L87, doi: 10.1086/527475
-
[25]
Ridden-Harper, A. R., Snellen, I. A. G., Keller, C. U., et al. 2016, A&A, 593, A129, doi: 10.1051/0004-6361/201628448
-
[26]
2003, ApJ, 585, 1038, doi: 10.1086/346105
Seager, S., & Mall´ en-Ornelas, G. 2003, ApJ, 585, 1038, doi: 10.1086/346105
-
[27]
Sing, D. K., Fortney, J. J., Nikolov, N., et al. 2016, Nature, 529, 59, doi: 10.1038/nature16068
-
[28]
2010, Nature, 465, 1049, doi: 10.1038/nature09111
Albrecht, S. 2010, Nature, 465, 1049, doi: 10.1038/nature09111
-
[29]
Tamuz, O., Mazeh, T., & Zucker, S. 2005, MNRAS, 356, 1466, doi: 10.1111/j.1365-2966.2004.08585.x
-
[30]
Tennyson, J., & Yurchenko, S. N. 2012, MNRAS, 425, 21, doi: 10.1111/j.1365-2966.2012.21440.x
-
[31]
Tennyson, J., Yurchenko, S. N., Al-Refaie, A. F., et al. 2016, Journal of Molecular Spectroscopy, 327, 73, doi: 10.1016/j.jms.2016.05.002
-
[32]
2007, Nature, 448, 169, doi: 10.1038/nature06002
Tinetti, G., Vidal-Madjar, A., Liang, M.-C., et al. 2007, Nature, 448, 169, doi: 10.1038/nature06002
-
[33]
Torres, G., Winn, J. N., & Holman, M. J. 2008, ApJ, 677, 1324, doi: 10.1086/529429
-
[34]
Triaud, A. H. M. J., Queloz, D., Bouchy, F., et al. 2009, A&A, 506, 377, doi: 10.1051/0004-6361/200911897
-
[35]
Tsiaras, A., Waldmann, I. P., Rocchetto, M., et al. 2016a, ApJ, 832, 202, doi: 10.3847/0004-637X/832/2/202
-
[36]
Tsiaras, A., Rocchetto, M., Waldmann, I. P., et al. 2016b, ApJ, 820, 99, doi: 10.3847/0004-637X/820/2/99
-
[37]
Tsiaras, A., Waldmann, I. P., Zingales, T., et al. 2018, AJ, 155, 156, doi: 10.3847/1538-3881/aaaf75
-
[38]
2012, A&A, 546, A43, doi: 10.1051/0004-6361/201219310
Venot, O., H´ ebrard, E., Ag´ undez, M., et al. 2012, A&A, 546, A43, doi: 10.1051/0004-6361/201219310
-
[39]
P., Rocchetto, M., Tinetti, G., et al
Waldmann, I. P., Rocchetto, M., Tinetti, G., et al. 2015a, ApJ, 813, 13, doi: 10.1088/0004-637X/813/1/13
-
[40]
P., Tinetti, G., Rocchetto, M., et al
Waldmann, I. P., Tinetti, G., Rocchetto, M., et al. 2015b, ApJ, 802, 107, doi: 10.1088/0004-637X/802/2/107
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.