Scalable Dark Siren Cosmology with gwcosmo: GPU Acceleration, Validation and Systematics
Pith reviewed 2026-05-25 03:17 UTC · model grok-4.3
The pith
A GPU-accelerated gwcosmo processes full gravitational-wave catalogues 1000 times faster than the prior CPU version.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The upgraded gwcosmo leverages vectorisation on graphics processing units to process the entire gravitational-wave catalogue in parallel with each iteration. This achieves a speed-up of 1000 times over the previous version, facilitating analyses of O5-like numbers of GW events on wall-clock timescales of hours. The results demonstrate the scalability of the gwcosmo pipeline for the increasing computational load of expanding event catalogues.
What carries the argument
The GPU vectorization and parallel implementation of the gwcosmo cosmological inference pipeline, which integrates over each gravitational-wave source's localisation volume.
If this is right
- Analyses of O5-scale gravitational-wave catalogues become feasible within hours rather than days or weeks.
- Complete event lists, including quieter sources, can be retained in dark-siren inference without computational truncation.
- The pipeline scales to the growing size of future gravitational-wave catalogues.
- Population-level hierarchical analyses can incorporate the full catalogue information at each iteration.
Where Pith is reading between the lines
- Faster runtimes could support updating cosmological constraints each time a new confident event is added to the catalogue.
- The same parallelisation strategy may transfer to other hierarchical inference problems that integrate over localisation volumes.
- Validation against the CPU baseline lowers the chance that hardware-specific rounding alters final cosmological constraints.
Load-bearing premise
The GPU implementation reproduces the cosmological inference results of the prior CPU version without numerical discrepancies or biases.
What would settle it
A side-by-side run of the CPU and GPU versions on identical event data that produces statistically different cosmological parameter posteriors would show the acceleration changes the inference.
Figures
read the original abstract
As the number of confident gravitational-wave detections grows, population-level hierarchical analyses face increasing computational costs. Dark-siren cosmological inference integrates over the localisation volume of each gravitational-wave source. To remain feasible without discarding information from the quieter but more numerous sources in the catalogue, significant efficiency improvements are vital for analysis pipelines. In this work, we present an upgraded version of the cosmological inference pipeline gwcosmo, which leverages vectorisation on graphics processing units to process the entire gravitational-wave catalogue in parallel with each iteration. This new implementation achieves a speed-up of 1000 times over the previous version, facilitating analyses of O5-like numbers of GW events on wall-clock timescales of hours. Our results demonstrate the scalability of the gwcosmo pipeline, specifically its ability to handle the increasing computational load of expanding event catalogues, positioning it as a vital tool for future advances in dark-siren cosmology.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an upgraded version of the gwcosmo pipeline for dark-siren cosmological inference. It uses GPU vectorization and parallel processing to analyze entire gravitational-wave catalogues simultaneously, claiming a 1000x wall-clock speedup over the prior CPU implementation. This enables feasible analyses of O5-scale event numbers (hundreds of events) in hours rather than longer timescales, with results demonstrating scalability and including validation against previous versions plus systematics studies.
Significance. If the GPU implementation is shown to reproduce the CPU results without introducing numerical biases or discrepancies in the recovered cosmological parameters (particularly H0), the work would be significant for the field. It directly addresses the computational bottleneck in hierarchical dark-siren analyses as GW catalogues grow, providing a practical tool for future LIGO-Virgo-KAGRA observing runs and positioning gwcosmo as scalable for population-level cosmology.
major comments (2)
- [Validation section] Validation section: The central claim of a 1000x speedup enabling O5 analyses rests on the assumption that the GPU-parallel implementation produces statistically identical posteriors to the validated CPU version. No quantitative metrics (e.g., KL divergence between H0 posteriors, maximum parameter shifts, or bias in recovered cosmology) are reported for the full catalogue or for low-SNR edge cases, which could be affected by floating-point reductions or altered marginalization over localization volumes.
- [§3 (GPU Implementation)] §3 (GPU Implementation): The description of vectorization and parallel catalogue processing does not specify how the marginalization over each event's localization volume is preserved exactly under GPU reductions; any deviation here would undermine the claim that results are equivalent to the CPU version without additional validation tests.
minor comments (2)
- [Abstract] The abstract states 'results demonstrate the scalability' but does not preview the specific validation metrics or systematics tests performed; adding one sentence on these would improve clarity.
- Figure captions for speedup benchmarks should explicitly state the hardware configuration (GPU model, CPU baseline) and event catalogue size used for the 1000x factor.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback and for recognizing the potential impact of the GPU-accelerated gwcosmo pipeline. We address each major comment below and have revised the manuscript to strengthen the validation and implementation sections.
read point-by-point responses
-
Referee: [Validation section] Validation section: The central claim of a 1000x speedup enabling O5 analyses rests on the assumption that the GPU-parallel implementation produces statistically identical posteriors to the validated CPU version. No quantitative metrics (e.g., KL divergence between H0 posteriors, maximum parameter shifts, or bias in recovered cosmology) are reported for the full catalogue or for low-SNR edge cases, which could be affected by floating-point reductions or altered marginalization over localization volumes.
Authors: We agree that quantitative metrics strengthen the validation. The revised manuscript now includes KL divergence between the GPU and CPU H0 posteriors for the full catalogue and for low-SNR subsets, along with maximum parameter shifts and checks for bias in recovered cosmology. These confirm statistical equivalence within numerical precision. revision: yes
-
Referee: [§3 (GPU Implementation)] §3 (GPU Implementation): The description of vectorization and parallel catalogue processing does not specify how the marginalization over each event's localization volume is preserved exactly under GPU reductions; any deviation here would undermine the claim that results are equivalent to the CPU version without additional validation tests.
Authors: We have expanded §3 to detail the marginalization procedure. The GPU implementation sums over identical localization samples as the CPU version using double-precision reductions, preserving the exact marginalization without approximation or deviation. revision: yes
Circularity Check
No circularity in computational speedup claim
full rationale
The paper describes a GPU-vectorized reimplementation of the existing gwcosmo pipeline. The 1000x wall-clock speedup is obtained by parallel processing of the catalogue and is independent of any cosmological model, parameter fitting, or derivation. Validation against the prior CPU version is presented as an empirical reproduction check rather than a self-referential step. No equations, ansatzes, uniqueness theorems, or self-citations are invoked in a load-bearing way that reduces the central claim to its own inputs by construction. The result is a standard engineering improvement whose correctness can be verified externally via direct comparison of posteriors.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
B. F. Schutz, Determining the Hubble Constant from Gravitational Wave Observations, Nature323, 310 (1986). 13
work page 1986
-
[2]
D. E. Holz and S. A. Hughes, Using gravitational- wave standard sirens, Astrophys. J.629, 15 (2005), arXiv:astro-ph/0504616
work page internal anchor Pith review Pith/arXiv arXiv 2005
-
[3]
Morescoet al., Unveiling the Universe with emerg- ing cosmological probes, Living Rev
M. Morescoet al., Unveiling the Universe with emerg- ing cosmological probes, Living Rev. Rel.25, 6 (2022), arXiv:2201.07241 [astro-ph.CO]
-
[5]
Exploring short gamma-ray bursts as gravitational-wave standard sirens
S. Nissanke, D. E. Holz, S. A. Hughes, N. Dalal, and J. L. Sievers, Exploring short gamma-ray bursts as gravitational-wave standard sirens, Astrophys. J.725, 496 (2010), arXiv:0904.1017 [astro-ph.CO]
work page internal anchor Pith review Pith/arXiv arXiv 2010
-
[6]
Abbott, B. P.et al., A gravitational-wave standard siren measurement of the hubble constant, Nature551, 85–88 (2017)
work page 2017
-
[7]
S. R. Taylor, J. R. Gair, and I. Mandel, Cosmology us- ing advanced gravitational-wave detectors alone, Physi- cal Review D85, 10.1103/physrevd.85.023535 (2012)
-
[8]
W. M. Farr, M. Fishbach, J. Ye, and D. E. Holz, A future percent-level measurement of the hubble expansion at redshift 0.8 with advanced ligo, The Astrophysical Jour- nal Letters883, L42 (2019)
work page 2019
- [9]
-
[10]
J. M. Ezquiaga and D. E. Holz, Spectral sirens: Cos- mology from the full mass distribution of compact binaries, Physical Review Letters129, 10.1103/phys- revlett.129.061102 (2022)
-
[11]
G. Pierra and A. Papadopoulos, Heavy black-holes also matter in standard siren cosmology (2026), arXiv:2601.03257 [astro-ph.CO]
-
[12]
D. F. Chernoff and L. S. Finn, Gravitational radiation, inspiraling binaries, and cosmology, Astrophys. J. Lett. 411, L5 (1993), arXiv:gr-qc/9304020
work page internal anchor Pith review Pith/arXiv arXiv 1993
-
[13]
W. Del Pozzo, Inference of cosmological parameters from gravitational waves: Applications to second generation interferometers, Phys. Rev. D86, 043011 (2012)
work page 2012
-
[14]
Soares-Santos, M.et al., First measurement of the hub- ble constant from a dark standard siren using the dark energy survey galaxies and the ligo/virgo binary–black- hole merger gw170814, The Astrophysical Journal Let- ters876, L7 (2019)
work page 2019
-
[15]
A. Palmeseet al., A statistical standard siren measure- ment of the hubble constant from the ligo/virgo gravita- tional wave compact object merger gw190814 and dark energy survey galaxies, The Astrophysical Journal Let- ters900, L33 (2020)
work page 2020
- [16]
-
[17]
S. Mastrogiovanni, K. Leyde, C. Karathanasis, E. Chassande-Mottin, D. A. Steer, J. Gair, A. Ghosh, R. Gray, S. Mukherjee, and S. Rinaldi, On the impor- tance of source population models for gravitational-wave cosmology, Phys. Rev. D104, 062009 (2021)
work page 2021
-
[18]
R. Gray, C. Messenger, and J. Veitch, A pixelated ap- proach to galaxy catalogue incompleteness: improving the dark siren measurement of the Hubble constant, Monthly Notices of the Royal Astronomical Society512, 1127 (2022), https://academic.oup.com/mnras/article- pdf/512/1/1127/45303118/stac366.pdf
work page 2022
- [20]
-
[21]
J. Aasiet al.(LIGO Scientific), Advanced LIGO, Class. Quant. Grav.32, 074001 (2015), arXiv:1411.4547 [gr-qc]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[22]
Advanced Virgo: a 2nd generation interferometric gravitational wave detector
F. Acerneseet al.(VIRGO), Advanced Virgo: a second- generation interferometric gravitational wave detector, Class. Quant. Grav.32, 024001 (2015), arXiv:1408.3978 [gr-qc]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[23]
T. Akutsuet al., Overview of kagra: Detector de- sign and construction history, Progress of Theo- retical and Experimental Physics2021, 05A101 (2020), https://academic.oup.com/ptep/article- pdf/2021/5/05A101/37974994/ptaa125.pdf
work page 2020
-
[24]
B. Abbottet al., Gwtc-1: A gravitational-wave transient catalog of compact binary mergers observed by ligo and virgo during the first and second observing runs, Physical Review X9, 10.1103/physrevx.9.031040 (2019)
-
[25]
R. Abbottet al., Gwtc-2: Compact binary coales- cences observed by ligo and virgo during the first half of the third observing run, Physical Review X11, 10.1103/physrevx.11.021053 (2021)
-
[26]
R. Abbottet al., Gwtc-3: Compact binary coales- cences observed by ligo and virgo during the second part of the third observing run, Physical Review X13, 10.1103/physrevx.13.041039 (2023)
-
[27]
Abac, A. G.et al., Gwtc-4.0: Updating the gravitational- wave transient catalog with observations from the first part of the fourth ligo-virgo-kagra observing run (2025), arXiv:2508.18082 [gr-qc]
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[28]
R. Gray, I. M. n. Hernandez, H. Qi, A. Sur, P. R. Brady, H.-Y. Chen, W. M. Farr, M. Fishbach, J. R. Gair, A. Ghosh, D. E. Holz, S. Mastrogiovanni, C. Messenger, D. A. Steer, and J. Veitch, Cosmological inference using gravitational wave standard sirens: A mock data analy- sis, Phys. Rev. D101, 122001 (2020)
work page 2020
-
[29]
R. Gray, F. Beirnaert, C. Karathanasis, B. Revenu, C. Turski, A. Chen, T. Baker, S. Vallejo, A. E. Romano, T. Ghosh, A. Ghosh, K. Leyde, S. Mastrogiovanni, and S. More, Joint cosmological and gravitational-wave pop- ulation inference using dark sirens and galaxy catalogues (2023), arXiv:2308.02281 [astro-ph.CO]
-
[30]
S. Mastrogiovanni, D. Laghi, R. Gray, G. C. Santoro, A. Ghosh, C. Karathanasis, K. Leyde, D. A. Steer, S. Per- ries, and G. Pierra, Joint population and cosmological properties inference with gravitational waves standard sirens and galaxy surveys, Phys. Rev. D108, 042002 (2023), arXiv:2305.10488 [astro-ph.CO]
-
[31]
S. Mastrogiovanni, G. Pierra, S. Perri` es, D. Laghi, G. C. Santoro, A. Ghosh, R. Gray, C. Karathanasis, and K. Leyde, Icarogw: A python package for inference of as- trophysical population properties of noisy, heterogeneous and incomplete observations (2023), arXiv:2305.17973 14 [astro-ph.CO]
-
[32]
M. Tagliazucchi, M. Moresco, N. Borghi, and M. Fiebig, Accelerating the standard siren method: Improved con- straints on modified gravitational-wave propagation with future data, Astronomy and Astrophysics702, A244 (2025)
work page 2025
-
[33]
Abac, A. G.et al.(LIGO Scientific, Virgo, and KAGRA Collaborations), GWTC-4.0: Constraints on the Cos- mic Expansion Rate and Modified Gravitational-wave Propagation, arXiv preprint arXiv:2509.04348 (2025), arXiv:2509.04348 [astro-ph.CO]
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[34]
I. Mandel, W. M. Farr, and J. R. Gair, Ex- tracting distribution parameters from multiple un- certain observations with selection biases, Monthly Notices of the Royal Astronomical Society486, 1086 (2019), https://academic.oup.com/mnras/article- pdf/486/1/1086/28390969/stz896.pdf
work page 2019
- [35]
-
[36]
M. Fishbach, D. E. Holz, and W. M. Farr, Does the black hole merger rate evolve with redshift?, The Astrophysical Journal Letters863, L41 (2018)
work page 2018
-
[37]
W. M. Farr, Accuracy requirements for empirically mea- sured selection functions, Research Notes of the AAS3, 66 (2019)
work page 2019
-
[38]
C. Talbot and J. Golomb, Growing pains: Understand- ing the impact of likelihood uncertainty on hierarchi- cal bayesian inference for gravitational-wave astronomy (2023), arXiv:2304.06138 [astro-ph.IM]
-
[39]
PyTorch: An Imperative Style, High-Performance Deep Learning Library
A. Paszke, S. Gross, F. Massa, A. Lerer, J. Brad- bury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. K¨ opf, E. Yang, Z. De- Vito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala, Pytorch: An impera- tive style, high-performance deep learning library (2019), arXiv:1912.01703 [cs.LG]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[40]
J. Nickolls, I. Buck, M. Garland, and K. Skadron, Scal- able parallel programming with cuda, Queue6, 40 (2008)
work page 2008
-
[41]
S. K. Lam, A. Pitrou, and S. Seibert, Numba: A LLVM- based Python JIT Compiler, inProc. Second Workshop on the LLVM Compiler Infrastructure in HPC(2015) pp. 1–6
work page 2015
-
[42]
Abbott, B. P.et al., Prospects for observing and local- izing gravitational-wave transients with advanced ligo, advanced virgo and kagra, Living Reviews in Relativity 23, 10.1007/s41114-020-00026-9 (2020)
-
[43]
Bilby: A user-friendly Bayesian inference library for gravitational-wave astronomy
G. Ashtonet al., BILBY: A user-friendly Bayesian infer- ence library for gravitational-wave astronomy, Astrophys. J. Suppl.241, 27 (2019), arXiv:1811.02042 [astro-ph.IM]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[44]
M. J. Williams, nessai: Nested sampling with artificial intelligence (2021)
work page 2021
- [45]
-
[46]
GWTC-4.0: Population Properties of Merging Compact Binaries
Abac, A. G.et al.(LIGO Scientific, Virgo, and KA- GRA Collaborations), Gwtc-4.0: Population properties of merging compact binaries (2025), arXiv:2508.18083 [astro-ph.HE]
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[47]
S. Kullback and R. A. Leibler, On Information and Suf- ficiency, The Annals of Mathematical Statistics22, 79 (1951)
work page 1951
-
[48]
D. W. Scott,Multivariate Density Estimation: Theory, Practice, and Visualization, Wiley Series in Probability and Statistics (Wiley, 2009)
work page 2009
-
[49]
B. W. Silverman,Density estimation for statistics and data analysis(1986)
work page 1986
-
[50]
Wilcox,Introduction to Robust Estimation and Hy- pothesis Testing
R. Wilcox,Introduction to Robust Estimation and Hy- pothesis Testing. 5th Edition. San Diego, CA: Academic Press(2021)
work page 2021
-
[51]
I. McMahon, D. Laghi, M. Soares-Santos, K. Ackley, G. D´ alya, Y. Gen¸ cel, D. S´ anchez-Cid, F. Andrade- Oliveira, S. MacBride, C. Chapman-Bird, R. Gray, and A. Papadopoulos, Measurement of the hubble constant using the dark energy survey year 6 gold galaxy cata- log and the fourth gravitational-wave transient catalog (2026), arXiv:2602.04766 [astro-ph.CO]
-
[52]
Abbot, R.et al.(LIGO Scientific, Virgo, and KAGRA Collaborations), Population properties of compact ob- jects from the second ligo–virgo gravitational-wave tran- sient catalog, The Astrophysical Journal Letters913, L7 (2021)
work page 2021
-
[53]
P. Madau and M. Dickinson, Cosmic star-formation his- tory, Annual Review of Astronomy and Astrophysics52, 415–486 (2014)
work page 2014
-
[54]
and others., Planck 2018 results: Vi
Aghanim, N. and others., Planck 2018 results: Vi. cos- mological parameters, Astronomy and Astrophysics641, A6 (2020)
work page 2018
-
[55]
R. Essicket al., Compact binary coalescence sensi- tivity estimates with injection campaigns during the LIGO-Virgo-KAGRA Collaborations’ fourth observing run, Phys. Rev. D112, 102001 (2025), arXiv:2508.10638 [gr-qc]
-
[56]
I. M. Romero-Shaw, C. Talbot, S. Biscoveanu, V. D’Emilio, G. Ashton, C. P. L. Berry, S. Cough- lin, S. Galaudage, C. Hoy, M. H¨ ubner, K. S. Phukon, M. Pitkin, M. Rizzo, N. Sarin, R. Smith, S. Stevenson, A. Vajpeyi, M. Ar` ene, K. Athar, S. Banagiri, N. Bose, M. Carney, K. Chatziioannou, J. A. Clark, M. Colleoni, R. Cotesta, B. Edelman, H. Estell´ es, C. ...
work page 2020
-
[57]
G. Ashton and C. Talbot, Bilby-mcmc: an mcmc sampler for gravitational-wave inference, Monthly Notices of the Royal Astronomical Society507, 2037 (2021), https://academic.oup.com/mnras/article- pdf/507/2/2037/44923607/stab2236.pdf
work page 2037
-
[58]
M. Men´ endez, J. Pardo, L. Pardo, and M. Pardo, The jensen-shannon divergence, Journal of the Franklin In- stitute334, 307 (1997)
work page 1997
-
[59]
odyssee-mure.eu/(2023), accessed: 2026-05-06
ODYSSEE-MURE, Odyssee database: Energy effi- ciency indicators and consumption trends,https://www. odyssee-mure.eu/(2023), accessed: 2026-05-06
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.