pith. sign in

arxiv: 2502.14164 · v3 · submitted 2025-02-20 · 🌌 astro-ph.CO

Implementing a Robust Test of Galaxy Catalogue Completeness for Dark Siren Measurements of the Hubble Constant

Pith reviewed 2026-05-23 03:09 UTC · model grok-4.3

classification 🌌 astro-ph.CO
keywords galaxy catalogue completenessdark sirensHubble constantgravitational wave cosmologymagnitude-redshift samplegwcosmo pipelineGLADE catalogue
0
0 comments X

The pith

A statistical test for galaxy catalogue completeness improves dark siren Hubble constant measurements without luminosity function assumptions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper applies a straightforward statistical test to find the apparent magnitude completeness limit in magnitude-redshift galaxy samples. This test is inserted into the gwcosmo pipeline used for dark siren cosmology to constrain the Hubble constant H0. The approach is less conservative than the method currently used in the pipeline and does not need prior knowledge of the luminosity function. For the GWTC-1 events with the GLADE catalogue the test yields a 1.3 percent improvement on the dark-siren-only H0 posterior and a 3.4 percent improvement when combined with GW170817; larger gains appear with GLADE+. The final posterior shows no change for the GWTC-3 dataset with GLADE+ K-band data because that catalogue supplies essentially no K-band coverage for those events.

Core claim

The robust statistical test determines the apparent magnitude completeness limit of a magnitude-redshift sample directly from the data and, when adopted inside gwcosmo, produces tighter and less biased H0 posteriors from dark sirens than the conservative completeness estimates previously employed.

What carries the argument

The robust statistical test for the apparent magnitude completeness limit of a magnitude-redshift sample, which replaces the existing conservative estimate inside the gwcosmo pipeline.

If this is right

  • H0 inference from dark sirens alone improves by 1.3 percent with GLADE and by 8.6 percent with GLADE+ for GWTC-1 events.
  • The combined dark-siren plus GW170817 posterior tightens by 3.4 percent with GLADE and by 6.3 percent with GLADE+.
  • No change occurs for GWTC-3 events with GLADE+ K-band photometry because the catalogue supplies negligible coverage in that band.
  • As deeper galaxy catalogues become available the adoption of less conservative completeness limits will grow in importance for gravitational-wave cosmology.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same test could be inserted into other dark-siren analysis codes that currently rely on fixed or luminosity-function-dependent completeness cuts.
  • If the test is applied to future catalogues that reach higher redshifts it may reduce the dominant systematic uncertainty in multi-messenger H0 measurements.
  • A direct test on mock catalogues with injected known completeness would quantify any residual bias introduced by the statistical procedure.

Load-bearing premise

The statistical test correctly identifies the true completeness limit and that feeding this limit into gwcosmo yields unbiased H0 improvements without creating new selection effects.

What would settle it

Re-running the GWTC-1 analysis with the test on a simulated catalogue whose completeness limit is known by construction and finding that the recovered H0 posterior is shifted or broadened relative to the known truth would falsify the claim.

Figures

Figures reproduced from arXiv: 2502.14164 by L. Datrier, M. Hendry.

Figure 1
Figure 1. Figure 1: Illustrating the 𝑆1 and 𝑆2 areas for a single pixel in the GLADE 𝐵-band. 𝑀 vs 𝜇 with 𝑆1 and 𝑆2 for a galaxy (𝑀𝑖 ,𝜇𝑖). The dash-dotted orange and blue lines show, respectively, the robust and median apparent magnitude thresholds for this sample of galaxies. The green line shows a test limit magnitude 𝑚∗ that is fainter than the true 𝑚𝑡ℎ𝑟 . 𝐸𝑖 = 1 2 and 𝑉𝑖 = 𝑛𝑖 − 1 12(𝑛𝑖 + 1) . (5) R01 defines the quantity 𝑇… view at source ↗
Figure 2
Figure 2. Figure 2: Final posterior on 𝐻0 using the GLADE 𝐵-band on the GWTC-1 dataset. Vertical dashed lines show the 1𝜎 intervals. The solid lines show results showing the robust method, while dash-dotted lines show results using 𝑚med as the apparent magnitude threshold. 20 40 60 80 100 120 140 H0 (km s 1 Mpc 1 ) 0.000 0.005 0.010 0.015 0.020 0.025 0.030 0.035 0.040 p(H 0) (k m 1 s M p c) Median, dark sirens Median with GW1… view at source ↗
Figure 3
Figure 3. Figure 3 [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 5
Figure 5. Figure 5: Results using the robust (blue) and median (orange) methods for individual events using the GLADE+ 𝐵-band with the GWTC-1 catalogue. The event GW170814 is analysed using the 𝑔-band of the DES-Y1 catalogue. 20 40 60 80 100 120 140 H0 (km s 1 Mpc 1 ) 0.00 0.01 0.02 0.03 0.04 0.05 0.06 p(H 0) (k m 1 s M p c) Median, dark sirens Median with GW170817 Robust, dark sirens Robust with GW170817 [PITH_FULL_IMAGE:fi… view at source ↗
Figure 6
Figure 6. Figure 6: Final posterior on 𝐻0 using the GLADE+ 𝐾-band on the GWTC-3 dataset. Vertical dashed lines show the 1𝜎 intervals. The solid lines show results showing the robust method, while dash-dotted lines show results using 𝑚med as the apparent magnitude threshold. further investigation of the validity of this assumption for other bands and other catalogues will be carried out in future work. Ongoing work ahead of th… view at source ↗
Figure 7
Figure 7. Figure 7: Inferred 𝐻0 using the robust (orange) and median (blue) methods for estimating galaxy catalogue completeness, for individual events in the GWTC-3 catalogue. The 𝐾-band of the GLADE+ catalogue is used for analysis. edge the use of the following python packages in this work: gwcosmo (Gray et al. 2020), Matplotlib (Hunter 2007), healpy (Górski et al. 2005; Zonca et al. 2019). DATA AVAILABILITY The GWTC-3 data… view at source ↗
read the original abstract

We present the application of a robust test of galaxy catalogue completeness to the gwcosmo pipeline. The method implements a straightforward statistical test for determining the apparent magnitude completeness limit of a magnitude-redshift sample. This offers an improved, less conservative approach compared with how galaxy catalogue completeness is currently estimated in the gwcosmo gravitational wave cosmology pipeline for determining the Hubble constant $H_{0}$. The test also does not require prior knowledge of the luminosity function, and thus returns a more robust estimate of the limiting apparent magnitude for a magnitude-redshift sample of galaxies. For GWTC-1 results using $B_{J}$-band photometry of galaxies in the GLADE catalogue, we find a $1.3\%$ improvement on the inference of $H_{0}$ using dark sirens only and a $3.4\%$ improvement for the combined posterior with GW170817. Using GLADE+, there is a $8.6\%$ improvement with dark sirens only and a $6.3\%$ improvement for the combined posterior with GW170817. However, the final posterior on $H_{0}$ using the GWTC-3 dataset with the GLADE+ $K$-band shows no improvement when applying the robust method. This is because the GLADE+ galaxy catalogue provides little or no coverage in the $K$-band for any of the GWTC-3 events. With the use of deeper galaxy catalogues in future gravitational wave cosmology analyses, the adoption of a less conservative estimate of magnitude completeness will become increasingly important.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper claims that a statistical test for the apparent magnitude completeness limit of magnitude-redshift galaxy samples can be integrated into the gwcosmo pipeline for dark-siren H0 inference. This yields a less conservative completeness estimate than the current gwcosmo prescription, requires no luminosity-function prior, and produces quantified improvements in H0 posterior precision (1.3–8.6 % for GWTC-1 events with GLADE/GLADE+; zero improvement for GWTC-3 K-band data due to catalog coverage).

Significance. If the test recovers an unbiased completeness limit and the resulting selection function does not introduce new systematics, the approach would allow more galaxies to enter dark-siren analyses, tightening H0 constraints as deeper catalogs become available. The manuscript supplies concrete percentage gains on real GWTC-1 events and identifies the catalog-coverage limitation for GWTC-3, which is a useful diagnostic.

major comments (2)
  1. [§4] §4 (Application to GWTC-1/GLADE): the reported 1.3–8.6 % precision gains rest on the assumption that the statistical test returns a completeness limit whose associated selection function leaves the H0 posterior unbiased; no Monte-Carlo injection study or comparison against catalogs with independently known completeness is shown to confirm this.
  2. [§5] §5 (GWTC-3 K-band results): the statement that zero improvement occurs solely because of catalog coverage is not accompanied by a demonstration that the test itself correctly identifies the (low) completeness limit or that altering the included galaxies does not shift the posterior in a biased direction.
minor comments (2)
  1. [Abstract, §2] The abstract and §2 should name the specific statistical test (e.g., the exact form of the magnitude-redshift completeness estimator) rather than referring only to “a robust test.”
  2. [Figure 3] Figure 3 (or equivalent) comparing completeness limits: the caption should state whether the plotted limit is the mode, median, or 95 % credible value of the test output.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading and for acknowledging the potential utility of the method. Below we respond point-by-point to the two major comments.

read point-by-point responses
  1. Referee: [§4] §4 (Application to GWTC-1/GLADE): the reported 1.3–8.6 % precision gains rest on the assumption that the statistical test returns a completeness limit whose associated selection function leaves the H0 posterior unbiased; no Monte-Carlo injection study or comparison against catalogs with independently known completeness is shown to confirm this.

    Authors: We agree that an explicit Monte-Carlo injection campaign on the full gwcosmo pipeline would provide the strongest possible confirmation that the selection function derived from the test does not bias the H0 posterior. The present manuscript does not contain such a study. The completeness test we implement follows the procedure validated in the original reference (which recovered input limits in simulated magnitude-redshift catalogs without luminosity-function assumptions). In this work we only compare the resulting H0 posteriors against the more conservative gwcosmo cut on the same real catalogs. We will revise §4 to include a concise discussion of the test’s prior validation and to state explicitly that a dedicated end-to-end injection study lies beyond the scope of the current paper. revision: partial

  2. Referee: [§5] §5 (GWTC-3 K-band results): the statement that zero improvement occurs solely because of catalog coverage is not accompanied by a demonstration that the test itself correctly identifies the (low) completeness limit or that altering the included galaxies does not shift the posterior in a biased direction.

    Authors: For the GWTC-3 events the test returns a very bright completeness limit because GLADE+ supplies essentially no K-band photometry at the relevant redshifts; consequently both the standard and the new prescriptions retain almost no galaxies and the posterior is unchanged. We will add a short table or figure in §5 that reports the completeness magnitude returned by the test, the number of galaxies retained under each prescription, and the resulting H0 credible intervals, thereby making the catalog-coverage explanation quantitative. A full bias-validation study for these particular data is not performed in the manuscript. revision: partial

Circularity Check

0 steps flagged

No significant circularity; external test applied to catalogs

full rationale

The paper applies an external statistical test for apparent magnitude completeness limits to existing galaxy catalogs (GLADE/GLADE+) and compares the resulting H0 posteriors against the standard gwcosmo prescription. Reported improvements (1.3–8.6%) are empirical outcomes from re-running the pipeline on GWTC-1/3 data with the new limits; no equations, parameters, or results reduce by construction to inputs defined within the paper. The test requires no luminosity function and is benchmarked directly on real catalogs, rendering the derivation self-contained against external data.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the validity of the statistical test for determining completeness limits and its integration into the gwcosmo pipeline without introducing bias.

axioms (1)
  • domain assumption The statistical test accurately determines the apparent magnitude completeness limit of a magnitude-redshift sample without prior knowledge of the luminosity function.
    This is presented as the key advantage over the current conservative method in gwcosmo.

pith-pipeline@v0.9.0 · 5811 in / 1421 out tokens · 40497 ms · 2026-05-23T03:09:53.450942+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

46 extracted references · 46 canonical work pages · 1 internal anchor

  1. [1]

    write newline

    " write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION new.sentence output.state after.block = 'skip output.state before.all = 'skip after.sentence 'output.state := if if FUNCTION not #0 #1 if FUNCTION and 'skip pop #0 if FUNCTION or pop #1...

  2. [2]

    write newline

    " write newline "" before.all 'output.state := FUNCTION fin.entry write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION new.sentence output.state after.block = 'skip output.state before.all = 'skip after.sentence 'output.state := if if FUNCTION not #0 #1 if FUNCTION and 'skip pop #0 if FUNCTION or pop #1...

  3. [3]

    P., Abbott, R., Abbott, T

    Abbott B. P., et al., 2017a, @doi [Physical Review Letters] 10.1103/PhysRevLett.119.161101 , http://adsabs.harvard.edu/abs/2017PhRvL.119p1101A 119, 161101

  4. [4]

    P., et al., 2017b, @doi [Nature] 10.1038/nature24471 , 551, 85

    Abbott B. P., et al., 2017b, @doi [Nature] 10.1038/nature24471 , 551, 85

  5. [5]

    Abbott T. M. C., et al., 2018, @doi [The Astrophysical Journal Supplement Series] 10.3847/1538-4365/aae9f0 , 239, 18

  6. [6]

    P., et al., 2019, @doi [Physical Review X] 10.1103/PhysRevX.9.031040 , https://ui.adsabs.harvard.edu/abs/2019PhRvX...9c1040A 9, 031040

    Abbott B. P., et al., 2019, @doi [Physical Review X] 10.1103/PhysRevX.9.031040 , https://ui.adsabs.harvard.edu/abs/2019PhRvX...9c1040A 9, 031040

  7. [7]

    P., et al., 2020a, @doi [The Astrophysical Journal] 10.3847/2041-8213/ab75f5 , 892, L3

    Abbott B. P., et al., 2020a, @doi [The Astrophysical Journal] 10.3847/2041-8213/ab75f5 , 892, L3

  8. [8]

    Abbott R., et al., 2020b, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/ab960f , 896, L44

  9. [9]

    P., et al., 2021a, @doi [The Astrophysical Journal] 10.3847/1538-4357/abdcb7 , 909, 218

    Abbott B. P., et al., 2021a, @doi [The Astrophysical Journal] 10.3847/1538-4357/abdcb7 , 909, 218

  10. [10]

    Abbott R., et al., 2021b, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/ac082e , 915, L5

  11. [11]

    Abbott R., et al., 2023a, @doi [Physical Review X] 10.1103/PhysRevX.13.041039 , https://ui.adsabs.harvard.edu/abs/2023PhRvX..13d1039A 13, 041039

  12. [12]

    Abbott R., et al., 2023b, @doi [The Astrophysical Journal] 10.3847/1538-4357/ac74bb , 949, 76

  13. [13]

    Afroz S., Mukherjee S., 2024, @doi [Mon. Not. Roy. Astron. Soc.] 10.1093/mnras/stae2139 , 534, 1283

  14. [14]

    E., 2018, @doi [Nature] 10.1038/s41586-018-0606-0 , 562, 545–547

    Chen H.-Y., Fishbach M., Holz D. E., 2018, @doi [Nature] 10.1038/s41586-018-0606-0 , 562, 545–547

  15. [15]

    F., Finn L

    Chernoff D. F., Finn L. S., 1993, @doi [ ] 10.1086/186898 , https://ui.adsabs.harvard.edu/abs/1993ApJ...411L...5C 411, L5

  16. [16]

    Dalang C., Baker T., 2024, @doi [JCAP] 10.1088/1475-7516/2024/02/024 , 02, 024

  17. [17]

    Del Pozzo W., 2012, @doi [Phys. Rev. D] 10.1103/PhysRevD.86.043011 , 86, 043011

  18. [18]

    Doctor Z., et al., 2019, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/ab08a3 , 873, L24

  19. [19]

    Drlica-Wagner A., et al., 2018, @doi [The Astrophysical Journal Supplement Series] 10.3847/1538-4365/aab4f5 , 235, 33

  20. [20]

    Dálya G., et al., 2018, @doi [Monthly Notices of the Royal Astronomical Society] 10.1093/mnras/sty1703 , 479, 2374

  21. [21]

    Dálya G., et al., 2022, @doi [Monthly Notices of the Royal Astronomical Society] 10.1093/mnras/stac1443 , 514, 1403

  22. [22]

    Efron B., Petrosian V., 1992, @doi [ ] 10.1086/171931 , https://ui.adsabs.harvard.edu/abs/1992ApJ...399..345E 399, 345

  23. [23]

    , keywords =

    Farr W. M., Fishbach M., Ye J., Holz D. E., 2019, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/ab4284 , 883, L42

  24. [24]

    Fishbach M., et al., 2019, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/aaf96e , 871, L13

  25. [25]

    HEALPix -- a Framework for High Resolution Discretization, and Fast Analysis of Data Distributed on the Sphere

    G \'o rski K. M., Hivon E., Banday A. J., Wandelt B. D., Hansen F. K., Reinecke M., Bartelmann M., 2005, @doi [ ] 10.1086/427976 , http://adsabs.harvard.edu/abs/2005ApJ...622..759G 622, 759

  26. [26]

    Gray R., et al., 2020, @doi [Physical Review D] 10.1103/physrevd.101.122001 , 101

  27. [27]

    Gray R., Messenger C., Veitch J., 2022, @doi [Monthly Notices of the Royal Astronomical Society] 10.1093/mnras/stac366 , 512, 1127

  28. [28]

    Gray R., et al., 2023, @doi [Journal of Cosmology and Astroparticle Physics] 10.1088/1475-7516/2023/12/023 , 2023, 023

  29. [29]

    D., 2007, Computing in science & engineering, 9, 90

    Hunter J. D., 2007, Computing in science & engineering, 9, 90

  30. [30]

    Johnston R., Teodoro L., Hendry M., 2007, @doi [ ] 10.1111/j.1365-2966.2007.11559.x , https://ui.adsabs.harvard.edu/abs/2007MNRAS.376.1757J 376, 1757

  31. [31]

    LIGO Scientific Collaboration and Virgo Collaboration and KAGRA Collaboration 2023, GWTC-3: Compact Binary Coalescences Observed by LIGO and Virgo During the Second Part of the Third Observing Run — Parameter estimation data release , @doi 10.5281/zenodo.8177023 , https://doi.org/10.5281/zenodo.8177023

  32. [32]

    L., Hogan C

    MacLeod C. L., Hogan C. J., 2008, @doi [Phys. Rev. D] 10.1103/PhysRevD.77.043512 , 77, 043512

  33. [33]

    Messenger C., Read J., 2012, @doi [Phys. Rev. Lett.] 10.1103/PhysRevLett.108.091101 , 108, 091101

  34. [34]

    Mo G., Haster C.-J., Katsavounidis E., 2024, On the use of galaxy catalogs in gravitational-wave parameter estimation ( @eprint arXiv 2410.14663 ), https://arxiv.org/abs/2410.14663

  35. [35]

    D., Nissanke S

    Mukherjee S., Wandelt B. D., Nissanke S. M., Silvestri A., 2021, @doi [Phys. Rev. D] 10.1103/PhysRevD.103.043520 , 103, 043520

  36. [36]

    D., Silk J., 2024, @doi [Astrophys

    Mukherjee S., Krolewski A., Wandelt B. D., Silk J., 2024, @doi [Astrophys. J.] 10.3847/1538-4357/ad7d90 , 975, 189

  37. [37]

    Nishizawa A., 2017, @doi [Phys. Rev. D] 10.1103/PhysRevD.96.101303 , 96, 101303

  38. [38]

    Palmese A., et al., 2020, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/abaeff , 900, L33

  39. [39]

    R., Mucesh S., Hartley W

    Palmese A., Bom C. R., Mucesh S., Hartley W. G., 2023, @doi [The Astrophysical Journal] 10.3847/1538-4357/aca6e3 , 943, 56

  40. [40]

    Planck Collaboration et al., 2020, @doi [Astronomy & Astrophysics] 10.1051/0004-6361/201833910 , 641, A6

  41. [41]

    Rauzy S., 2001, @doi [Monthly Notices of the Royal Astronomical Society] 10.1046/j.1365-8711.2001.04078.x , 324, 51–56

  42. [42]

    G., et al., 2022, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/ac5c5b , 934, L7

    Riess A. G., et al., 2022, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/ac5c5b , 934, L7

  43. [43]

    F., 1986, Nature, 323, 310

    Schutz B. F., 1986, Nature, 323, 310

  44. [44]

    Soares-Santos M., et al., 2019, @doi [The Astrophysical Journal Letters] 10.3847/2041-8213/ab14f1 , 876, L7

  45. [45]

    R., Gair J

    Taylor S. R., Gair J. R., 2012, @doi [Phys. Rev. D] 10.1103/PhysRevD.86.023502 , 86, 023502

  46. [46]

    Zonca A., Singer L., Lenz D., Reinecke M., Rosset C., Hivon E., Gorski K., 2019, @doi [Journal of Open Source Software] 10.21105/joss.01298 , 4, 1298