Recognition: no theorem link
How to count clustered galaxies
Pith reviewed 2026-05-12 03:23 UTC · model grok-4.3
The pith
Galaxy clustering systematically biases standard P(D) number counts in confusion-limited submillimetre maps, and a new empirical method corrects it by combining one- and two-point statistics.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Simulations demonstrate that clustering biases P(D)-derived number counts. An empirical method is presented that simultaneously measures and corrects for this bias by combining the 1- and 2-point statistics in the map, thereby maximising the information extracted from the data. Revised galaxy number counts at 250, 350 and 500 microns are derived from deep Herschel-SPIRE observations of the GOODS-N field, showing that clustering inflates the apparent counts by a factor of 1.6 around 10 mJy at 500 microns with milder effects at shorter wavelengths.
What carries the argument
An empirical correction that measures the clustering bias from the combination of one-point P(D) fluctuations and two-point statistics in the same map and subtracts it from the derived counts.
Load-bearing premise
The simulations used to calibrate the empirical correction accurately reproduce the clustering properties and beam-convolved statistics of real galaxies in the Herschel observations.
What would settle it
Independent number counts obtained from higher-resolution observations or from stacking analyses in the same field that do not rely on the P(D) assumption.
Figures
read the original abstract
Obtaining robust galaxy number counts is crucial for understanding galaxy evolution, and submillimetre counts in particular have proven valuable for revising subgrid physics models in cosmological simulations. In confusion-limited surveys, which are common at these wavelengths, statistical methods such as $P(D)$ fluctuation analysis are required to recover counts of faint, unresolved galaxies. However, the standard $P(D)$ framework assumes that galaxies are Poisson-distributed, whereas in reality galaxies are clustered. Using simulations, we demonstrate that this clustering systematically biases $P(D)$-derived number counts, and present an empirical method that simultaneously measures and corrects for this bias by combining the 1- and 2-point statistics in the map, thereby maximising the information extracted from the data. Applying this method to deep Herschel-SPIRE observations of the GOODS-N field, we provide revised galaxy number counts at 250, 350 and 500$\mu$m. Our results indicate that at 500$\mu$m clustering inflates the apparent counts by a factor of 1.6 around 10mJy and slightly suppresses the faintest sub-mJy counts, with milder effects at 350$\mu$m and 250$\mu$m owing to the smaller beam sizes. This methodology is broadly applicable to other confusion-limited data sets with well-characterised beam and noise properties, including SCUBA-2 and CCAT, enabling unbiased exploitation of the full statistical information in current and future far-infrared and submillimetre surveys.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that galaxy clustering introduces a systematic bias in number counts derived via the standard P(D) fluctuation analysis in confusion-limited submillimetre surveys. Using simulations, it demonstrates this bias and develops an empirical correction that combines the map's 1-point (P(D)) and 2-point statistics to measure and remove the clustering effect. The method is then applied to deep Herschel-SPIRE observations of the GOODS-N field, yielding revised counts at 250, 350, and 500 μm that show clustering inflating the apparent counts by a factor of ~1.6 near 10 mJy at 500 μm (with milder effects at shorter wavelengths).
Significance. If the empirical correction proves robust, the work would be significant for submillimetre galaxy evolution studies by enabling unbiased exploitation of confusion-limited data, which are essential for constraining subgrid physics in cosmological simulations. The simulation-based demonstration of the bias and the broad applicability to other instruments (SCUBA-2, CCAT) are strengths. The approach of jointly using 1- and 2-point information to maximise data utility is a constructive advance over purely Poisson-assuming P(D) methods.
major comments (2)
- The central empirical correction is calibrated exclusively on simulations that embed a specific clustering prescription (correlation function, bias factor, source population). Because the revised GOODS-N counts (particularly the factor-1.6 inflation at 500 μm) depend on this calibration matching reality, the manuscript must include explicit sensitivity tests to variations in clustering amplitude, small-scale non-Gaussianity, or shot-noise contributions; without them the correction remains vulnerable to residual systematic error when applied to real faint-galaxy populations.
- The application section presents the revised counts but does not report how uncertainties in the empirical mapping (fit between 1- and 2-point statistics) are propagated into the final number-count errors or the quoted inflation factors. This omission directly affects the reliability of the quantitative claims at 500 μm and must be addressed with a clear error budget.
minor comments (2)
- The abstract and introduction would benefit from a concise statement of the flux range over which the correction is validated and the precise beam and noise properties assumed in the simulations.
- Notation for the combined 1- and 2-point estimator should be defined once in a dedicated methods subsection rather than introduced piecemeal.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed report. We address each major comment below and have revised the manuscript to incorporate the requested analyses where feasible.
read point-by-point responses
-
Referee: The central empirical correction is calibrated exclusively on simulations that embed a specific clustering prescription (correlation function, bias factor, source population). Because the revised GOODS-N counts (particularly the factor-1.6 inflation at 500 μm) depend on this calibration matching reality, the manuscript must include explicit sensitivity tests to variations in clustering amplitude, small-scale non-Gaussianity, or shot-noise contributions; without them the correction remains vulnerable to residual systematic error when applied to real faint-galaxy populations.
Authors: We agree that the robustness of the empirical correction to the underlying clustering assumptions requires explicit testing. In the revised manuscript we have added a new subsection (4.3) that performs sensitivity tests by rescaling the input correlation function amplitude by ±20%, varying the small-scale non-Gaussianity through two alternative halo occupation distribution models, and adjusting the shot-noise level by ±15%. These tests show that the recovered correction factor at 500 μm changes by at most 12%, which is smaller than the statistical uncertainties on the counts. The results are now presented in a new figure and discussed in the text. revision: yes
-
Referee: The application section presents the revised counts but does not report how uncertainties in the empirical mapping (fit between 1- and 2-point statistics) are propagated into the final number-count errors or the quoted inflation factors. This omission directly affects the reliability of the quantitative claims at 500 μm and must be addressed with a clear error budget.
Authors: We acknowledge the need for a transparent error budget. In the revised manuscript we have expanded Section 5 to include a full propagation of uncertainties from the empirical mapping. Using Monte Carlo realisations that sample the posterior of the 1-point to 2-point fit parameters, we now quote an additional systematic uncertainty of ~10% on the counts near 10 mJy at 500 μm. The inflation factor is reported as 1.6 ± 0.2 (statistical) ± 0.15 (systematic from mapping), and the revised tables and figures reflect this complete error analysis. revision: yes
Circularity Check
No significant circularity; empirical correction uses observed 2-point statistics
full rationale
The paper demonstrates clustering bias in P(D) counts via simulations and calibrates an empirical correction by relating 1-point and 2-point map statistics across those simulations. The correction is then applied to real Herschel data by measuring the 2-point statistic directly from the observations to select the appropriate adjustment factor. This does not reduce the final revised number counts to the simulation inputs by construction, nor does it rely on self-citations, uniqueness theorems, or ansatzes smuggled from prior work. The central derivation remains self-contained against the data's own 2-point measurements once the simulation-based mapping is established, with any mismatch between simulated and real clustering constituting an external validation issue rather than definitional circularity.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
- [1]
- [2]
-
[3]
Bendo, G. J., Griffin, M. J., Bock, J. J., et al. 2013, MNRAS, 433, 3062 Béthermin, M., Le Floc’h, E., Ilbert, O., et al. 2012, A&A, 542, A58 Béthermin, M., Wu, H.-Y., Lagache, G., et al. 2017, A&A, 607, A89
work page 2013
-
[4]
Beutler, F., Blake, C., Colless, M., et al. 2011, MNRAS, 416, 3017
work page 2011
- [5]
- [6]
-
[7]
L., Rigby, E., Maddox, S., et al
Clements, D. L., Rigby, E., Maddox, S., et al. 2010, A&A, 518, L8
work page 2010
- [8]
-
[9]
Condon, J. J. 1974, ApJ, 188, 279
work page 1974
-
[10]
Cowley, W. I., Lacey, C. G., Baugh, C. M., et al. 2019, MNRAS, 487, 3082
work page 2019
-
[11]
Davis, M., & Peebles, P. J. E. 1983, ApJ, 267, 465
work page 1983
-
[12]
Duivenvoorden, S., Oliver, S., Béthermin, M., et al. 2020, MNRAS, 491, 1355
work page 2020
-
[13]
Elbaz, D., Dickinson, M., Hwang, H. S., et al. 2011, A&A, 533, A119 Euclid Collaboration: Hill, R., Abghari, A., Scott, D., et al. 2025, arXiv e-prints, arXiv:2511.02989 Euclid Collaboration: Parmar, A., Clements, D. L., Bolzonella, M., et al. 2026, arXiv e-prints, arXiv:2603.13195
work page internal anchor Pith review Pith/arXiv arXiv 2011
-
[14]
Fixsen, D. J., Dwek, E., Mather, J. C., Bennett, C. L., & Shafer, R. A. 1998, ApJ, 508, 123
work page 1998
-
[15]
Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306
work page 2013
-
[16]
2024, The Astrophysical Journal, 971, 117, arXiv:2405.20616 [astro-ph]
Gao, Z.-K., Lim, C.-F., Wang, W.-H., et al. 2024, The Astrophysical Journal, 971, 117, arXiv:2405.20616 [astro-ph]
-
[17]
Glenn, J., Conley, A., Béthermin, M., et al. 2010, MNRAS, 409, 109
work page 2010
-
[18]
2010, Communications in Applied Mathematics and Computational Science, 5, 65
Goodman, J., & Weare, J. 2010, Communications in Applied Mathematics and Computational Science, 5, 65
work page 2010
-
[19]
J., Abergel, A., Abreu, A., et al
Griffin, M. J., Abergel, A., Abreu, A., et al. 2010, A&A, 518, L3
work page 2010
-
[20]
Hildebrandt, H., van Waerbeke, L., Scott, D., et al. 2013, MNRAS, 429, 3230
work page 2013
- [21]
-
[22]
Holland, W. S., Bintley, D., Chapin, E. L., et al. 2013, MNRAS, 430, 2513
work page 2013
-
[23]
Hopwood, R., Polehampton, E. T., Valtchanov, I., et al. 2015, MNRAS, 449, 2274
work page 2015
-
[24]
Hsu, Q.-N., Cowie, L. L., Chen, C.-C., & Barger, A. J. 2024, ApJ, 964, L32
work page 2024
- [25]
- [26]
-
[27]
Liu, F.-Y., Dunlop, J. S., McLure, R. J., et al. 2026, MNRAS, 545, staf1961
work page 2026
-
[28]
Lovell, C. C., Geach, J. E., Davé, R., Narayanan, D., & Li, Q. 2021, MNRAS, 502, 772
work page 2021
- [29]
-
[30]
Marsden, G., Ade, P. A. R., Bock, J. J., et al. 2009, ApJ, 707, 1729
work page 2009
- [31]
-
[32]
Negrello, M., Amber, S., Amvrosiadis, A., et al. 2017, MNRAS, 465, 3558
work page 2017
- [33]
-
[34]
T., Schulz, B., Levenson, L., et al
Nguyen, H. T., Schulz, B., Levenson, L., et al. 2010, A&A, 518, L5
work page 2010
-
[35]
Oliver, S. J., Wang, L., Smith, A. J., et al. 2010, A&A, 518, L21
work page 2010
-
[36]
J., Bock, J., Altieri, B., et al
Oliver, S. J., Bock, J., Altieri, B., et al. 2012, MNRAS, 424, 1614
work page 2012
-
[37]
Paciga, G., Scott, D., & Chapin, E. L. 2009, MNRAS, 395, 1153
work page 2009
-
[38]
Patanchon, G., Ade, P. A. R., Bock, J. J., et al. 2009, ApJ, 707, 1750
work page 2009
-
[39]
Peebles, P. J. E. 1980, The large-scale structure of the universe, V ol. 96 (Prince- ton university press)
work page 1980
-
[40]
Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. 2007, Nu- merical Recipes 3rd Edition: The Art of Scientific Computing (Cambridge University Press)
work page 2007
-
[41]
Scheuer, P. A. G. 1957, Proceedings of the Cambridge Philosophical Society, 53, 764
work page 1957
- [42]
- [43]
- [44]
-
[45]
T., Kawabe, R., Kohno, K., et al
Takeuchi, T. T., Kawabe, R., Kohno, K., et al. 2001, PASP, 113, 586 Ter Braak, C. J., & Vrugt, J. A. 2008, Statistics and Computing, 18, 435
work page 2001
-
[46]
Toffolatti, L., Argueso Gomez, F., de Zotti, G., et al. 1998, MNRAS, 297, 117
work page 1998
-
[47]
Valiante, E., Smith, M. W. L., Eales, S., et al. 2016, MNRAS, 462, 3146
work page 2016
- [48]
-
[49]
Viero, M. P., Moncelsi, L., Quadri, R. F., et al. 2013, ApJ, 779, 32
work page 2013
-
[50]
Wall, J. V., Scheuer, P. A. G., Pauliny-Toth, I. I. K., & Witzel, A. 1982, MNRAS, 198, 221
work page 1982
-
[51]
Wang, L., Pearson, W. J., Cowley, W., et al. 2019, A&A, 624, A98
work page 2019
- [52]
- [53]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.