Cosmological Observational Tests in the JWST Era. II: The Tolman Test
Pith reviewed 2026-05-07 05:30 UTC · model grok-4.3
The pith
JWST data shows galaxy surface brightness declines with redshift but deviates from the (1 + z)^{-4} law of standard cosmology.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using 6860 galaxies with reliable spectroscopic redshifts from the ASTRODEEP-JWST photometric catalogue, the mean surface brightness is observed to decrease with increasing redshift, but the trend shows a significant departure from the decline proportional to (1 + z)^{-4} that the standard cosmological model requires.
What carries the argument
The Tolman test, which measures the redshift dependence of galaxy surface brightness to probe cosmological expansion.
If this is right
- The standard model's surface-brightness prediction is not supported by the JWST sample at the level of the observed trend.
- Either galaxy populations evolve in surface brightness or the cosmological dimming law requires adjustment.
- Larger JWST samples can tighten the measured power-law index of the brightness-redshift relation.
- The test supplies an independent observational constraint on expansion history that does not rely on supernova or CMB data.
Where Pith is reading between the lines
- If the departure survives better control for galaxy evolution, it would reopen discussion of whether surface-brightness selection in deep fields systematically favors intrinsically brighter objects at high redshift.
- Cross-checking the same galaxies in multiple JWST filters could separate wavelength-dependent evolutionary effects from purely cosmological dimming.
- The result invites comparison with hydrodynamic simulations that track surface-brightness evolution to see whether the observed trend can be reproduced without changing the background cosmology.
Load-bearing premise
Galaxies selected at different redshifts are intrinsically similar enough that measured surface brightness differences arise only from distance and expansion effects, not from evolutionary changes in the galaxies themselves or from selection biases.
What would settle it
Repeating the surface-brightness measurement on an independent high-redshift sample chosen with different selection criteria or wavelength coverage and checking whether the same departure from (1 + z)^{-4} appears.
Figures
read the original abstract
In this work, we investigate a classical cosmological test - the dependence of galaxy surface brightness on redshift z (the Tolman test). We analyzed 6 860 galaxies with reliably determined spectroscopic redshifts from the ASTRODEEP-JWST photometric catalogue. We find that (a) the mean surface brightness of galaxies indeed decreases with increasing distance, and (b) the observed trend shows a significant departure from the prediction of the standard cosmological model, which expects the mean surface brightness to decline as ~ (1 + z)^-4.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript analyzes surface brightness measurements for 6860 galaxies with spectroscopic redshifts drawn from the ASTRODEEP-JWST photometric catalog. It reports that mean surface brightness declines with redshift but deviates significantly from the (1 + z)^{-4} scaling predicted by the standard cosmological model under the Tolman test.
Significance. If the reported departure survives rigorous controls for galaxy evolution and selection effects, the result would challenge the standard model and motivate re-examination of high-redshift surface-brightness data. The large sample size is a potential strength, but the absence of machine-checked derivations, reproducible code, or falsifiable quantitative predictions in the current draft limits immediate impact.
major comments (3)
- [Observational results / Methods] The manuscript provides no description of the surface-brightness measurement procedure, k-corrections, aperture corrections, or error estimation (see the abstract and the section presenting the observational results). Without these details it is impossible to judge whether the claimed departure from (1 + z)^{-4} is supported by the data.
- [Sample selection and analysis] The central claim requires that galaxies at different redshifts are intrinsically comparable. The ASTRODEEP-JWST selection favors brighter, more compact, or star-forming systems at higher z; no luminosity-function matching, size-evolution corrections, or morphological cuts that hold intrinsic properties fixed are reported. This omission is load-bearing for the cosmological interpretation.
- [Results] The abstract states that the observed trend shows a 'significant departure' from the standard-model prediction, yet no quantitative test (e.g., fitted exponent with uncertainties, survival after evolutionary controls, or comparison to a null model) is supplied in the results section.
minor comments (2)
- [Abstract] The redshift range and exact definition of 'mean surface brightness' (e.g., rest-frame band, isophotal limit) should be stated explicitly in the abstract and introduction.
- [Figures] Figure captions should include the number of galaxies per redshift bin and the precise quantity plotted (observed vs. rest-frame surface brightness).
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which identify key areas where the manuscript requires clarification and additional analysis to strengthen the presentation of the Tolman test results. We address each major comment point by point below and will incorporate revisions accordingly.
read point-by-point responses
-
Referee: The manuscript provides no description of the surface-brightness measurement procedure, k-corrections, aperture corrections, or error estimation (see the abstract and the section presenting the observational results). Without these details it is impossible to judge whether the claimed departure from (1 + z)^{-4} is supported by the data.
Authors: We acknowledge that the current draft does not include a dedicated description of the surface-brightness measurement procedure, k-corrections, aperture corrections, or error estimation. The analysis draws surface-brightness values from the ASTRODEEP-JWST catalog, but explicit details on the computation (e.g., conversion from apparent magnitudes and angular sizes, application of k-corrections via SED fitting, aperture choices, and uncertainty propagation from catalog errors) are absent. In the revised manuscript we will add a Methods subsection providing these details, including the relevant formulas, references to the catalog processing papers, and a step-by-step outline of the calculations to enable reproducibility. revision: yes
-
Referee: The central claim requires that galaxies at different redshifts are intrinsically comparable. The ASTRODEEP-JWST selection favors brighter, more compact, or star-forming systems at higher z; no luminosity-function matching, size-evolution corrections, or morphological cuts that hold intrinsic properties fixed are reported. This omission is load-bearing for the cosmological interpretation.
Authors: This is a substantive concern for the cosmological interpretation of the Tolman test. Our sample of 6860 galaxies is defined by the availability of spectroscopic redshifts in the ASTRODEEP-JWST catalog and therefore inherits the catalog's selection function, which can favor brighter or more actively star-forming systems at higher redshift. No explicit luminosity-function matching, size-evolution corrections, or fixed morphological cuts were applied in the presented results. In the revision we will add a dedicated discussion of selection effects and perform additional analyses that restrict the sample to galaxies of comparable absolute magnitude or stellar mass (using catalog-derived quantities) and, where morphological information is available, to similar morphological types. The surface-brightness trend will be re-evaluated within these controlled subsamples and the results reported transparently, with the interpretation qualified if the departure from (1 + z)^{-4} depends on these controls. revision: yes
-
Referee: The abstract states that the observed trend shows a 'significant departure' from the standard-model prediction, yet no quantitative test (e.g., fitted exponent with uncertainties, survival after evolutionary controls, or comparison to a null model) is supplied in the results section.
Authors: We agree that the results section would be strengthened by explicit quantitative statistics. The current text describes the binned mean surface-brightness trend versus redshift and notes its visual departure from the (1 + z)^{-4} expectation. In the revised version we will add a quantitative analysis that includes (i) a power-law fit of the form mean surface brightness ∝ (1 + z)^α with best-fit α and uncertainty obtained via bootstrap resampling or equivalent, (ii) a statistical comparison (e.g., χ² or likelihood-ratio test) of the observed trend against the null hypothesis of α = −4, and (iii) an assessment of whether the departure persists after the subsample controls for luminosity and morphology described in the response to the second comment. These additions will be placed in the results section and summarized in the abstract if they alter the stated significance. revision: yes
Circularity Check
Direct observational comparison to external Tolman prediction; no internal derivation or fitted parameter reduces to the result
full rationale
The paper measures mean surface brightness from 6860 galaxies in the ASTRODEEP-JWST catalog across spectroscopic redshifts and compares the observed trend directly to the independent theoretical expectation of surface-brightness dimming ~ (1 + z)^{-4} from the standard Tolman test in FLRW cosmology. No equations derive a quantity from itself, no parameters are fitted to a subset and then called a prediction of the same data, and no self-citation supplies a load-bearing uniqueness theorem or ansatz. The central claim is a statistical test of data against an external benchmark; the assumption that galaxies remain intrinsically comparable is a validity concern, not a circularity in the derivation chain.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard cosmology predicts surface-brightness dimming proportional to (1 + z)^{-4}
Reference graph
Works this paper leans on
-
[1]
N. Aghanim, Y. Akrami, M. Ashdown, J. Aumont, C. Baccigalupi, M. Ballardini, A. J. Banday, R. Barreiro, N. Bartolo, S. Basak, et al. Planck 2018 results-vi. cosmological parameters. Astronomy & Astrophysics, 641: 0 A6, 2020
work page 2018
-
[2]
E. Di Valentino, O. Mena, S. Pan, L. Visinelli, W. Yang, A. Melchiorri, D. F. Mota, A. G. Riess, and J. Silk. In the realm of the hubble tension—a review of solutions. Classical and Quantum Gravity, 38 0 (15): 0 153001, 2021
work page 2021
-
[3]
A. D. Dolgov. Massive and supermassive black holes in the contemporary and early universe and problems in cosmology and astrophysics. Physics--Uspekhi, 61 0 (2): 0 115--132, 2018
work page 2018
-
[4]
M. Geller and P. Peebles. Test of the expanding universe postulate. Astrophysical Journal, vol. 174, p. 1, 174: 0 1, 1972
work page 1972
-
[5]
M. Kamionkowski and A. G. Riess. The hubble tension and early dark energy. Annual Review of Nuclear and Particle Science, 73 0 (1): 0 153--180, 2023
work page 2023
-
[6]
A. Leauthaud and A. Riess. Looking beyond lambda. Nature Astronomy, 9 0 (8): 0 1123--1128, 2025
work page 2025
-
[7]
L. M. Lubin and A. Sandage. The tolman surface brightness test for the reality of the expansion. ii. the effect of thepoint-spread function and galaxy ellipticityon the derived photometricparameters. The Astronomical Journal, 121 0 (5): 0 2289, 2001 a
work page 2001
-
[8]
L. M. Lubin and A. Sandage. The tolman surface brightness test for the reality of the expansion. iii. hubble spacetelescope profile and surface brightness data forearly-type galaxies inthree high-redshift clusters. The Astronomical Journal, 122 0 (3): 0 1071, 2001 b
work page 2001
-
[9]
L. M. Lubin and A. Sandage. The tolman surface brightness test for the reality of the expansion. iv. a measurement of the tolman signal and the luminosity evolution of early-type galaxies. The Astronomical Journal, 122 0 (3): 0 1084, 2001 c
work page 2001
-
[10]
F. Melia. The cosmic timeline implied by the jwst reionization crisis. Astronomy & Astrophysics, 689: 0 A10, 2024
work page 2024
-
[11]
E. Merlin, P. Santini, D. Paris, M. Castellano, A. Fontana, T. Treu, S. Finkelstein, J. Dunlop, P. A. Haro, M. Bagley, et al. Astrodeep-jwst: Nircam-hst multi-band photometry and redshifts for half a million sources in six extragalactic deep fields. Astronomy & Astrophysics, 691: 0 A240, 2024
work page 2024
-
[12]
J. B. Mu \ n oz, J. Mirocha, J. Chisholm, S. R. Furlanetto, and C. Mason. Reionization after jwst: a photon budget crisis? Monthly Notices of the Royal Astronomical Society: Letters, 535 0 (1): 0 L37--L43, 2024
work page 2024
-
[13]
V. Orlov and A. Raikov. Cosmological tests and the evolution of extragalactic objects. Astronomy Reports, 60: 0 477--485, 2016
work page 2016
- [14]
-
[15]
A. Sandage. Observational tests of world models. IN: Annual review of astronomy and astrophysics. Volume 26 (A89-14601 03-90). Palo Alto, CA, Annual Reviews, Inc., 1988, p. 561-630., 26: 0 561--630, 1988
work page 1988
-
[16]
A. Sandage. The tolman surface brightness test for the reality of the expansion. v. provenance of the test and a new representation of the data for three remote hubble space telescope galaxy clusters. The Astronomical Journal, 139 0 (2): 0 728, 2010
work page 2010
-
[17]
A. Sandage and L. M. Lubin. The tolman surface brightness test for the reality of the expansion. i. calibration of thenecessary local parameters. The Astronomical Journal, 121 0 (5): 0 2271, 2001
work page 2001
-
[18]
R. C. Tolman. On the estimation of distances in a curved universe with a non-static line element. Proceedings of the National Academy of Sciences, 16 0 (7): 0 511--520, 1930
work page 1930
- [19]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.