Recognition: 2 theorem links
· Lean TheoremMachine Learning Techniques for Astrophysics and Cosmology: Photometric Redshifts
Pith reviewed 2026-05-11 00:46 UTC · model grok-4.3
The pith
Discriminative machine learning for photometric redshifts has converged because limits now come from spectroscopic training data rather than from the choice of algorithm.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The diversity of discriminative AI methods applied to regress redshift from photometric observables has effectively converged. Progress is now limited not by the AI methodology but by the size and substantial systematic uncertainties and selection effects in spectroscopic training samples. In order to progress, either an unobtainable quantity and quality of training data or a more principled approach in using it is required, in the form of generative models for representing the distribution of intrinsic properties and outcomes of telescope observations of the galaxy population.
What carries the argument
Discriminative regression that maps photometric observables directly to redshift estimates, set against generative models that represent the underlying distribution of galaxy properties and observational effects.
If this is right
- Further tuning of existing regression methods will produce only application-specific improvements rather than broad advances in accuracy.
- Substantial progress in photometric redshift quality requires spectroscopic training samples that are larger and freer of systematic biases than those currently available.
- Integrating artificial intelligence into Bayesian generative modeling allows explicit representation of intrinsic galaxy properties and telescope observation effects.
- Generative approaches can capture selection effects and uncertainties in a way that purely discriminative regression cannot.
Where Pith is reading between the lines
- Hybrid models that combine physical galaxy evolution with machine learning components could reduce reliance on perfect training data.
- Upcoming imaging surveys would see reduced systematic errors in cosmological analyses if generative Bayesian methods replace current photo-z pipelines.
- Controlled tests on simulated catalogs with injected selection biases could quantify whether the proposed generative shift improves calibration.
Load-bearing premise
The surveyed variety of AI regression techniques is representative of what is possible, and further large gains from purely discriminative machine learning are not feasible without changes to the training data or the modeling framework.
What would settle it
A new discriminative machine learning algorithm that produces substantially more accurate photometric redshifts than current methods when tested on the same existing spectroscopic samples, without using extra data or generative components, would falsify the convergence claim.
Figures
read the original abstract
The cosmological redshift of a galaxy's light is inferable from its observable properties in images. Because imaging is much easier to acquire than spectroscopic observations that would allow the identification of distinct line features, this motivates the technique of photometric redshift estimation (photo-$z$). Photo-$z$ has been an early and sustained driver for the utilization of artificial intelligence (AI) in astrophysics, and conversely AI methods are underlying most of the recent advances in photo-$z$. Here we review the diversity of AI methods applied to the photo-$z$ problem over the years in a discriminative way, that is, to regress redshift from photometric observables. We argue that, besides optimization suiting specific applications, this approach has effectively converged. It is limited not by the AI methodology but by the size and substantial systematic uncertainties and selection effects in spectroscopic training samples. In order to progress, either an unobtainable quantity and quality of training data or a more principled approach in using it is required. We thus outline ongoing research of integrating AI in a Bayesian modeling of galaxy data. This comes in the form of generative models for representing the distribution of intrinsic properties and outcomes of telescope observations of the galaxy population.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. This review surveys the application of discriminative machine learning methods to photometric redshift (photo-z) estimation, arguing that these approaches have effectively converged and are now limited primarily by the size, systematics, and selection effects in spectroscopic training samples rather than by algorithmic innovation. It outlines the historical diversity of AI techniques for regressing redshift from photometric observables and advocates shifting toward generative models integrated with Bayesian galaxy modeling to enable further progress.
Significance. If the convergence assessment holds, the paper would usefully redirect community effort from new discriminative architectures toward data curation and generative frameworks, with potential impact on upcoming surveys (e.g., LSST, Euclid) where photo-z precision is critical. The review synthesizes a broad literature and correctly identifies training-data bottlenecks as a central constraint; however, the absence of quantitative meta-analysis reduces its immediate utility as a definitive benchmark.
major comments (2)
- [Abstract] Abstract and opening sections: the central claim that 'this approach has effectively converged' is asserted without a quantitative meta-analysis or trend plot of performance metrics (e.g., σ_NMAD, bias, or catastrophic outlier fraction) versus publication year or model class on fixed benchmark datasets such as SDSS, COSMOS, or DES. A qualitative survey alone cannot rule out incremental gains from architectural or training advances under current data constraints.
- [Generative models outline] Section on generative models (final paragraphs): the proposed integration of AI via generative models for galaxy populations and telescope observations is presented at a high level only, without concrete examples, likelihood formulations, or direct performance comparisons against the reviewed discriminative methods on the same datasets.
minor comments (2)
- Notation for photometric observables and redshift estimators should be standardized across the review to avoid reader confusion when comparing methods.
- A table summarizing key papers, their architectures, and reported metrics on common benchmarks would improve readability and support the convergence argument.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive review. We address each major comment below, providing our response and indicating the revisions we will make to the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract and opening sections: the central claim that 'this approach has effectively converged' is asserted without a quantitative meta-analysis or trend plot of performance metrics (e.g., σ_NMAD, bias, or catastrophic outlier fraction) versus publication year or model class on fixed benchmark datasets such as SDSS, COSMOS, or DES. A qualitative survey alone cannot rule out incremental gains from architectural or training advances under current data constraints.
Authors: We appreciate the referee's point that a quantitative meta-analysis would provide stronger support for the convergence claim. Our review is a synthesis of the existing literature rather than a new benchmark study, and performing a full meta-analysis would require reprocessing heterogeneous datasets and metrics under uniform conditions, which lies outside the scope of this work. However, we acknowledge that the current presentation relies on qualitative assessment. In the revised manuscript, we will expand the introduction and discussion sections to include a summary of representative performance trends drawn from key published studies on common benchmarks (e.g., SDSS and COSMOS), citing papers that report or plot metrics such as σ_NMAD and outlier fractions over time. This will offer additional evidence of saturation without claiming a comprehensive new meta-analysis. revision: partial
-
Referee: [Generative models outline] Section on generative models (final paragraphs): the proposed integration of AI via generative models for galaxy populations and telescope observations is presented at a high level only, without concrete examples, likelihood formulations, or direct performance comparisons against the reviewed discriminative methods on the same datasets.
Authors: We agree that the generative models section is presented at a high level, as it is intended to outline promising future directions rather than deliver a detailed technical review of that emerging area. To address this, we will revise the final section to incorporate specific examples from recent literature on generative approaches (such as variational autoencoders and normalizing flows applied to galaxy spectral energy distributions and observational effects). We will include brief descriptions of likelihood formulations where they appear in the cited works and note the current status of comparisons to discriminative methods, emphasizing that standardized benchmarks are still developing. This will provide more substance while preserving the review's focus on the shift in research direction. revision: yes
Circularity Check
Review synthesis with no derivations or self-referential reductions
full rationale
This paper is a literature review of AI methods for photometric redshift estimation. It presents no equations, derivations, fitted parameters, or predictions that could reduce to inputs by construction. The central claim that discriminative ML approaches have converged (limited by training data rather than methodology) is a qualitative synthesis of external work, not a self-contained derivation or self-citation chain. No load-bearing steps exist that match the enumerated circularity patterns, making the analysis self-contained as a survey.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We argue that, besides optimization suiting specific applications, this approach has effectively converged. It is limited not by the AI methodology but by the size and substantial systematic uncertainties and selection effects in spectroscopic training samples.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We thus outline ongoing research of integrating AI in a Bayesian modeling of galaxy data. This comes in the form of generative models for representing the distribution of intrinsic properties and outcomes of telescope observations of the galaxy population.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
ˇZ. Ivezi´c, S. M. Kahn, J. A. Tyson, B. Abel, E. Acosta, R. Allsman et al.,LSST: From Science Drivers to Reference Design and Anticipated Data Products,The Astrophysical Journal873(2019) 111 [0805.2366]
-
[2]
Euclid Collaboration, Y . Mellier, Abdurro’uf, J. A. Acevedo Barroso, A. Ach´ucarro, J. Adamek et al.,Euclid: I. Overview of the Euclid mission,Astronomy & Astrophysics697 (2025) A1 [2405.13491]
-
[3]
DESI Collaboration, B. Abareshi, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander et al., Overview of the Instrumentation for the Dark Energy Spectroscopic Instrument,The Astronomical Journal164(2022) 207 [2205.10939]
- [4]
-
[5]
M. Takada, R. S. Ellis, M. Chiba, J. E. Greene, H. Aihara, N. Arimoto et al.,Extragalactic science, cosmology, and Galactic archaeology with the Subaru Prime Focus Spectrograph, Publications of the Astronomical Society of Japan66(2014) R1 [1206.0737]
work page Pith review arXiv 2014
-
[6]
Report of the Dark Energy Task Force
A. Albrecht, G. Bernstein, R. Cahn, W. L. Freedman, J. Hewitt, W. Hu et al.,Report of the Dark Energy Task Force,arXiv e-prints(2006) astro [astro-ph/0609591]
work page Pith review arXiv 2006
- [7]
-
[8]
M. Salvato, O. Ilbert and B. Hoyle,The many flavours of photometric redshifts,Nature Astronomy3(2019) 212 [1805.12574]
-
[9]
R. Buchs, C. Davis, D. Gruen, J. DeRose, A. Alarcon, G. M. Bernstein et al.,Phenotypic redshifts with self-organizing maps: A novel method to characterize redshift distributions of source galaxies for weak lensing,Monthly Notices of the Royal Astronomical Society489 (2019) 820 [1901.05005]
- [10]
-
[11]
G. B. Dalton, M. Caldwell, A. K. Ward, M. S. Whalley, G. Woodhouse, R. L. Edeson et al., The VISTA infrared camera, inGround-based and Airborne Instrumentation for Astronomy (I. S. McLean and M. Iye, eds.), vol. 6269 ofSociety of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, p. 62690X, June, 2006, DOI. 22 Luca Tortorelli and Daniel Gr ¨un
2006
- [12]
-
[13]
Guzzoet al., A&A566, A108 (2014), arXiv:1303.2623 [astro-ph.CO]
L. Guzzo, M. Scodeggio, B. Garilli, B. R. Granett, A. Fritz, U. Abbas et al.,The VIMOS Public Extragalactic Redshift Survey (VIPERS). An unprecedented view of galaxies and large-scale structure at 0.5 ¡ z ¡ 1.2,Astronomy & Astrophysics566(2014) A108 [1303.2623]
-
[14]
S. P. Driver, S. Bellstedt, A. S. G. Robotham, I. K. Baldry, L. J. Davies, J. Liske et al., Galaxy And Mass Assembly (GAMA): Data Release 4 and the z ¡ 0.1 total and z ¡ 0.08 morphological galaxy stellar mass functions,Monthly Notices of the Royal Astronomical Society513(2022) 439 [2203.08539]
- [15]
-
[16]
M. Oguri,A cluster finding algorithm based on the multiband identification of red sequence galaxies,Monthly Notices of the Royal Astronomical Society444(2014) 147 [1407.4693]
-
[17]
F. Bellagamba, M. Roncarelli, M. Maturi and L. Moscardini,AMICO: optimized detection of galaxy clusters in photometric surveys,Monthly Notices of the Royal Astronomical Society473(2018) 5221 [1705.03029]
- [18]
-
[19]
Weak lensing for precision cosmology
R. Mandelbaum,Weak Lensing for Precision Cosmology,Annual Review of Astronomy and Astrophysics56(2018) 393 [1710.03235]
work page Pith review arXiv 2018
-
[20]
J. Prat and D. Bacon,Weak gravitational lensing, inEncyclopedia of Astrophysics, Volume 5, vol. 5, pp. 508–537, Jan., 2026,2501.07938, DOI
-
[21]
KiDS-1000 Cosmology: Cosmic shear constraints and comparison between two point statistics
M. Asgari, C.-A. Lin, B. Joachimi, B. Giblin, C. Heymans, H. Hildebrandt et al., KiDS-1000 cosmology: Cosmic shear constraints and comparison between two point statistics,Astronomy & Astrophysics645(2021) A104 [2007.15633]
-
[22]
and Sugiyama, Sunao and Zhang, Tianqing and Rau, Markus M
R. Dalal, X. Li, A. Nicola, J. Zuntz, M. A. Strauss, S. Sugiyama et al.,Hyper Suprime-Cam Year 3 results: Cosmology from cosmic shear power spectra,Physical Review D108(2023) 123519 [2304.00701]
- [23]
-
[25]
I. Pantos and L. Perivolaropoulos,Status of the S 8 tension: A 2026 review of probe discrepancies,Physics of the Dark Universe52(2026) 102286 [2602.12238]
-
[26]
Planck 2018 results. VI. Cosmological parameters
Planck Collaboration, N. Aghanim, Y . Akrami, M. Ashdown, J. Aumont, C. Baccigalupi et al.,Planck 2018 results. VI. Cosmological parameters,Astronomy & Astrophysics641 (2020) A6 [1807.06209]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[27]
A. Amon and G. Efstathiou,A non-linear solution to the S 8 tension?,Monthly Notices of the Royal Astronomical Society516(2022) 5355 [2206.11794]
- [28]
-
[29]
J. Choppin de Janvry, B. Dai, S. G. A. Gontcho, U. Seljak and T. Zhang,Cosmic Shear constraints from HSC Year 3 with clustering calibration of the tomographic redshift distributions from DESI,arXiv e-prints(2025) arXiv:2511.18134 [2511.18134]. 1 Photometric Redshifts 23
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[30]
L. Bigwood, J. McCullough, J. Siegel, A. Amon, G. Efstathiou, D. Sanchez-Cid et al., Confronting cosmic shear astrophysical uncertainties: DES Year 3 revisited,arXiv e-prints (2025) arXiv:2512.04209 [2512.04209]
-
[31]
2018, arXiv e-prints, arXiv:1809.01669
The LSST Dark Energy Science Collaboration, R. Mandelbaum, T. Eifler, R. Hlo ˇzek, T. Collett, E. Gawiser et al.,The LSST Dark Energy Science Collaboration (DESC) Science Requirements Document,arXiv e-prints(2018) arXiv:1809.01669 [1809.01669]
-
[32]
Euclid Definition Study Report
R. Laureijs, J. Amiaux, S. Arduini, J. L. Augu `eres, J. Brinchmann, R. Cole et al.,Euclid Definition Study Report,arXiv e-prints(2011) arXiv:1110.3193 [1110.3193]
work page internal anchor Pith review Pith/arXiv arXiv 2011
-
[33]
M. M. Rau, R. Dalal, T. Zhang, X. Li, A. J. Nishizawa, S. More et al.,Weak lensing tomographic redshift distribution inference for the Hyper Suprime-Cam Subaru Strategic Program three-year shape catalogue,Monthly Notices of the Royal Astronomical Society 524(2023) 5109 [2211.16516]
- [34]
- [35]
-
[36]
C. S ´anchez and G. M. Bernstein,Redshift inference from the combination of galaxy colours and clustering in a hierarchical Bayesian model,Monthly Notices of the Royal Astronomical Society483(2019) 2801 [1807.11873]
work page Pith review arXiv 2019
- [37]
- [38]
-
[39]
W. A. Baum,Photoelectric Magnitudes and Red-Shifts, inProblems of Extra-Galactic Research(G. C. McVittie, ed.), vol. 15 ofIAU Symposium, p. 390, Jan., 1962
1962
- [40]
- [41]
- [42]
-
[43]
Tagliaferri, G
R. Tagliaferri, G. Longo, S. Andreon, S. Capozziello, C. Donalek and G. Giordano,Neural Networks for Photometric Redshifts Evaluation, inLecture Notes in Computer Science, vol. 2859, pp. 226–234. 2003. DOI
2003
-
[44]
A. Collister, O. Lahav, C. Blake, R. Cannon, S. Croom, M. Drinkwater et al.,MegaZ-LRG: a photometric redshift catalogue of one million SDSS luminous red galaxies,Monthly Notices of the Royal Astronomical Society375(2007) 68 [astro-ph/0607630]
- [45]
- [46]
- [47]
- [48]
- [49]
-
[50]
D’Isanto,Uncertain Photometric Redshifts with Deep Learning Methods, in Astroinformatics(M
A. D’Isanto,Uncertain Photometric Redshifts with Deep Learning Methods, in Astroinformatics(M. Brescia, S. G. Djorgovski, E. D. Feigelson, G. Longo and S. Cavuoti, eds.), vol. 325 ofIAU Symposium, pp. 209–212, June, 2017,1703.01979, DOI
- [51]
- [52]
-
[53]
V . Amaro, S. Cavuoti, M. Brescia, C. Vellucci, C. Tortora and G. Longo,METAPHOR: Probability density estimation for machine learning based photometric redshifts, in Astroinformatics(M. Brescia, S. G. Djorgovski, E. D. Feigelson, G. Longo and S. Cavuoti, eds.), vol. 325 ofIAU Symposium, pp. 197–200, June, 2017,1703.02292, DOI
-
[54]
S. Cavuoti, V . Amaro, M. Brescia, C. Vellucci, C. Tortora and G. Longo,METAPHOR: a machine-learning-based method for the probability density estimation of photometric redshifts,Monthly Notices of the Royal Astronomical Society465(2017) 1959 [1611.02162]
- [55]
- [56]
- [57]
-
[58]
N. M. Ball, R. J. Brunner, A. D. Myers, N. E. Strand, S. L. Alberts and D. Tcheng,Robust Machine Learning Applied to Astronomical Data Sets. III. Probabilistic Photometric Redshifts for Galaxies and Quasars in the SDSS and GALEX,The Astrophysical Journal 683(2008) 12 [0804.3413]
- [59]
-
[60]
C. Bonnett, M. A. Troxel, W. Hartley, A. Amara, B. Leistedt, M. R. Becker et al.,Redshift distributions of galaxies in the Dark Energy Survey Science Verification shear catalogue and implications for weak lensing,Physical Review D94(2016) 042005 [1507.05909]
work page Pith review arXiv 2016
-
[61]
KiDS-450: Cosmological parameter constraints from tomographic weak gravitational lensing
H. Hildebrandt, M. Viola, C. Heymans, S. Joudaki, K. Kuijken, C. Blake et al.,KiDS-450: cosmological parameter constraints from tomographic weak gravitational lensing,Monthly Notices of the Royal Astronomical Society465(2017) 1454 [1606.05338]
work page Pith review arXiv 2017
- [62]
- [63]
-
[64]
DNF - Galaxy photometric redshift by Directional Neighbourhood Fitting
J. De Vicente, E. S ´anchez and I. Sevilla-Noarbe,DNF - Galaxy photometric redshift by Directional Neighbourhood Fitting,Monthly Notices of the Royal Astronomical Society459 (2016) 3078 [1511.07623]
work page Pith review arXiv 2016
-
[65]
A. Porredon, M. Crocce, P. Fosalba, J. Elvin-Poole, A. Carnero Rosell, R. Cawthon et al., Dark Energy Survey Year 3 results: Optimizing the lens sample in a combined galaxy clustering and galaxy-galaxy lensing analysis,Physical Review D103(2021) 043503 [2011.03411]. 1 Photometric Redshifts 25
-
[66]
N. Weaverdyck, M. Rodr ´ıguez-Monroy, J. Elvin-Poole, I. Sevilla-Noarbe, A. Porredon, S. Avila et al.,Dark Energy Survey Year 6 Results: MagLim++ Lens Sample Selection and Measurements of Galaxy Clustering,arXiv e-prints(2026) arXiv:2601.14484 [2601.14484]
-
[67]
S. Carliles, T. Budav ´ari, S. Heinis, C. Priebe and A. Szalay,Photometric Redshift Estimation on SDSS Data Using Random Forests, inAstronomical Data Analysis Software and Systems XVII(R. W. Argyle, P. S. Bunclark and J. R. Lewis, eds.), vol. 394 of Astronomical Society of the Pacific Conference Series, p. 521, Aug., 2008,0711.2477, DOI
- [68]
-
[69]
M. Carrasco Kind and R. J. Brunner,TPZ: photometric redshift PDFs and ancillary information by using prediction trees and random forests,Monthly Notices of the Royal Astronomical Society432(2013) 1483 [1303.7269]
- [70]
-
[71]
Y . Wadadekar,Estimating Photometric Redshifts Using Support Vector Machines,The Publications of the Astronomical Society of the Pacific117(2005) 79 [astro-ph/0412005]
-
[72]
E. Jones and J. Singal,Analysis of a custom support vector machine for photometric redshift estimation and the inclusion of galaxy shape information,Astronomy & Astrophysics600 (2017) A113 [1607.00044]
-
[73]
D. Masters, P. Capak, D. Stern, O. Ilbert, M. Salvato, S. Schmidt et al.,Mapping the Galaxy Color-Redshift Relation: Optimal Photometric Redshift Calibration Strategies for Cosmology Surveys,The Astrophysical Journal813(2015) 53 [1509.03318]
work page Pith review arXiv 2015
- [74]
- [75]
- [76]
-
[77]
I. Davidzon, K. Jegatheesan, O. Ilbert, S. de la Torre, S. K. Leslie, C. Laigle et al., COSMOS2020: Manifold learning to estimate physical parameters in large galaxy surveys, Astronomy & Astrophysics665(2022) A34 [2206.06373]
-
[78]
H. Teimoorinia, F. Archinuk, J. Woo, S. Shishehchi and A. F. L. Bluck,Mapping the Diversity of Galaxy Spectra with Deep Unsupervised Machine Learning,The Astronomical Journal163(2022) 71 [2112.03425]
- [79]
-
[80]
F. Abedini, G. Gozaliasl, A. H. Zonoozi, A. Kalantari, M. Korpi-Lagg, O. Ilbert et al., COSMOS-Web: Estimating Physical Parameters of Galaxies Using Self-Organizing Maps, Monthly Notices of the Royal Astronomical Society(2026) [2506.04138]
-
[81]
F. Ashmead, J. A. Newman, B. H. Andrews, R. Bezanson, B. Dey, D. C. Masters et al., Optimizing Photometric Redshift Training Sets I: Efficient Compression of the Galaxy Color-Redshift Relation with UMAP,arXiv e-prints(2025) arXiv:2512.09032 [2512.09032]. 26 Luca Tortorelli and Daniel Gr ¨un
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.