A Value-added Physical Properties Catalog for Low-redshift Galaxies from DESI Legacy Imaging Surveys DR10
Pith reviewed 2026-05-20 04:13 UTC · model grok-4.3
The pith
A multimodal deep learning model generates a catalog of star formation rates, stellar masses, and metallicities for roughly 547 million low-redshift galaxies in the DESI Legacy Imaging Surveys DR10.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that a multimodal neural network, using a ResNet convolutional component on images and a multilayer perceptron on catalog features, can be trained on the MPA-JHU DR8 catalog and then applied to DESI LS DR10 to deliver a value-added catalog of star formation rate, stellar mass, and oxygen abundance for approximately 547 million galaxies at z less than or equal to 0.5. Validation against independent catalogs and scaling relations confirms that the estimates capture the dominant astrophysical patterns required for ensemble analyses of the local galaxy population, even though the method is not designed for high-precision measurements of individual objects.
What carries the argument
Multimodal deep learning model that fuses a ResNet-based convolutional neural network extracting spatial features from multiband images with a multilayer perceptron processing photometric catalog data.
If this is right
- The catalog supplies a homogeneous resource for statistical studies of galaxy populations in the local universe.
- It enables direct comparisons between photometric surveys and ongoing or future spectroscopic programs.
- Dominant scaling relations such as the star-formation main sequence become accessible across hundreds of millions of objects.
- The approach demonstrates that large-scale property estimation can proceed without full spectroscopy for every galaxy.
Where Pith is reading between the lines
- Similar models could be retrained on deeper imaging to extend property estimates to higher redshifts.
- Merging this catalog with other large datasets could uncover population trends that smaller samples miss.
- The method suggests a practical route to reduce the spectroscopic follow-up needed for initial galaxy characterization in future surveys.
Load-bearing premise
The model trained on MPA-JHU DR8 spectroscopic data generalizes to DESI LS DR10 imaging and photometry without introducing large systematic biases or selection effects.
What would settle it
A comparison of the model's predicted star formation rates, stellar masses, and metallicities against new spectroscopic measurements for a statistically large sample of DESI LS DR10 galaxies would reveal whether the estimates match observed trends or show systematic offsets.
read the original abstract
Galaxy physical properties-such as star formation rate (SFR), stellar mass, and gas-phase metallicity-are essential for population studies and evolutionary analyses. Deriving these quantities for billions of galaxies in modern imaging surveys presents significant challenges due to limited spectroscopy and the computational costs associated with traditional spectral energy distribution fitting. As a result, many galaxies in large photometric surveys still lack homogeneous property estimates. This study introduces a multimodal deep learning model that integrates optical imaging with photometric catalog features to estimate SFR, stellar mass, and oxygen abundance in low-redshift galaxies. The model incorporates a ResNet-based convolutional neural network to extract spatial information from multiband images and a multilayer perceptron that processes catalog-level photometric features, leveraging complementary constraints from morphology, surface brightness, and broadband colors. Trained on reference measurements from the MPA-JHU DR8 catalog, the model is optimized for efficient large-scale estimation. When applied to the DESI Legacy Imaging Surveys (LS) DR10, the model generates a value-added catalog containing physical property estimates for approximately 547 million galaxies with redshifts z <= 0.5. Validation through comparisons with independent catalogs and exploration of key scaling relations demonstrates that while the derived properties are not intended for precision measurements of individual objects, they effectively capture the dominant astrophysical trends necessary for ensemble studies. This catalog represents the first homogeneous set of photometry-based SFR, stellar mass, and metallicity estimates for DESI LS DR10, providing a vital resource for statistical studies of galaxies in the local Universe and facilitating comparisons with current and future spectroscopic surveys.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a multimodal deep learning approach that combines a ResNet convolutional neural network applied to multiband galaxy images with a multilayer perceptron processing photometric catalog features to estimate star formation rate (SFR), stellar mass (M*), and gas-phase metallicity (12+log(O/H)) for low-redshift galaxies. The model is trained exclusively on reference values from the MPA-JHU DR8 catalog and applied to the DESI Legacy Imaging Surveys DR10 to produce a value-added catalog of these properties for approximately 547 million galaxies at z ≤ 0.5. Validation consists of comparisons against independent catalogs and checks against known scaling relations; the authors emphasize that the estimates are intended for ensemble statistical studies rather than precision measurements of individual objects.
Significance. If the generalization from the training catalog holds without substantial domain-shift biases, the resulting catalog would constitute a major resource for large-scale population studies of local galaxies, enabling homogeneous analyses across hundreds of millions of objects where spectroscopic coverage is sparse. The multimodal architecture efficiently exploits both morphological information from imaging and broadband colors, representing a scalable alternative to traditional SED fitting for future wide-field surveys.
major comments (2)
- [§4] §4 (Validation): The comparisons with independent catalogs are presented without quantified cross-survey residuals (e.g., mean bias, scatter, or Kolmogorov-Smirnov statistics) specifically for the subset of galaxies in the SDSS-DESI LS overlap region. This omission leaves the central claim that dominant astrophysical trends are preserved vulnerable to untested domain-shift effects arising from the grz versus ugriz filter sets and differing photometric pipelines.
- [§3.2] §3.2 (Model architecture and training): The description of input preparation for DESI LS DR10 images and photometry does not specify any explicit domain-adaptation steps or color transformations to compensate for the mismatch with the MPA-JHU DR8 training distribution (different filters, depth, and seeing). Without such calibration or ablation tests, the assumption that the learned features transfer without systematic offsets remains unverified and load-bearing for the catalog's utility.
minor comments (2)
- [Figure 2] Figure 2 and associated text: The scaling-relation panels would be clearer if the authors overlaid the 1:1 line or reference relations from the literature with explicit labels for the independent comparison samples.
- The manuscript would benefit from a short table summarizing the training/validation split sizes, hyperparameter choices, and achieved loss values on the held-out MPA-JHU test set.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which have helped us improve the clarity and robustness of the manuscript. We address each major comment below and have revised the text to incorporate additional validation metrics and preprocessing details.
read point-by-point responses
-
Referee: [§4] §4 (Validation): The comparisons with independent catalogs are presented without quantified cross-survey residuals (e.g., mean bias, scatter, or Kolmogorov-Smirnov statistics) specifically for the subset of galaxies in the SDSS-DESI LS overlap region. This omission leaves the central claim that dominant astrophysical trends are preserved vulnerable to untested domain-shift effects arising from the grz versus ugriz filter sets and differing photometric pipelines.
Authors: We agree that quantified cross-survey residuals for the overlap region would provide a more direct test of domain-shift effects. In the revised manuscript we have added these statistics to §4, including mean bias, scatter, and Kolmogorov-Smirnov tests restricted to galaxies with both MPA-JHU and DESI LS photometry. The new results show that the dominant scaling relations remain intact with residuals consistent with the overall validation sample, thereby strengthening the claim that the model generalizes across the filter and pipeline differences. revision: yes
-
Referee: [§3.2] §3.2 (Model architecture and training): The description of input preparation for DESI LS DR10 images and photometry does not specify any explicit domain-adaptation steps or color transformations to compensate for the mismatch with the MPA-JHU DR8 training distribution (different filters, depth, and seeing). Without such calibration or ablation tests, the assumption that the learned features transfer without systematic offsets remains unverified and load-bearing for the catalog's utility.
Authors: The original submission relied on data augmentations during training to improve robustness to depth and seeing variations, together with the multimodal combination of imaging and photometry to reduce sensitivity to filter differences. We acknowledge that an explicit discussion of these choices and any ablation tests was missing. We have therefore expanded §3.2 to describe the input normalization and preprocessing pipeline in detail and have added a brief ablation study in the appendix that quantifies the effect of omitting color transformations. These additions make the transfer assumptions more transparent while preserving the empirical validation already presented. revision: partial
Circularity Check
No circularity: supervised model trained on external catalog and applied to new survey data
full rationale
The paper trains a multimodal ResNet+MLP model exclusively on reference measurements from the external MPA-JHU DR8 catalog, then performs inference on DESI LS DR10 imaging and photometry for z ≤ 0.5 galaxies. This is a standard supervised learning pipeline whose outputs are learned predictions, not quantities defined in terms of themselves or statistically forced by fitting to the target data. No equations, self-citations, or uniqueness theorems are invoked in the provided text that would reduce the catalog generation to a tautology. Validation against independent catalogs is described as external checks rather than internal consistency tests. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- Neural network weights and hyperparameters
axioms (1)
- domain assumption MPA-JHU DR8 catalog provides accurate and representative ground-truth measurements for SFR, stellar mass, and oxygen abundance
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean, IndisputableMonolith/Cost/FunctionalEquation.leanreality_from_one_distinction, washburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
multimodal deep learning model that integrates optical imaging with photometric catalog features... ResNet-based convolutional neural network... multilayer perceptron... Trained on reference measurements from the MPA-JHU DR8 catalog
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
generates a value-added catalog containing physical property estimates for approximately 547 million galaxies
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Andika, I. T., Suyu, S. H., Cañameras, R., et al. 2023, A&A, 678, A103 Astropy Collaboration, Price-Whelan, A. M., Lim, P. L., et al. 2022, ApJ, 935, 167 Astropy Collaboration, Price-Whelan, A. M., Sipőcz, B. M., et al. 2018, AJ, 156, 123 Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, 558, A33
work page 2023
-
[2]
K., Glazebrook, K., Brinkmann, J., et al
Baldry, I. K., Glazebrook, K., Brinkmann, J., et al. 2004, ApJ, 600, 681
work page 2004
- [3]
- [4]
- [5]
-
[6]
Bonjean, V., Aghanim, N., Salomé, P., et al. 2019, A&A, 622, A137
work page 2019
-
[7]
Boquien, M., Burgarella, D., Roehlly, Y., et al. 2019, A&A, 622, A103
work page 2019
-
[8]
Brammer, G. B., van Dokkum, P. G., & Coppi, P. 2008, ApJ, 686, 1503
work page 2008
-
[9]
Brinchmann, J., Charlot, S., White, S. D. M., et al. 2004, MNRAS, 351, 1151
work page 2004
- [10]
- [11]
- [12]
-
[13]
Carliles, S., Budavári, T., Heinis, S., Priebe, C., & Szalay, A. 2008, ASPC, 394, 521
work page 2008
-
[14]
Carnall, A. C., Leja, J., Johnson, B. D., et al. 2019, ApJ, 873, 44
work page 2019
-
[15]
Chang, Y.-Y., van der Wel, A., da Cunha, E., & Rix, H.-W. 2015, ApJS, 219, 8
work page 2015
- [16]
-
[17]
2024, MNRAS, 528, 6354 Cid Fernandes, R., Mateus, A., Sodré, L., Stasińska, G., & Gomes, J
Chu, J., Tang, H., Xu, D., Lu, S., & Long, R. 2024, MNRAS, 528, 6354 Cid Fernandes, R., Mateus, A., Sodré, L., Stasińska, G., & Gomes, J. M. 2005, MNRAS, 358, 363
work page 2024
- [18]
-
[19]
The 2dF Galaxy Redshift Survey: Final Data Release
Colless, M., Peterson, B. A., Jackson, C., et al. 2003, arXiv:astro-ph/0306581
work page internal anchor Pith review Pith/arXiv arXiv 2003
-
[20]
Collister, A., Lahav, O., Blake, C., et al. 2007, MNRAS, 375, 68
work page 2007
-
[21]
Comparat, J., Maraston, C., Goddard, D., et al. 2017, arXiv:1711.06575
work page internal anchor Pith review Pith/arXiv arXiv 2017
- [22]
- [23]
-
[24]
Conselice, C. J. 2003, ApJS, 147, 1
work page 2003
-
[25]
Cool, R. J., Moustakas, J., Blanton, M. R., et al. 2013, ApJ, 767, 118
work page 2013
-
[26]
The DESI Experiment Part I: Science,Targeting, and Survey Design
Cui, X.-Q., Zhao, Y.-H., Chu, Y.-Q., et al. 2012, RAA, 12, 1197 Dark Energy Survey Collaboration, Abbott, T., Abdalla, F. B., et al. 2016, MNRAS, 460, 1270 Delli Veneri, M., Cavuoti, S., Brescia, M., Longo, G., & Riccio, G. 2019, MNRAS, 486, 1377 DESI Collaboration, Aghamousa, A., Aguilar, J., et al. 2016a, arXiv:1611.00036 DESI Collaboration, Aghamousa, ...
work page internal anchor Pith review Pith/arXiv arXiv 2012
- [27]
-
[28]
Dey, B., Andrews, B. H., Newman, J. A., et al. 2022, MNRAS, 515, 5285 D’Isanto, A., & Polsterer, K. L. 2018, A&A, 609, A111
work page 2022
- [29]
- [30]
-
[31]
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al. 2020, arXiv:2010.11929
work page internal anchor Pith review Pith/arXiv arXiv 2020
-
[32]
Drinkwater, M. J., Jurek, R. J., Blake, C., et al. 2010, MNRAS, 401, 1429
work page 2010
-
[33]
Drlica-Wagner, A., Carlin, J. L., Nidever, D. L., et al. 2021, ApJS, 256, 2
work page 2021
- [34]
-
[35]
Ellison, S. L., Mendel, J. T., Patton, D. R., & Scudder, J. M. 2013, MNRAS, 435, 3627
work page 2013
-
[36]
Ellison, S. L., Patton, D. R., Simard, L., & McConnachie, A. W. 2008, ApJL, 672, L107
work page 2008
-
[37]
Elmegreen, D. M., Elmegreen, B. G., Rubin, D. S., & Schaffer, M. A. 2005, ApJ, 631, 85 Euclid Collaboration, Aussel, B., Kruk, S., et al. 2024a, A&A, 689, A274 Euclid Collaboration, Bisigello, L., Conselice, C. J., et al. 2023, MNRAS, 520, 3529 Euclid Collaboration, Enia, A., Bolzonella, M., et al. 2024b, A&A, 691, A175 Euclid Collaboration, Humphrey, A.,...
-
[38]
Flaugher, B., Diehl, H. T., Honscheid, K., et al. 2015, AJ, 150, 150 Förster Schreiber, N. M., & Wuyts, S. 2020, ARA&A, 58, 661
work page 2015
-
[39]
Gai, M., Bove, M., Bonetta, G., Zago, D., & Cancelliere, R. 2024, MNRAS, 532, 1391
work page 2024
-
[40]
Gallazzi, A., Charlot, S., Brinchmann, J., White, S. D. M., & Tremonti, C. A. 2005, MNRAS, 362, 41
work page 2005
-
[41]
Garnett, D. R. 2002, ApJ, 581, 1019
work page 2002
- [42]
- [43]
- [44]
-
[45]
Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Natur, 585, 357
work page 2020
-
[46]
Deep residual learning for im- age recognition
He, K., Zhang, X., Ren, S., & Sun, J. 2016, in IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 1, doi:10.1109/CVPR.2016.90
-
[47]
Henghes, B., Thiyagalingam, J., Pettitt, C., Hey, T., & Lahav, O. 2022, MNRAS, 512, 1696
work page 2022
-
[48]
Hunter, J. D. 2007, CSE, 9, 90 Table 9 Comparison of Model Performance Using Spectroscopic (z spec ) and Photometric (z phot ) Redshifts as Input Features, Evaluated on the Same Subsample Property z spec z phot NRMSE σ Bias Outlier NMAD NRMSE σ Bias Outlier NMAD SFR 0.053 0.416 −0.032 0.012 0.338 0.070 0.357 −0.028 0.010 0.317 M � 0.019 0.106 −0.003 0.017...
work page 2007
- [49]
-
[50]
D., Leja, J., Conroy, C., & Speagle, J
Johnson, B. D., Leja, J., Conroy, C., & Speagle, J. S. 2021, ApJS, 254, 22
work page 2021
- [51]
-
[52]
Kalvankar, S., Pandit, H., Parwate, P., Patil, A., & Kamalapur, S. 2021, arXiv:2112.13865
-
[53]
Kashino, D., Silverman, J. D., Sanders, D., et al. 2019, ApJS, 241, 10
work page 2019
-
[54]
Kauffmann, G., Heckman, T. M., White, S. D. M., et al. 2003, MNRAS, 341, 33
work page 2003
-
[55]
Kewley, L. J., Dopita, M. A., Sutherland, R. S., Heisler, C. A., & Trevena, J. 2001, ApJ, 556, 121
work page 2001
- [56]
- [57]
-
[58]
Lang, D., Hogg, D. W., Mykytyn, D., et al. 2016, The Tractor: Probabilistic astronomical source detection and measurement, Astrophysics Source Code Library, ascl:1604.008
work page 2016
-
[59]
J., Duvet, L., Escudero Sanz, I., et al
Laureijs, R. J., Duvet, L., Escudero Sanz, I., et al. 2010, SPIE, 7731, 77311H
work page 2010
- [60]
-
[61]
F., Serrano, A., & Torres-Peimbert, S
Lequeux, J., Peimbert, M., Rayo, J. F., Serrano, A., & Torres-Peimbert, S. 1979, A&A, 80, 155
work page 1979
-
[62]
The DESI Experiment, a whitepaper for Snowmass 2013
Levi, M., Bebek, C., Beers, T., et al. 2013, arXiv:1308.0847
work page internal anchor Pith review Pith/arXiv arXiv 2013
-
[63]
Li, C. 2024, A photometric redshift catalogue of galaxies from the DESI legacy imaging surveys DR10, v1.0, National Astronomical Data Center of China, doi:10.12149/101486
- [64]
- [65]
- [66]
-
[67]
Li, P., Wang, L.-L., Yang, G.-J., Feng, J.-B., & Tang, Y.-K. 2025, A&A, 698, A222
work page 2025
- [68]
-
[69]
Lilly, S. J., Carollo, C. M., Pipino, A., Renzini, A., & Peng, Y. 2013, ApJ, 772, 119
work page 2013
-
[70]
J., Le Brun, V., Maier, C., et al
Lilly, S. J., Le Brun, V., Maier, C., et al. 2009, ApJS, 184, 218
work page 2009
- [71]
-
[72]
L., Zhao, Y.-H., Zhao, G., et al
Luo, A. L., Zhao, Y.-H., Zhao, G., et al. 2015, RAA, 15, 1095
work page 2015
- [73]
- [74]
-
[75]
Mannucci, F., Cresci, G., Maiolino, R., Marconi, A., & Gnerucci, A. 2010, MNRAS, 408, 2115
work page 2010
-
[76]
C., Fanson, J., Schiminovich, D., et al
Martin, D. C., Fanson, J., Schiminovich, D., et al. 2005, ApJL, 619, L1 Martínez-Solaeche, G., García-Benito, R., González Delgado, R. M., et al. 2024, A&A, 688, A160
work page 2005
- [77]
-
[78]
Michel-Dansac, L., Lambas, D. G., Alonso, M. S., & Tissera, P. 2008, MNRAS, 386, L82
work page 2008
-
[79]
Moustakas, J., Scholte, D., Dey, B., & Khederlarian, A. 2023, FastSpecFit: Fast spectral synthesis and emission-line fitting of DESI spectra, Astrophysics Source Code Library, ascl:2308.005
work page 2023
-
[80]
Mucesh, S., Hartley, W. G., Palmese, A., et al. 2021, MNRAS, 502, 2770
work page 2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.