SE3D: Testing the recovery of stellar population, dust and structural properties on mock-observed toy model and simulated galaxies

Cheng Li; Junkai Zhang; Steven Ramnichal; Stijn Wuyts

arxiv: 2511.19614 · v2 · pith:ZSTNYECQnew · submitted 2025-11-24 · 🌌 astro-ph.GA · astro-ph.IM

SE3D: Testing the recovery of stellar population, dust and structural properties on mock-observed toy model and simulated galaxies

Junkai Zhang , Steven Ramnichal , Stijn Wuyts , Cheng Li This is my paper

Pith reviewed 2026-05-21 19:16 UTC · model grok-4.3

classification 🌌 astro-ph.GA astro-ph.IM

keywords galaxy modelingradiative transferstellar populationsdust attenuationmock observationsTNG simulationstar formation historiesresolved properties

0 comments

The pith

SE3D recovers stellar mass, dust mass, SFR and radial extents to within 0.1 dex from mock multi-wavelength galaxy data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces SE3D, a modelling framework built around a radiative transfer emulator designed to recover intrinsic galaxy properties while handling star-dust geometry and varying stellar populations. It performs recovery tests by jointly fitting mock photometric and structural observations of toy models and TNG50 simulated galaxies. The tests show encouraging accuracy at the 0.1 dex level for integrated stellar mass, dust mass, star formation rate and the radial extents of these quantities. Star formation history shape mismatches emerge as the dominant source of residuals, while radial and azimuthal structure and metallicity differences play smaller roles. The work also tracks how resolved stellar and dust properties evolve in the simulations from redshift 2 to 0 and compares diagnostic diagram placements to observations.

Core claim

SE3D is a radiative transfer emulator framework that jointly fits multi-wavelength photometric and structural mock observations to recover known intrinsic stellar population, dust and structural properties of galaxies. On both toy models and TNG50 galaxies it achieves recovery of bulk stellar mass, dust mass, SFR and their respective radial extents at the level of ≲ 0.1 dex. Mismatches in star formation history shapes contribute most to the residuals, with radial and azimuthal structure and stellar metallicity distributions playing progressively smaller roles. Different methods of assigning dust in the simulations produce limited Mdust/Mstar evolution and a narrower dynamic range across UVJ,

What carries the argument

The radiative transfer emulator inside SE3D, which models the spatially varying effects of dust on stellar light to enable joint fitting of photometry and structural data.

If this is right

Resolved radial profiles of stellar mass, dust and SFR can be extracted reliably enough for statistical studies of galaxy growth.
Star formation history shape is the dominant modelling uncertainty that must be addressed in future applications.
The limited dust-to-stellar mass evolution seen in TNG50 persists across different dust assignment methods and can be tested directly against observations.
Diagnostic diagrams such as UVJ and IRX-beta show narrower ranges in the simulations than in real data, pointing to specific mismatches that the framework can quantify.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the same accuracy holds on real multi-band imaging, SE3D could map how dust and stars are distributed inside galaxies at different cosmic epochs.
The framework offers a route to reduce the mismatch between simulated and observed placements on UVJ and IRX-beta diagrams by directly fitting the geometry.
Extending the emulator to include more flexible star-formation history libraries would likely shrink the dominant error source identified in the tests.

Load-bearing premise

That mismatches between the fitting model's star-formation histories and the true histories can be isolated without the emulator itself introducing systematic bias in complex star-dust geometries.

What would settle it

Applying SE3D to real galaxies and finding systematic offsets larger than 0.1 dex when compared against independent spectroscopic or dynamical mass measurements would challenge the claimed recovery performance.

Figures

Figures reproduced from arXiv: 2511.19614 by Cheng Li, Junkai Zhang, Steven Ramnichal, Stijn Wuyts.

**Figure 1.** Figure 1: Structural and SFH properties of an example TNG galaxy contrasted with the closest toy model approximation. The PDF of the radial and vertical stellar mass and dust distribution as well as the stellar age distribution are shown in blue for TNG, and in red for the associated toy model approximation. The distribution of stellar mass across the age-radius plane (bottom-middle panel) is contrasted to that for … view at source ↗

**Figure 2.** Figure 2: Top: Age gradient ∇agew versus the relative radial extent of dust and stellar distributions, 𝑅dust/𝑅star, of TNG50 galaxies at redshifts 𝑧 = 0, 1 and 2. Bottom: Geometric thickness of the dust versus stellar distribution. In each panel and for each redshift, the filled circle and cross mark the median and central 68th percentile of the distribution, respectively. what constitutes outer and inner stars (i.e… view at source ↗

**Figure 3.** Figure 3: 𝑈𝑉 𝐽 diagram for massive SFGs in TNG50 (left), for the full set of toy model galaxies in our SKIRT library (middle), and for a mass-complete sample of observed galaxies at 0.5 < 𝑧 < 2.5 (right). Contours depict the regions enclosing 20%, 60%, 90% and 99.9% of the data. The colours predicted for TNG galaxies get bluer with increasing redshift and do not extend into the red 𝑉 − 𝐽 regime. At first glance, it … view at source ↗

**Figure 4.** Figure 4: 𝑈𝑉 𝐽 diagram for our toy model library, colour-coded by the effective reddening, 𝐸 (𝐵 − 𝑉), and the effective visual attenuation, 𝐴𝑉 . Vectors of increasing reddening versus attenuation have different slopes in the 𝑈𝑉 𝐽 plane. mass log(𝑀star) > 9.5 without further consideration of their star formation activity (hence the prominent clump of objects in the quiescent wedge). They were extracted from the DAWN… view at source ↗

**Figure 5.** Figure 5: IRX - 𝛽 diagram for massive SFGs in TNG50 (left), for the full set of toy model galaxies in our SKIRT library (middle), and for a mass-limited sample of observed galaxies at 0.5 < 𝑧 < 2.5 (right). Contours depict the regions enclosing 20%, 60%, 90% and 99.9% of the data. On the right panel, we overlay the distribution of available 𝛽 from observations, whilst the IRX - 𝛽 contours are only shown for galaxies… view at source ↗

**Figure 6.** Figure 6: Recovered parameters from SE3D fitting contrasted with true intrinsic toy model parameters for the filter band combination ‘IRres’ and 0.05 dex uncertainty. We fit a sample of 600 toy model galaxies equally divided between three redshifts: z=0 (blue), z=1 (green), and z=2 (red). tually improves with increasing redshift, presumably because shorter rest-wavelengths are probed, sensitive to unobscured young s… view at source ↗

**Figure 7.** Figure 7: SE3D fitting on TNG galaxies with filter band combination ‘IRres’ and uncertainty 0.05 dex. TNG galaxies at redshift 𝑧 = 0, 1, 2 are presented in blue, green and red colours, respectively. dust components as presented in [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗

**Figure 8.** Figure 8: Recovery residuals R versus different aspects of “model mismatch”: ΔSFH, ΔMetal, ΔRad, and Δ𝜙. TNG galaxies at redshift 𝑧 = 0, 1, 2 are presented in blue, green and red colours, respectively. match due to the azimuthally smooth nature of the diffuse stellar and dust components of our toy models. We investigate the impact of model mismatch in two ways: first using the fitting results presented in Section 4.… view at source ↗

**Figure 9.** Figure 9: Tests on how recovery residuals R reduce when artificially simplifying TNG galaxies. Position along the x-axis denotes to what extent the TNG galaxies have been simplified to match the toy model description, from simplifying one of the characteristics shown in the colour legend (labelled as “TNG1”), to simplifying all four characteristics (“TNG4”), and ultimately a complete toy model galaxy description (“… view at source ↗

read the original abstract

The translation from direct observables to physical properties of galaxies is a key step in reconstructing their evolutionary histories. Star-dust geometry and inhomogeneous stellar populations can induce spatial variations in the mass-to-light ratio, complicating this process. In this paper, we present tests of SE3D, a novel modelling framework built around a radiative transfer emulator, aimed at tackling this problem. We test the ability to recover known intrinsic properties of toy model and TNG50 simulated galaxies by jointly fitting mock observations of their multi-wavelength photometric and structural properties. We find an encouraging performance ($\lesssim$ 0.1 dex) for several key characteristics, including the bulk stellar mass, dust mass and SFR, as well as their respective radial extents. We point out limitations, and investigate the impact of various sources of model mismatch. Among them, mismatch in the shapes of star formation histories contributes most, with radial and azimuthal structure and stellar metallicity distributions playing a progressively more minor role. We also analyse the evolution from z=2 to z=0 of resolved stellar and dust properties of TNG galaxies, as measured intrinsically and expressed in their distribution across UVJ and IRX-$\beta$ diagnostic diagrams. We test different methods to assign dust to the simulation, and find a persistent lack of Mdust/Mstar evolution and a more limited dynamic range across the diagnostic diagrams compared to observations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript introduces SE3D, a radiative transfer emulator-based framework for jointly fitting multi-wavelength photometric and structural mock observations to recover stellar population, dust, and structural properties of galaxies. Tests on toy models and TNG50 simulations report ≲0.1 dex recovery for bulk stellar mass, dust mass, SFR, and radial extents, with SFH shape mismatch identified as the dominant error source among various model mismatches; the work also analyzes redshift evolution of resolved properties in TNG galaxies and compares diagnostic diagrams to observations after testing dust assignment methods.

Significance. If the central recovery claims hold after addressing validation gaps, the results would be significant for quantifying biases in galaxy property inference arising from star-dust geometry and SFH complexity, aiding interpretation of resolved observations. The manuscript is strengthened by its direct recovery tests on known intrinsic values supplied by the toy models and TNG50 simulations, which supply external benchmarks.

major comments (1)

[§5] §5 (model-mismatch sources analysis): the attribution of the dominant error to SFH shape mismatch, which underpins the interpretation of the ≲0.1 dex bulk recovery performance, assumes the SE3D emulator produces unbiased fluxes and structural metrics even for inhomogeneous star-dust geometries; no direct comparison of emulator outputs against full Monte-Carlo radiative transfer on identical complex configurations is reported, leaving open the possibility that emulator systematics are absorbed into or misattributed from the quoted accuracy.

minor comments (2)

[Figure captions] Figure captions for the UVJ and IRX-β diagrams could more explicitly define the plotted quantities and error bars to improve readability for readers unfamiliar with the exact mock-observation pipeline.
[Abstract] The abstract states that radial and azimuthal structure play a minor role, but the corresponding quantitative breakdown (e.g., dex contributions) would benefit from a dedicated table or panel for clarity.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive feedback and recommendation for major revision. The central recovery claims are supported by direct tests against known intrinsic values from toy models and TNG50, but we acknowledge the need to strengthen the discussion of emulator validation in the model-mismatch analysis. We address the major comment point-by-point below and will revise the manuscript accordingly.

read point-by-point responses

Referee: §5 (model-mismatch sources analysis): the attribution of the dominant error to SFH shape mismatch, which underpins the interpretation of the ≲0.1 dex bulk recovery performance, assumes the SE3D emulator produces unbiased fluxes and structural metrics even for inhomogeneous star-dust geometries; no direct comparison of emulator outputs against full Monte-Carlo radiative transfer on identical complex configurations is reported, leaving open the possibility that emulator systematics are absorbed into or misattributed from the quoted accuracy.

Authors: We thank the referee for highlighting this important caveat. The SE3D emulator was trained and validated against full Monte Carlo radiative transfer calculations across a diverse set of configurations that include inhomogeneous star-dust geometries (see Methods section for details on training set construction and accuracy metrics). However, we did not report an explicit side-by-side comparison of emulator versus full RT outputs for the precise complex geometries drawn from the TNG50 galaxies analyzed in §5. This means that while SFH mismatch is the largest identified contributor in our tests, some fraction of the residual error could indeed arise from emulator approximations in those specific cases. We will revise §5 to explicitly discuss the scope of emulator validation, add a caveat on potential misattribution, and include a brief quantitative summary of emulator performance on inhomogeneous test cases. If space permits, we will also explore adding a supplementary figure comparing emulator and full RT for a representative TNG-like setup. revision: partial

Circularity Check

0 steps flagged

Recovery performance validated against external simulation benchmarks with no circularity

full rationale

The paper evaluates SE3D recovery accuracy by direct comparison of fitted stellar mass, dust mass, SFR and radial extents to the known intrinsic values supplied by independently constructed toy models and TNG50 hydrodynamical simulations. These benchmarks are external to the fitting procedure and emulator; the quoted ≲0.1 dex performance is therefore a measured difference rather than a quantity defined in terms of the model's own parameters or outputs. Error-source isolation (SFH mismatch dominant) is performed via controlled input variations in the test setups, not by re-deriving the same quantities from the fit. No self-definitional equations, fitted-input predictions, or load-bearing self-citations appear in the reported chain.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the radiative transfer emulator faithfully reproducing the effects of star-dust geometry and on the mock observations being sufficiently realistic representations of real galaxies; no new physical entities are postulated.

axioms (1)

domain assumption The radiative transfer emulator accurately approximates full radiative transfer for the tested galaxy models and geometries.
Invoked as the core of the SE3D modeling framework that enables the joint fitting of mock observations.

pith-pipeline@v0.9.0 · 5792 in / 1370 out tokens · 98603 ms · 2026-05-21T19:16:31.655332+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

SE3D is a novel modelling framework, built around a radiative transfer emulator... jointly fitting mock observations of their multi-wavelength photometric and structural properties... recovery accuracy... ≲0.1 dex for bulk stellar mass, dust mass and SFR, as well as their respective radial extents.
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We point out limitations, and investigate the impact of various sources of model mismatch. Among them, mismatch in the shapes of star formation histories contributes most...

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

1 extracted references · 1 canonical work pages

[1]

2014, MNRAS, 444, 1453 Euclid Collaboration, Aussel, H., Tereno, I., et al

AkinsH.B.,NarayananD.,WhitakerK.E.,DavéR.,LowerS.,BezansonR., Feldmann R., Kriek M., 2022, ApJ, 929, 94 Blain A. W., Smail I., Ivison R. J., Kneib J. P., 1999, MNRAS, 302, 632 Bruzual G., Charlot S., 2003, MNRAS, 344, 1000 Calzetti D., Kinney A. L., Storchi-Bergmann T., 1994, ApJ, 429, 582 Camps P., Baes M., 2020, Astronomy and Computing, 31, 100381 Camps...

work page arXiv 2022

[1] [1]

2014, MNRAS, 444, 1453 Euclid Collaboration, Aussel, H., Tereno, I., et al

AkinsH.B.,NarayananD.,WhitakerK.E.,DavéR.,LowerS.,BezansonR., Feldmann R., Kriek M., 2022, ApJ, 929, 94 Blain A. W., Smail I., Ivison R. J., Kneib J. P., 1999, MNRAS, 302, 632 Bruzual G., Charlot S., 2003, MNRAS, 344, 1000 Calzetti D., Kinney A. L., Storchi-Bergmann T., 1994, ApJ, 429, 582 Camps P., Baes M., 2020, Astronomy and Computing, 31, 100381 Camps...

work page arXiv 2022