Recognition: 2 theorem links
· Lean TheoremCosmology with one galaxy: An analytic formula relating Ω_{rm m} with galaxy properties
Pith reviewed 2026-05-16 06:04 UTC · model grok-4.3
The pith
A compact analytic formula recovers the cosmic matter density Ω_m from the properties of a single galaxy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Symbolic regression on the CAMELS hydrodynamical simulations yields a compact functional form that recovers Ω_m from intrinsic galaxy-scale observables. The expression admits a transparent interpretation in terms of baryonic retention and enrichment efficiency regulated by gravitational potential depth, and it generalizes across multiple simulation codes (IllustrisTNG, ASTRID, SIMBA, Swift-EAGLE) with only small recalibrations of coefficients.
What carries the argument
A symbolic-regression-derived analytic expression linking Ω_m to galaxy baryonic content and gravitational potential depth.
If this is right
- Cosmological parameters can be inferred from single galaxies without population statistics or clustering measurements.
- The relation holds across different hydrodynamical codes after limited recalibration of coefficients.
- It supplies a direct physical account of why Ω_m appears in local galaxy properties through baryon retention regulated by potential depth.
- The approach bypasses traditional large-scale structure probes and creates a new synergy between galaxy formation modeling and precision cosmology.
Where Pith is reading between the lines
- Observational catalogs from surveys such as SDSS or DESI could be used to test whether the simulated relation produces consistent Ω_m values.
- At high redshift the method might provide cosmological constraints where clustering measurements are observationally difficult.
- Discrepancies between the formula and real data could directly constrain the strength of feedback processes in galaxy formation models.
Load-bearing premise
The functional form found in the simulations correctly describes galaxies in the real universe rather than depending on the specific subgrid physics of the training runs.
What would settle it
Apply the formula to a real galaxy with well-measured stellar mass, gas fraction, and metallicity; the predicted Ω_m deviates substantially from the value measured independently by CMB or supernova surveys.
read the original abstract
Standard cosmological analyses typically treat galaxy formation and cosmological parameter inference as decoupled problems, relying on population-level statistics such as clustering, lensing, or halo abundances. However, classical studies of baryon fractions in massive galaxy clusters have long suggested that gravitationally bound systems may retain cosmological information through their baryonic content. Building on this insight, we present the first analytic and physically interpretable cosmological tracer that links the matter density parameter, $\Omega_m$, directly to intrinsic galaxy-scale observables, demonstrating that cosmological information can be extracted from individual galaxies. Using symbolic regression applied to state-of-the-art hydrodynamical simulations from the CAMELS project, we identify a compact functional form that robustly recovers $\Omega_m$ across multiple simulation suites (IllustrisTNG, ASTRID, SIMBA, and Swift-EAGLE), requiring only modest recalibration of a small number of coefficients. The resulting expression admits a transparent physical interpretation in terms of baryonic retention and enrichment efficiency regulated by gravitational potential depth, providing a clear explanation for why $\Omega_m$ is locally encoded in galaxy properties. Our work establishes a direct, interpretable bridge between small-scale galaxy physics and large-scale cosmology, opening a complementary pathway to cosmological inference that bypasses traditional clustering-based statistics and enables new synergies between galaxy formation theory and precision cosmology.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to derive, via symbolic regression on CAMELS hydrodynamical simulations, a compact analytic formula that directly relates the matter density parameter Ω_m to intrinsic galaxy-scale observables (e.g., stellar mass, metallicity, and star-formation rate). The authors report that this expression recovers Ω_m robustly across four simulation suites (IllustrisTNG, SIMBA, ASTRID, Swift-EAGLE) after only modest coefficient recalibration and admits a physical interpretation in terms of baryonic retention efficiency regulated by gravitational potential depth.
Significance. If the claimed relation proves robust to variations in subgrid physics and generalizes to observational data, the result would be significant: it would establish a direct, analytic bridge between single-galaxy properties and cosmology, bypassing traditional clustering or abundance statistics and enabling new synergies between galaxy-formation modeling and parameter inference. The analytic, interpretable form is a clear strength relative to black-box machine-learning approaches.
major comments (3)
- [§3.2] §3.2 (symbolic regression procedure): the functional form is obtained by fitting to simulation outputs in which Ω_m is an explicit input parameter; the reported 'recovery' therefore amounts to learning a mapping from the simulation's response to Ω_m rather than an independent derivation, and the manuscript must demonstrate that the expression encodes cosmological information beyond the specific subgrid implementations used in CAMELS.
- [§4.3] §4.3 (cross-suite validation): while modest recalibration of coefficients yields acceptable recovery across suites, the manuscript provides no quantitative assessment of residual scatter, systematic offsets, or sensitivity to the precise range of Ω_m varied in the training set; without these metrics it is unclear whether the formula remains predictive outside the narrow CAMELS prior.
- [§5.1] §5.1 (physical interpretation): the link to gravitational potential depth and baryonic retention is presented post-hoc; an a-priori test (e.g., controlled runs varying only the potential depth while holding subgrid parameters fixed) is needed to show that the relation is not an artifact of how the CAMELS feedback models respond to background density.
minor comments (2)
- [Abstract] The abstract asserts this is the 'first' such analytic tracer; a brief discussion of earlier analytic work on baryon fractions in clusters (e.g., references to Kravtsov et al. or related studies) would place the novelty claim in context.
- [Figure 3] Figure 3 (recovery plots) lacks error bars on the inferred Ω_m values and does not show the distribution of residuals as a function of galaxy mass or environment.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed report. We address each major comment below and have revised the manuscript to incorporate additional quantitative analysis and clarification where feasible.
read point-by-point responses
-
Referee: [§3.2] §3.2 (symbolic regression procedure): the functional form is obtained by fitting to simulation outputs in which Ω_m is an explicit input parameter; the reported 'recovery' therefore amounts to learning a mapping from the simulation's response to Ω_m rather than an independent derivation, and the manuscript must demonstrate that the expression encodes cosmological information beyond the specific subgrid implementations used in CAMELS.
Authors: We agree that the symbolic regression identifies a mapping learned from the simulation outputs in which Ω_m is varied as an input. The manuscript's central claim, however, rests on the fact that the identical functional form recovers Ω_m across four independent simulation suites that employ substantially different subgrid physics implementations. We have revised §3.2 to include an expanded discussion of this cross-suite robustness, emphasizing that the persistence of the same analytic expression (with only coefficient recalibration) across IllustrisTNG, SIMBA, ASTRID, and Swift-EAGLE indicates that the relation encodes cosmological dependence rather than being an artifact of any single subgrid model. revision: yes
-
Referee: [§4.3] §4.3 (cross-suite validation): while modest recalibration of coefficients yields acceptable recovery across suites, the manuscript provides no quantitative assessment of residual scatter, systematic offsets, or sensitivity to the precise range of Ω_m varied in the training set; without these metrics it is unclear whether the formula remains predictive outside the narrow CAMELS prior.
Authors: We acknowledge the absence of these quantitative metrics in the original submission. In the revised manuscript we have added a new table and accompanying text in §4.3 that report the root-mean-square error, mean bias, and scatter for each suite after recalibration, together with an explicit assessment of performance across the CAMELS Ω_m range (0.1–0.5). We also discuss the expected degradation for values outside this prior and note the limitations for extrapolation. revision: yes
-
Referee: [§5.1] §5.1 (physical interpretation): the link to gravitational potential depth and baryonic retention is presented post-hoc; an a-priori test (e.g., controlled runs varying only the potential depth while holding subgrid parameters fixed) is needed to show that the relation is not an artifact of how the CAMELS feedback models respond to background density.
Authors: The physical interpretation was developed after the functional form was identified. We have expanded §5.1 to provide additional supporting evidence drawn from correlations between the expression terms and halo properties already present in the CAMELS data, as well as references to the existing literature on baryonic retention. However, performing new controlled hydrodynamical runs that vary only gravitational potential depth while holding all subgrid parameters fixed lies outside the scope of the present study. revision: partial
- Request for new controlled hydrodynamical simulations varying only gravitational potential depth while holding subgrid parameters fixed
Circularity Check
Symbolic regression formula fitted to CAMELS inputs presented as independent analytic tracer
specific steps
-
fitted input called prediction
[Abstract]
"Using symbolic regression applied to state-of-the-art hydrodynamical simulations from the CAMELS project, we identify a compact functional form that robustly recovers Ω_m across multiple simulation suites (IllustrisTNG, ASTRID, SIMBA, and Swift-EAGLE), requiring only modest recalibration of a small number of coefficients."
The functional form is obtained by symbolic regression on simulation data where Ω_m is a known input; the reported 'recovery' of Ω_m from galaxy observables is therefore the output of the same fitting procedure applied to the training inputs rather than an independent prediction or derivation.
full rationale
The paper's derivation chain consists of applying symbolic regression to hydrodynamical simulation outputs (CAMELS suites) in which Ω_m is a controlled input parameter, then reporting that the resulting compact functional form 'recovers' Ω_m from galaxy properties. This matches the fitted_input_called_prediction pattern: the expression is discovered by optimizing against the known simulation inputs, so the reported recovery on the same or closely related data is a direct mapping by construction. Cross-suite consistency after coefficient recalibration is noted but does not remove the dependence on the original fitting process. No self-definitional equations, load-bearing self-citations, or imported uniqueness theorems appear in the provided text; the central claim remains a post-hoc fit rather than a first-principles derivation.
Axiom & Free-Parameter Ledger
free parameters (1)
- coefficients in the analytic formula
axioms (1)
- domain assumption Hydrodynamical simulations accurately capture the baryonic processes that encode cosmological information in galaxy properties
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We present the first analytic ... Ω_m directly to intrinsic galaxy-scale observables
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
ln[σ(Z⋆ Score / f⋆ k) + c] − a R_compact
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.