Differentiable Forward Modeling for Efficient and Accurate Shear Inference

Axel Guinot; Camille Avestruz; Eleni Tsaprazi; Ismael Mendoza; Jean-Eric Campagne; Matthew R. Becker; Michael Schneider; Natalia Porqueres; The LSST Dark Energy Science Collaboration

arxiv: 2604.22048 · v2 · submitted 2026-04-23 · 🌌 astro-ph.IM · astro-ph.CO

Differentiable Forward Modeling for Efficient and Accurate Shear Inference

Ismael Mendoza , Axel Guinot , Matthew R. Becker , Camille Avestruz , Jean-Eric Campagne , Natalia Porqueres , Michael Schneider , Eleni Tsaprazi

show 1 more author

The LSST Dark Energy Science Collaboration

This is my paper

Pith reviewed 2026-05-08 13:39 UTC · model grok-4.3

classification 🌌 astro-ph.IM astro-ph.CO

keywords cosmic shearshear inferencedifferentiable forward modelingBayesian inferencemultiplicative biasLSSTGPU accelerationnoise bias

0 comments

The pith

Differentiable forward modeling infers cosmic shear with multiplicative bias below 0.0013 without external calibration.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper implements a Bayesian shear inference method that uses differentiable models to automatically propagate pixel noise into the final shear estimate. This removes the requirement for separate calibration to address noise bias. Tests on simulated images of isolated exponential galaxies with LSST-like noise levels show the absolute multiplicative bias stays below 0.9 times 10 to the minus three when galaxy property distributions are known and below 1.3 times 10 to the minus three when those distributions are inferred along with the shear. These results fall inside the LSST accuracy target of less than 2 times 10 to the minus three. The implementation leverages gradient-based samplers and GPUs to fit each galaxy in 0.45 seconds.

Core claim

The central claim is that a differentiable forward modeling approach to Bayesian shear inference, when the PSF and sky are known, automatically handles noise bias and achieves an absolute multiplicative bias |m| below 0.9 × 10^{-3} at 3σ for known galaxy property distributions and below 1.3 × 10^{-3} when inferred jointly, meeting LSST requirements in simulations of isolated exponential galaxies, while enabling efficient MCMC sampling on GPUs at 0.45 seconds per galaxy for 300 effective samples.

What carries the argument

Differentiable forward models of galaxies used within a gradient-based Markov chain Monte Carlo sampler to infer shear while accounting for pixel noise.

If this is right

The shear estimate requires no external calibration for noise bias.
The method remains accurate when galaxy property distributions are inferred simultaneously with shear.
GPU acceleration makes processing billions of galaxies computationally feasible for large surveys.
The bias performance satisfies the requirements for Stage-IV dark energy surveys like LSST.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Extending the framework to include effects like galaxy blending and detection would allow testing on more realistic simulated data.
Applying the method to actual survey images could validate its performance beyond the isolated galaxy assumption.
The use of differentiable models suggests potential for end-to-end differentiable cosmological inference pipelines.
Further optimization of the sampling could reduce computation time even more for full survey applications.

Load-bearing premise

The point spread function and sky background are known exactly and the validation uses only isolated exponential galaxies with either known or jointly inferred intrinsic property distributions.

What would settle it

A simulation study that includes blended galaxies, complex morphologies, detection and selection effects, followed by checking if the recovered |m| exceeds 2 × 10^{-3} at 3 sigma, would directly test whether the bias control holds for realistic conditions.

Figures

Figures reproduced from arXiv: 2604.22048 by Axel Guinot, Camille Avestruz, Eleni Tsaprazi, Ismael Mendoza, Jean-Eric Campagne, Matthew R. Becker, Michael Schneider, Natalia Porqueres, The LSST Dark Energy Science Collaboration.

**Figure 1.** Figure 1: — Distribution of intrinsic galaxy properties. In this figure we show distribution of intrinsic galaxy properties of our exponential galaxy dataset used in all our experiments. The mean and sigma of each distribution are shown in the title of each distribution, and the median is shown as a dashed black line and in the legend. For more details on this figure see Section 2. the likelihood P(d|g) and obtain a… view at source ↗

**Figure 2.** Figure 2: — Probabilistic graphical model representing our forward model for shear inference. A constant shear g is applied to every galaxy with intrinsic parameters ωn with corresponding pixels dn. The intrinsic properties of each galaxy are sampled from distributions characterized by hyperparameters αn, which includes the distribution of intrinsic ellipticities. The PSF πn and noise covariance Σn for each galaxy i… view at source ↗

**Figure 3.** Figure 3: — Efficiency of sampling galaxy properties. We show the time to obtain a specific number of effective samples per galaxy, for a given number of galaxies (blue to red gradient in lines) being sampled in parallel in a single A100 GPU. Producing zero effective samples corresponds to the time taken to warm up the galaxy chains, which takes a non-negligible amount of time. See Section 4.1 for more discussion of… view at source ↗

**Figure 4.** Figure 4: — Comparison of shear posteriors. In this plot we compare the inferred shear posterior 1σ and 2σ contours obtained from our method applied to our dataset of 320k isolated exponential galaxies with g1 = 0.02 and g2 = 0 true shear applied. The blue contours is the shear posterior inferred in setting all-fixed: the true prior of intrinsic galaxy properties is fixed and known, and only the shear is inferred. … view at source ↗

**Figure 5.** Figure 5: — Posteriors of the hyperparameters of distributions of intrinsic galaxy properties. In this contour plot, we show 1σ and 2σ contours of the posterior of hyperparameters of galaxy properties’ distributions based on 3000 samples. Specifically, σε corresponds to the scatter of the intrinsic ellipticity magnitude distribution (Equation 3), µf and σf are the mean and sigma of the lognormal distribution of flux… view at source ↗

**Figure 6.** Figure 6: — Shear Posterior Calibration on Toy Ellipticities. We present a coverage plot which we use to asses the calibration of the shear posteriors produced by our inference methodology (Section 3) in the context of a simplified ellipticity-only dataset. We first produce 1000 shear posteriors by running our inference procedure on a 1000 separate noisy ellipticity datasets each with a different shear applied. For … view at source ↗

**Figure 7.** Figure 7: — Multiplicative bias as a function of Spergel index. On the right of this figure, we plot the multiplicative bias m calculated when using a Spergel profile with different (fixed) indices ν as the forward model in the all-fixed case, instead of the correct Exponential profile. The red dashed line is multiplicative bias obtained when using a Gaussian profile forward model. On the left, we plot the (vertical… view at source ↗

read the original abstract

Forthcoming Stage-IV dark energy optical surveys, such as LSST, have the ambitious goal of measuring cosmological parameters at sub-percent precision. Realizing their full scientific potential requires very precise measurement of the cosmic shear signal and control of corresponding systematics. In this work, we present a modern implementation of the Bayesian shear inference framework in Schneider et al. (2014), in the case that the PSF and sky background are known. This framework automatically propagates the pixel-noise measurement error from each galaxy into the final shear estimate, and thus requires no external calibration to handle noise bias. As a first application of this new implementation, we infer the cosmic shear posterior from simulated images consisting of isolated exponential galaxies with LSST-like levels of shape and pixel noise. In this simplified scenario, we estimate the absolute multiplicative bias $|m|$ of our approach to be below $0.9 \times 10^{-3} \, [3\sigma]$ when the intrinsic distribution of galaxy properties is known, and below $1.3 \times 10^{-3}\, [3\sigma]$ when these distributions are inferred alongside shear. Both results are within the LSST requirement of $|m| < 2 \times10^{-3}$. Additionally, we make progress towards the algorithm's computational feasibility in the context of modern wide-field surveys, where billions of galaxies must be processed, by leveraging differentiable forward models of galaxies, gradient-based samplers, and GPUs. Our final galaxy-fitting MCMC produces $300$ effective samples of galaxy properties in $0.45$ seconds per galaxy using a single A100 GPU. In the future, we seek to generalize our algorithm to handle selection, detection, and model shear biases so it can be applied to real survey data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 3 minor

Summary. The manuscript presents a differentiable forward-modeling implementation of the Bayesian shear inference framework from Schneider et al. (2014), restricted to the case of known PSF and sky background. It employs gradient-based MCMC sampling on GPUs to jointly sample galaxy properties and shear while automatically propagating pixel-noise uncertainties. On simulated images of isolated exponential galaxies with LSST-like shape and pixel noise, the method reports absolute multiplicative bias |m| below 0.9 × 10^{-3} (3σ) when intrinsic property distributions are known a priori and below 1.3 × 10^{-3} (3σ) when those distributions are inferred jointly with shear; both values satisfy the LSST requirement |m| < 2 × 10^{-3}. Per-galaxy runtime is stated as 0.45 s for 300 effective samples on a single A100 GPU. The authors explicitly scope the result to this simplified matched-model regime and flag generalization to selection, detection, blending, and complex morphologies as future work.

Significance. If the reported bias levels hold under the stated assumptions, the work constitutes a meaningful step toward calibration-free shear measurement at the precision required by Stage-IV surveys. The combination of differentiable galaxy models with modern gradient-based sampling directly addresses both the noise-bias problem and the computational scaling barrier for billions of galaxies. The explicit scoping to isolated exponentials with known PSF avoids over-claim while demonstrating that full posterior propagation can meet LSST multiplicative-bias targets in a controlled setting; this provides a concrete baseline against which more realistic extensions can be judged.

major comments (2)

[§4] §4 (simulation and inference setup): the reported 3σ bounds on |m| are obtained from a finite number of simulated galaxies whose intrinsic ellipticity and size distributions are either fixed or jointly sampled; the manuscript should state the exact number of galaxies used, the convergence diagnostics applied to the MCMC chains, and whether the quoted uncertainties incorporate the finite-sample variance of the bias estimator itself.
[§3.2] §3.2 (differentiable forward model): the claim that pixel-noise error is automatically propagated without external calibration rests on the differentiability of the model; an explicit statement is needed on whether the chosen galaxy profile (exponential) and any numerical approximations in the rendering step introduce additional systematic terms that are not captured by the reported bias figures.

minor comments (3)

[Abstract] The abstract and introduction should make the scope limitation (isolated exponentials, known PSF) more prominent in the first paragraph so that readers immediately understand the controlled nature of the test.
[Figures] Figure captions for the bias-versus-shear plots should list the exact simulation parameters (noise levels, galaxy density, prior widths) rather than referring only to 'LSST-like' conditions.
[Methods] A short table summarizing the MCMC settings (number of chains, burn-in, thinning, effective sample size per galaxy) would improve reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their positive assessment of the work and for the constructive recommendation of minor revision. The comments highlight useful points for improving clarity and reproducibility, which we address below. We have prepared revisions to the manuscript that incorporate the requested details without changing the scope or conclusions of the study.

read point-by-point responses

Referee: [§4] §4 (simulation and inference setup): the reported 3σ bounds on |m| are obtained from a finite number of simulated galaxies whose intrinsic ellipticity and size distributions are either fixed or jointly sampled; the manuscript should state the exact number of galaxies used, the convergence diagnostics applied to the MCMC chains, and whether the quoted uncertainties incorporate the finite-sample variance of the bias estimator itself.

Authors: We agree that these details strengthen the presentation of the results. In the revised manuscript we will explicitly report the number of simulated galaxies used for the bias measurements in each case, describe the MCMC convergence diagnostics that were applied (including the Gelman-Rubin statistic), and clarify that the quoted 3σ uncertainties on |m| are computed from the standard error of the bias estimator across the ensemble and therefore already incorporate finite-sample variance. revision: yes
Referee: [§3.2] §3.2 (differentiable forward model): the claim that pixel-noise error is automatically propagated without external calibration rests on the differentiability of the model; an explicit statement is needed on whether the chosen galaxy profile (exponential) and any numerical approximations in the rendering step introduce additional systematic terms that are not captured by the reported bias figures.

Authors: We thank the referee for requesting this clarification. Because the analysis is performed in a strictly matched-model regime, the same exponential profile and rendering procedure are used both to generate the simulated images and to sample the posterior. Consequently, any systematic contributions from the profile choice or from numerical approximations in the rendering (e.g., pixel integration) are fully included in the measured bias values. The differentiability of the forward model ensures that pixel-noise uncertainties are propagated exactly into the shear posterior without external calibration. We will add an explicit paragraph in §3.2 stating this point. revision: yes

Circularity Check

0 steps flagged

Minor self-citation of prior framework; central bias measurement independent

full rationale

The paper implements the Bayesian shear inference framework from Schneider et al. (2014) (one author overlap) but applies it to forward-simulated isolated exponential galaxies whose noise realizations and intrinsic property distributions are generated independently of the inference model. The reported |m| bounds are obtained by direct comparison of inferred shear posteriors against the known input shear in these simulations; no equation or fitted parameter is redefined as a prediction by construction. The self-citation is not load-bearing for the bias result, which remains a first-application measurement on matched-model data. No self-definitional loops, fitted-input predictions, or ansatz smuggling are present in the derivation chain.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central performance numbers rest on the assumption that PSF and background are known exactly and that galaxies are isolated exponentials; no new physical entities are introduced.

axioms (2)

domain assumption PSF and sky background are known
Explicitly stated as the case considered in the abstract.
domain assumption Galaxies are isolated exponential profiles
The simulation setup used for the bias measurement.

pith-pipeline@v0.9.0 · 5650 in / 1386 out tokens · 34078 ms · 2026-05-08T13:39:47.518969+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

2 extracted references · 2 canonical work pages

[1]

The advantage of using theη parameterization is that the ellipticity domain is now the real numbers. Thus, Gaussian noise can be added to each component independently to obtain noisy ellipticities: ˜η1,2 ∼ N(η 1,2, ση),(A.2) whereσ η is the independent scatter of each component representing a degree of measurement error. We choose the value ofσ η = 0.1 by...

work page
[2]

| (A.3) whereε ′ 1,2 are the noiseless ellipticities in theεparam- eterization. The first factor is the likelihood defined by Equation A.2, the second factor is the interim prior on ellipticities which we choose to be the same as in Equa- tion 14, and the last factor is the absolute determinant of the Jacobian defined by the inverse of Equation A.1. In te...

work page 2000

[1] [1]

The advantage of using theη parameterization is that the ellipticity domain is now the real numbers. Thus, Gaussian noise can be added to each component independently to obtain noisy ellipticities: ˜η1,2 ∼ N(η 1,2, ση),(A.2) whereσ η is the independent scatter of each component representing a degree of measurement error. We choose the value ofσ η = 0.1 by...

work page

[2] [2]

| (A.3) whereε ′ 1,2 are the noiseless ellipticities in theεparam- eterization. The first factor is the likelihood defined by Equation A.2, the second factor is the interim prior on ellipticities which we choose to be the same as in Equa- tion 14, and the last factor is the absolute determinant of the Jacobian defined by the inverse of Equation A.1. In te...

work page 2000