Implicit inference of the reionization history with higher-order statistics of the 21-cm signal
Pith reviewed 2026-05-21 18:07 UTC · model grok-4.3
The pith
Combining higher-order statistics with the cylindrical power spectrum improves constraints on the average neutral hydrogen fraction by about a third.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In mock 21-cm observations using the AA* SKAO configuration and added noise, combining higher-order statistics with the cylindrical power spectrum improves the mean figure of merit by ∼0.25 dex, which amounts to a ∼33% reduction in σ(x̄_HI) for the average neutral hydrogen fraction at redshifts centered at 8.0, 7.2, and 6.5.
What carries the argument
Implicit inference framework learning posteriors of x̄_HI from a mix of Gaussian statistics like power spectra and non-Gaussian ones like Betti numbers and the bispectrum.
If this is right
- Betti numbers alone provide more information than the spherical or cylindrical power spectra on average.
- The bispectrum contributes limited additional constraining power.
- The relative importance of each statistic changes across different stages of reionization.
- Combining these statistics with SKAO data will increase the overall information extracted from observations of the Epoch of Reionization.
Where Pith is reading between the lines
- Real SKAO data analyses could benefit from routinely including Betti numbers alongside power spectra to reduce uncertainties.
- Similar combinations of statistics might improve constraints in other cosmological probes involving non-Gaussian signals.
- Further validation with varied noise models or telescope setups would help confirm the robustness of the improvement.
Load-bearing premise
The mock 21-cm observations with added noise for the AA* SKAO configuration accurately represent the real signal and systematics in future observations.
What would settle it
Running the inference on actual SKAO 21-cm observations and verifying that the derived uncertainties on x̄_HI are consistent with those from independent methods such as quasar spectra or CMB polarization data.
Figures
read the original abstract
The Epoch of Reionization (EoR), when the first luminous sources ionised the intergalactic medium, represents a new frontier in cosmology. The Square Kilometre Array Observatory (SKAO) will offer unprecedented insights into this era through observations of the redshifted 21-cm signal, enabling constraints on the Universe's reionization history. We investigate the information content of the average neutral hydrogen fraction ($\bar{x}_{\rm HI}$) in several Gaussian (spherical and cylindrical power spectra) and non-Gaussian (Betti numbers and bispectrum) summary statistics of the 21-cm signal. Mock 21-cm observations are generated using the AA* configuration of SKAO's low-frequency telescope, incorporating noise levels for 100 and 1000 hours. We employ a state-of-the-art implicit inference framework to learn posterior distributions of $\bar{x}_{\rm HI}$ in redshift bins centred at $z=8.0,7.2$ and $6.5$, for each statistic and noise scenario, validating the posteriors through calibration tests. Using the figure of merit to assess constraining power, we find that Betti numbers alone are on average more informative than the power spectra, while the bispectrum provides limited constraints. However, combining higher-order statistics with the cylindrical power spectrum improves the mean figure of merit by $\sim$0.25 dex ($\sim33\%$ reduction in $\sigma(\bar{x}_{\rm HI})$). The relative contribution of each statistic varies with the stage of reionization. With SKAO observations approaching, our results show that combining power spectra with higher-order statistics can significantly increase the information retrieved from the EoR, maximising the scientific return of future 21-cm observations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper examines the constraining power of Gaussian (spherical and cylindrical power spectra) and non-Gaussian (Betti numbers, bispectrum) summary statistics of the 21-cm signal on the mean neutral hydrogen fraction x̄_HI during reionization. Using implicit inference on SKAO AA* mock observations with thermal noise for 100 and 1000 hours at z=8.0, 7.2 and 6.5, it reports that Betti numbers outperform power spectra on average, the bispectrum adds limited information, and their combination with the cylindrical power spectrum yields a mean figure-of-merit gain of ∼0.25 dex (∼33% reduction in σ(x̄_HI)). Posteriors are validated via calibration tests on the same mock suite.
Significance. If the reported improvement holds under more realistic conditions, the work demonstrates that higher-order statistics can meaningfully tighten constraints on reionization history from SKAO data beyond what power spectra alone provide, supporting the value of non-Gaussian summaries in 21-cm cosmology.
major comments (3)
- [§3] §3 (Mock observations): The central FoM gain of 0.25 dex is derived from mocks that include only thermal noise added to the AA* SKAO configuration. Real SKAO data will contain residual foregrounds, ionospheric distortions and calibration errors that are known to contaminate higher-order statistics more severely than the power spectrum; without explicit tests injecting these systematics, the claimed improvement in σ(x̄_HI) cannot be considered robust.
- [§4] §4 (Implicit inference framework): The manuscript provides insufficient detail on the neural-network architecture, training procedure, summary-statistic preprocessing and hyper-parameter choices used for the implicit inference. Because the posteriors and FoM are learned directly from these mocks, the absence of this information prevents independent assessment of calibration-test reliability and potential biases.
- [Results] Results section (redshift dependence): The relative contribution of each statistic is stated to vary with reionization stage, yet no quantitative breakdown (e.g., per-redshift FoM tables or posterior widths) is supplied to support the claim that the 0.25 dex average gain is not driven by a single redshift bin.
minor comments (2)
- [Figures] Figure captions should explicitly state the number of mock realizations used for training and validation to allow readers to judge statistical significance of the reported FoM differences.
- [§2] Notation for the cylindrical power spectrum (k_⊥, k_∥) should be defined consistently in the text and figures to avoid ambiguity with the spherical power spectrum.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed review. We address each major comment below and have revised the manuscript to incorporate clarifications and additional information where feasible. Our responses focus on strengthening the presentation of the results while acknowledging the limitations of the current mock setup.
read point-by-point responses
-
Referee: [§3] §3 (Mock observations): The central FoM gain of 0.25 dex is derived from mocks that include only thermal noise added to the AA* SKAO configuration. Real SKAO data will contain residual foregrounds, ionospheric distortions and calibration errors that are known to contaminate higher-order statistics more severely than the power spectrum; without explicit tests injecting these systematics, the claimed improvement in σ(x̄_HI) cannot be considered robust.
Authors: We agree that the mocks include only thermal noise and do not incorporate residual foregrounds, ionospheric distortions or calibration errors, which are expected to affect non-Gaussian statistics more than power spectra. This represents a genuine limitation of the present study. We have added a new paragraph in Section 3 and expanded the discussion in the conclusions to explicitly note these systematics, their potential differential impact, and the need for future work with more realistic mocks. The reported gains are therefore presented as a baseline under controlled thermal-noise conditions rather than a direct prediction for real data. We have not performed additional simulations with injected systematics, as this would require a substantially broader scope and computational resources beyond the current manuscript. revision: partial
-
Referee: [§4] §4 (Implicit inference framework): The manuscript provides insufficient detail on the neural-network architecture, training procedure, summary-statistic preprocessing and hyper-parameter choices used for the implicit inference. Because the posteriors and FoM are learned directly from these mocks, the absence of this information prevents independent assessment of calibration-test reliability and potential biases.
Authors: We thank the referee for highlighting this omission. We have revised Section 4 to include a dedicated subsection that fully specifies the neural-network architecture (layer counts, neuron numbers, activation functions), training procedure (data splits, loss function, optimizer, epochs), summary-statistic preprocessing (normalization and any dimensionality handling), and hyper-parameter choices (learning rate, batch size, regularization). These additions should now allow independent evaluation of the calibration tests and any potential biases in the learned posteriors. revision: yes
-
Referee: [Results] Results section (redshift dependence): The relative contribution of each statistic is stated to vary with reionization stage, yet no quantitative breakdown (e.g., per-redshift FoM tables or posterior widths) is supplied to support the claim that the 0.25 dex average gain is not driven by a single redshift bin.
Authors: We acknowledge that a quantitative per-redshift breakdown strengthens the claim. We have added a new table in the Results section reporting the figure of merit and 1σ posterior widths on x̄_HI for each statistic and combination at z=8.0, 7.2 and 6.5 separately. The table shows that the improvement from combining higher-order statistics with the cylindrical power spectrum is present across all three redshifts, although the magnitude varies with reionization stage, confirming that the mean 0.25 dex gain is not driven by any single bin. revision: yes
Circularity Check
No significant circularity; central results are simulation-based comparisons validated by calibration tests
full rationale
The paper generates mock 21-cm observations using the AA* SKAO configuration with added thermal noise for 100 and 1000 hours, then applies an implicit inference framework to learn posteriors on x̄_HI for individual and combined summary statistics (power spectra, Betti numbers, bispectrum). Figure of merit is computed directly from the resulting posterior widths on these mocks, with explicit calibration tests mentioned to check for biases. This constitutes a standard forward-modeling workflow rather than any self-definitional loop, fitted input renamed as prediction, or load-bearing self-citation chain that reduces the claimed 0.25 dex FoM improvement to the inputs by construction. The derivation remains self-contained against the stated simulation assumptions and does not invoke uniqueness theorems or ansatzes from prior self-work in a circular manner.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Mock 21-cm observations with AA* configuration and specified noise levels for 100 and 1000 hours accurately represent expected SKAO data properties.
- domain assumption The implicit inference framework produces well-calibrated posteriors for x̄_HI when trained on the chosen summary statistics.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AlexanderDuality.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We compute Betti numbers ... as a function of a threshold value v ... β0 counts connected components, β1 tunnels, β2 voids.
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Combining higher-order statistics with the cylindrical power spectrum improves the mean figure of merit by ∼0.25 dex
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 1 Pith paper
-
Beyond power spectrum to unveil systematics on HI intensity maps
Starlet l1-norm applied to simulated HI brightness temperature maps at z~0.4 yields almost 3x higher figure of merit for cosmological parameters than angular power spectrum by capturing non-Gaussian information and sh...
Reference graph
Works this paper leans on
-
[1]
Simulation based inference of the ionization history from the 2D 21 cm power spectrum
AcharyaA.,etal.,2024,MonthlyNoticesoftheRoyalAstronomicalSociety, 527, 7835 Barry N., Hazelton B., Sullivan I., Morales M., Pober J., 2016, Monthly Notices of the Royal Astronomical Society, 461, 3135 BeckerG.D.,BoltonJ.S.,MadauP.,PettiniM.,Ryan-WeberE.V.,Venemans B. P., 2015, Monthly Notices of the Royal Astronomical Society, 447, 3402 Bianco M., Giri S....
work page internal anchor Pith review Pith/arXiv arXiv 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.