pith. sign in

arxiv: 2606.08368 · v1 · pith:TNVJDE5Qnew · submitted 2026-06-06 · 🌌 astro-ph.CO

Reconciling large-scale Lyman-α correlations with the SCRIPT Semi-numerical Model

Pith reviewed 2026-06-27 19:04 UTC · model grok-4.3

classification 🌌 astro-ph.CO
keywords Lyman-alpha forestreionizationcosmic variancetransmission spikessemi-numerical modelslarge-scale correlationsXQR-30
0
0 comments X

The pith

Rare transmissive sightlines drive the large Lyman-alpha correlations seen at redshift 6.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests why Lyman-alpha forest data at redshift 6 show strong correlations on scales larger than 200 comoving megaparsecs, a signal that has been hard to reproduce in reionization simulations. Using mock spectra from the SCRIPT semi-numerical model, the authors find that most realizations underpredict the correlation length, yet a minority match the observations. A delete-2 jackknife analysis identifies two rare, highly transmissive sightlines as the main drivers of the measured signal. Adding two such sightlines to the mock ensemble raises the fraction of models consistent with both the correlation length and its redshift evolution from 17.5 percent to 74.1 percent. The analysis also confirms that spatial variations in the ionizing mean free path remain necessary to produce the observed correlation structure.

Core claim

The unexpectedly large Lyman-alpha correlations in the extended XQR-30 sample arise from cosmic variance tied to a small number of highly transmissive sightlines; the SCRIPT model ensemble reproduces the data once those rare sightlines are accounted for, with the delete-2 jackknife isolating their outsized statistical weight and the insertion test showing that mean-free-path fluctuations are still required.

What carries the argument

Delete-2 jackknife analysis on the observed sightlines combined with targeted insertion of highly transmissive sightlines into SCRIPT mock realizations, which isolates the statistical dominance of rare transmission spikes.

Load-bearing premise

The jackknife correctly identifies the two transmissive sightlines as the dominant statistical drivers, and inserting two such sightlines into the mocks provides a fair test of whether the underlying model can generate the observed signal.

What would settle it

A new, larger sample of sightlines at z approximately 6 that continues to show the same large correlation length even after the two most transmissive sightlines are removed, or after the full ensemble is re-analyzed without any insertion step.

read the original abstract

Recent analyses of high-redshift Lyman-$\alpha$ forest observations have revealed strong correlations on scales exceeding 200 cMpc at redshift z = 6. Reproducing these large-scale correlations has proven challenging for current large-volume reionization simulations. In this work, we investigate these large-scale correlations using mock spectra generated from the extended SCRIPT semi-numerical reionization model. We find that while the fiducial model ensemble systematically predicts smaller correlation lengths than those inferred from the 67 sightlines in the extended XQR-30 sample, a small fraction of individual mock realizations can naturally reproduce the observed signal. Using a delete-2 jackknife analysis, we demonstrate that the observed large-scale correlation length is disproportionately driven by a rare pair of highly transmissive sightlines associated with high-redshift transmission spikes. By inserting two such highly transmissive sightlines into our mock realizations, the fraction of models consistent with the observed redshift evolution and correlation length increases significantly from 17.5% to 74.1%. Furthermore, we show that spatial fluctuations in the ionizing mean free path remain an essential physical ingredient for reproducing the observed correlation structure. Our results suggest that the unexpectedly large Lyman-$\alpha$ correlations can be reconciled with existing reionization models when accounting for cosmic variance and the outsized statistical impact of rare, highly transmissive sightlines.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper uses mock Lyman-α spectra from the SCRIPT semi-numerical reionization model to address the tension between observed large-scale (≳200 cMpc) correlations at z=6 in the extended XQR-30 sample and existing simulations. It reports that the fiducial ensemble underpredicts the observed correlation length, but a small fraction of realizations match; a delete-2 jackknife shows the signal is driven by a rare pair of highly transmissive sightlines; inserting two such sightlines raises the fraction of consistent models from 17.5% to 74.1%; and mean-free-path fluctuations are required to reproduce the correlation structure. The central claim is that cosmic variance plus the outsized impact of rare transmissive sightlines reconciles the data with the model.

Significance. If the central claim holds, the work resolves an apparent discrepancy between Lyman-α forest data and reionization models without invoking new physics, by quantifying the role of cosmic variance and rare events. The concrete fractions (17.5% → 74.1%), delete-2 jackknife, and explicit test of mean-free-path fluctuations provide falsifiable, quantitative elements that strengthen the assessment.

major comments (2)
  1. [Abstract (jackknife and insertion paragraph)] Abstract (paragraph on jackknife and insertion test): The delete-2 jackknife correctly flags the pair of transmissive sightlines as dominant drivers, but the subsequent insertion of the two observed sightlines into the mock ensemble only demonstrates their statistical leverage if present; it does not test whether SCRIPT naturally produces such sightlines at the observed rate, with the correct spatial clustering, or transmission properties, which is required to support the reconciliation claim that the model can match the data once cosmic variance is included.
  2. [Abstract (mean-free-path sentence)] Abstract (final sentence on mean-free-path fluctuations): The statement that spatial fluctuations in the ionizing mean free path 'remain an essential physical ingredient' is load-bearing for the conclusion, yet the manuscript provides no explicit side-by-side comparison (e.g., correlation functions or consistency fractions) between the fiducial SCRIPT run and an otherwise identical run with uniform mean free path, making it impossible to quantify how essential the fluctuations are versus other model ingredients.
minor comments (1)
  1. [Abstract] The abstract reports specific percentages (17.5%, 74.1%) and the delete-2 procedure but does not define the precise correlation estimator or the sightline selection criteria used in the jackknife; these should be stated explicitly even in the abstract for reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments, which help to sharpen the scope of our claims regarding cosmic variance and rare sightlines in the SCRIPT model. We address each major comment below.

read point-by-point responses
  1. Referee: [Abstract (jackknife and insertion paragraph)] Abstract (paragraph on jackknife and insertion test): The delete-2 jackknife correctly flags the pair of transmissive sightlines as dominant drivers, but the subsequent insertion of the two observed sightlines into the mock ensemble only demonstrates their statistical leverage if present; it does not test whether SCRIPT naturally produces such sightlines at the observed rate, with the correct spatial clustering, or transmission properties, which is required to support the reconciliation claim that the model can match the data once cosmic variance is included.

    Authors: We agree that the insertion test primarily quantifies the outsized statistical leverage of the observed transmissive pair rather than directly demonstrating that SCRIPT produces sightlines with matching frequency, clustering, and transmission properties. The fiducial ensemble already yields a non-zero (17.5%) fraction of realizations consistent with the data, showing that such sightlines arise naturally, albeit rarely. To strengthen the reconciliation argument, we will add a dedicated comparison in the revised manuscript of the occurrence rate, spatial distribution, and transmission statistics of highly transmissive sightlines between the SCRIPT mocks and the extended XQR-30 sample. The abstract will be updated to reflect this additional analysis. revision: yes

  2. Referee: [Abstract (mean-free-path sentence)] Abstract (final sentence on mean-free-path fluctuations): The statement that spatial fluctuations in the ionizing mean free path 'remain an essential physical ingredient' is load-bearing for the conclusion, yet the manuscript provides no explicit side-by-side comparison (e.g., correlation functions or consistency fractions) between the fiducial SCRIPT run and an otherwise identical run with uniform mean free path, making it impossible to quantify how essential the fluctuations are versus other model ingredients.

    Authors: We acknowledge that the manuscript does not currently include an explicit side-by-side comparison that isolates the contribution of mean-free-path fluctuations. In the revised version we will add this comparison, showing both the correlation functions and the fractions of consistent realizations for the fiducial SCRIPT run (with fluctuations) and an otherwise identical run with a spatially uniform mean free path. This will allow a quantitative assessment of how essential the fluctuations are for reproducing the observed large-scale correlation structure. revision: yes

Circularity Check

0 steps flagged

No circularity: derivation uses independent mocks and data-driven diagnostics

full rationale

The paper runs the SCRIPT semi-numerical model to produce an ensemble of mock spectra, applies a delete-2 jackknife directly to the observed 67 sightlines to isolate the pair of transmissive spikes as the dominant contributors to the measured correlation length, and then reports the change in consistency fraction (17.5% to 74.1%) after manually inserting those same observed sightlines into the mocks. This insertion step is explicitly framed as a diagnostic of cosmic-variance leverage rather than a model prediction or fitted parameter. No equation, ansatz, or self-citation in the provided text reduces any claimed result to its own inputs by construction; the central reconciliation statement rests on the model's independent realizations and the external data's jackknife analysis.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the SCRIPT model's ability to generate realistic transmission spikes and on the assumption that the observed sample is representative once two outliers are considered. No new free parameters, axioms, or invented entities are introduced in the abstract.

axioms (1)
  • domain assumption The SCRIPT semi-numerical reionization model produces statistically representative transmission fields on >200 cMpc scales.
    Invoked when comparing mock correlation lengths to the XQR-30 data.

pith-pipeline@v0.9.1-grok · 5775 in / 1416 out tokens · 15927 ms · 2026-06-27T19:04:58.465894+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

46 extracted references · 3 canonical work pages · 3 internal anchors

  1. [1]

    Barkana and A

    R. Barkana and A. Loeb,In the beginning: The first sources of light and the reionization of the universe,Physics Reports349(2001) 125 [astro-ph/0010468]

  2. [2]

    Choudhury,A short introduction to reionization physics,General Relativity and Gravitation54(2022) 102

    T.R. Choudhury,A short introduction to reionization physics,General Relativity and Gravitation54(2022) 102

  3. [3]

    Chakraborty and T.R

    A. Chakraborty and T.R. Choudhury,Reionization and its sources, inEncyclopedia of Astrophysics, Volume 4, vol. 4, (eprint: arXiv:2502.12004), pp. 433–452, Jan., 2026, DOI

  4. [4]

    Gunn and B.A

    J.E. Gunn and B.A. Peterson,On the Density of Neutral Hydrogen in Intergalactic Space.,The Astrophysical Journal142(1965) 1633

  5. [5]

    Becker, J.S

    G.D. Becker, J.S. Bolton and A. Lidz,Reionisation and High-Redshift Galaxies: The View from Quasar Absorption Lines,Publications of the Astronomical Society of Australia32(2015) e045

  6. [6]

    Becker, J.S

    G.D. Becker, J.S. Bolton, P. Madau, M. Pettini, E.V. Ryan-Weber and B.P. Venemans, – 17 – 5.0 5.2 5.4 5.6 5.8 6.0 z −0.2 0.0 0.2 0.4 0.6 A 5.0 5.2 5.4 5.6 5.8 6.0 z 101 102 L [cMpc] Data from Bosman et al. (2022) Matrix from Spina et al. (2026) Figure 11. Redshift evolution of the correlation amplitude and length. The results obtained using the observed d...

  7. [7]

    Bosman, X

    S.E.I. Bosman, X. Fan, L. Jiang, S. Reed, Y. Matsuoka, G. Becker et al.,New constraints on Lyman-αopacity with a sample of 62 quasarsat z>5.7,Monthly Notices of the Royal Astronomical Society479(2018) 1055

  8. [8]

    Davies and S.R

    F.B. Davies and S.R. Furlanetto,Large fluctuations in the hydrogen-ionizing background and mean free path following the epoch of reionization,Monthly Notices of the Royal Astronomical Society460(2016) 1328

  9. [9]

    D’Aloisio, M

    A. D’Aloisio, M. McQuinn and H. Trac,LARGE OPACITY VARIATIONS IN THE HIGH-REDSHIFT LYαFOREST: THE SIGNATURE OF RELIC TEMPERATURE FLUCTUATIONS FROM PATCHY REIONIZATION,The Astrophysical Journal Letters813 (2015) L38

  10. [10]

    Kulkarni, L.C

    G. Kulkarni, L.C. Keating, M.G. Haehnelt, S.E.I. Bosman, E. Puchwein, J. Chardin et al., Large Lyαopacity fluctuations and low CMBτin models of late reionization with large islands of neutral hydrogen extending to z<5.5,Monthly Notices of the Royal Astronomical Society: Letters485(2019) L24

  11. [11]

    Keating, L.H

    L.C. Keating, L.H. Weinberger, G. Kulkarni, M.G. Haehnelt, J. Chardin and D. Aubert,Long troughs in the Lyman-αforest below redshift 6 due to islands of neutral hydrogen,Monthly Notices of the Royal Astronomical Society491(2020) 1736

  12. [12]

    Nasir and A

    F. Nasir and A. D’Aloisio,Observing the tail of reionization: Neutral islands in the z = 5.5 Lyman-αforest,Monthly Notices of the Royal Astronomical Society494(2020) 3080

  13. [13]

    Choudhury, A

    T.R. Choudhury, A. Paranjape and S.E.I. Bosman,Studying the Lymanαoptical depth fluctuations at z∼5.5 using fast semi-numerical methods,Monthly Notices of the Royal Astronomical Society501(2021) 5782

  14. [14]

    Y. Qin, A. Mesinger, S.E.I. Bosman and M. Viel,Reionization and galaxy inference from the high-redshift Lyαforest,Monthly Notices of the Royal Astronomical Society506(2021) 2390. – 18 –

  15. [15]

    Y. Qin, A. Mesinger, D. Prelogovi´ c, G. Becker, M. Bischetti, S. Bosman et al.,Percent-level timing of reionisation: Self-consistent, implicit-likelihood inference from XQR-30+ Lyαforest data,Publications of the Astronomical Society of Australia42(2025) e049

  16. [16]

    D’Odorico, E

    V. D’Odorico, E. Ba˜ nados, G.D. Becker, M. Bischetti, S.E.I. Bosman, G. Cupani et al., XQR-30: The ultimate XSHOOTER quasar sample at the reionization epoch,Monthly Notices of the Royal Astronomical Society523(2023) 1399

  17. [17]

    Bosman, F.B

    S.E.I. Bosman, F.B. Davies, G.D. Becker, L.C. Keating, R.L. Davies, Y. Zhu et al.,Hydrogen reionization ends by z = 5.3: Lyman-αoptical depth measured by the XQR-30 sample,Monthly Notices of the Royal Astronomical Society514(2022) 55

  18. [18]

    Songaila and L.L

    A. Songaila and L.L. Cowie,Approaching Reionization: The Evolution of the LyαForest from z = 4 to z = 6,The Astronomical Journal123(2002) 2183

  19. [19]

    Gallerani, T.R

    S. Gallerani, T.R. Choudhury and A. Ferrara,Constraining the reionization history with QSO absorption spectra,Monthly Notices of the Royal Astronomical Society370(2006) 1401

  20. [20]

    Fan, M.A

    X. Fan, M.A. Strauss, R.H. Becker, R.L. White, J.E. Gunn, G.R. Knapp et al.,Constraining the Evolution of the Ionizing Background and the Epoch of Reionization with z˜6 Quasars. II. A Sample of 19 Quasars,The Astronomical Journal132(2006) 117

  21. [21]

    Gallerani, A

    S. Gallerani, A. Ferrara, X. Fan and T.R. Choudhury,Glimpsing through the high-redshift neutral hydrogen fog,Monthly Notices of the Royal Astronomical Society386(2008) 359

  22. [22]

    Zhu, G.D

    Y. Zhu, G.D. Becker, S.E.I. Bosman, L.C. Keating, H.M. Christenson, E. Ba˜ nados et al., Chasing the Tail of Cosmic Reionization with Dark Gap Statistics in the LyαForest over 5<z <6,The Astrophysical Journal923(2021) 223

  23. [23]

    Studying dark gaps in Ly-$\alpha$ forest transmission with large reionization simulations

    B. Maity, F.B. Davies, B. Spina and S.E.I. Bosman,Studying dark gaps in Ly-$α$forest transmission with large reionization simulation, Jan., 2026. 10.48550/arXiv.2602.00301

  24. [24]

    McGreer, A

    I.D. McGreer, A. Mesinger and X. Fan,The first (nearly) model-independent constraint on the neutral hydrogen fraction at z∼5–6,Monthly Notices of the Royal Astronomical Society415 (2011) 3237

  25. [25]

    McGreer, A

    I.D. McGreer, A. Mesinger and V. D’Odorico,Model-independent evidence in favour of an end to reionization by z≈6,Monthly Notices of the Royal Astronomical Society447(2015) 499

  26. [26]

    Davies, S.E.I

    F.B. Davies, S.E.I. Bosman, V. D’Odorico, S. Campo, A. Mesinger, Y. Qin et al.,Updated dark pixel fraction constraints on reionization’s end from the Lyman-series forests of XQR-30, Monthly Notices of the Royal Astronomical Society545(2026) staf1862

  27. [27]

    Spina, S.E.I

    B. Spina, S.E.I. Bosman, B. Maity and F.B. Davies,Measuring the intergalactic medium correlation length at 5<z<6.1: A fast change at the end of reionization,Astronomy & Astrophysics706(2026) A273

  28. [28]

    Choudhury and A

    T.R. Choudhury and A. Paranjape,Photon number conservation and the large-scale 21 cm power spectrum in seminumerical models of reionization,Monthly Notices of the Royal Astronomical Society481(2018) 3821 [1807.00836]

  29. [29]

    Choudhury and A

    T.R. Choudhury and A. Chakraborty,Capturing small-scale reionization physics: A sub-grid model for photon sinks with SCRIPT,Journal of Cosmology and Astroparticle Physics2025 (2025) 114

  30. [30]

    P.a.R. Ade, N. Aghanim, M. Arnaud, M. Ashdown, J. Aumont, C. Baccigalupi et al.,Planck 2015 results - XIII. Cosmological parameters,Astronomy & Astrophysics594(2016) A13

  31. [31]

    Maity and T.R

    B. Maity and T.R. Choudhury,Probing the thermal history during reionization using a seminumerical photon-conserving code script,Monthly Notices of the Royal Astronomical Society511(2022) 2239 [2110.14231]. – 19 –

  32. [32]

    Maity, F

    B. Maity, F. Davies and P. Gaikwad,Efficient modeling of Lyman-αopacity fluctuations during the late reionization epoch,Astronomy and Astrophysics706(2026) A36

  33. [33]

    Torrado and A

    J. Torrado and A. Lewis,Cobaya: Bayesian analysis in cosmology,Astrophysics Source Code Library(2019) ascl:1910.019

  34. [34]

    Torrado and A

    J. Torrado and A. Lewis,Cobaya: Code for Bayesian Analysis of hierarchical physical models, Journal of Cosmology and Astroparticle Physics2021(2021) 057 [2005.05290]

  35. [35]

    Metropolis, A.W

    N. Metropolis, A.W. Rosenbluth, M.N. Rosenbluth, A.H. Teller and E. Teller,Equation of State Calculations by Fast Computing Machines,The Journal of Chemical Physics21(1953) 1087

  36. [36]

    Lewis,Efficient sampling of fast and slow cosmological parameters,Physical Review D87 (2013) 103529

    A. Lewis,Efficient sampling of fast and slow cosmological parameters,Physical Review D87 (2013) 103529

  37. [37]

    Cosmological parameters from CMB and other data: a Monte-Carlo approach

    A. Lewis and S. Bridle,Cosmological parameters from CMB and other data: A Monte-Carlo approach, Oct., 2002. 10.48550/arXiv.astro-ph/0205436

  38. [38]

    Gelman and D.B

    A. Gelman and D.B. Rubin,Inference from Iterative Simulation Using Multiple Sequences, Statistical Science7(1992) 457

  39. [39]

    GetDist: a Python package for analysing Monte Carlo samples

    A. Lewis,GetDist: A Python package for analysing Monte Carlo samples, Oct., 2019. 10.48550/arXiv.1910.13970

  40. [40]

    Bhattacharyya,On a Measure of Divergence between Two Multinomial Populations, Sankhy¯ a: The Indian Journal of Statistics (1933-1960)7(1946) 401 [25047882]

    A. Bhattacharyya,On a Measure of Divergence between Two Multinomial Populations, Sankhy¯ a: The Indian Journal of Statistics (1933-1960)7(1946) 401 [25047882]

  41. [41]

    Kashyap,The perfect marriage and much more: Combining dimension reduction, distance measures and covariance,Physica A: Statistical Mechanics and its Applications536(2019) 120938

    R. Kashyap,The perfect marriage and much more: Combining dimension reduction, distance measures and covariance,Physica A: Statistical Mechanics and its Applications536(2019) 120938

  42. [42]

    Gnedin, G.D

    N.Y. Gnedin, G.D. Becker and X. Fan,Cosmic Reionization on Computers: Properties of the Post-reionization IGM,The Astrophysical Journal841(2017) 26

  43. [43]

    Chardin, M.G

    J. Chardin, M.G. Haehnelt, S.E.I. Bosman and E. Puchwein,A tale of seven narrow spikes and a long trough: Constraining the timing of the percolation of H ii bubbles at the tail end of reionization with ULAS J1120+0641,Monthly Notices of the Royal Astronomical Society473 (2018) 765

  44. [44]

    Kakiichi, R.S

    K. Kakiichi, R.S. Ellis, N. Laporte, A. Zitrin, A.-C. Eilers, E. Ryan-Weber et al.,The role of galaxies and AGN in reionizing the IGM – I. Keck spectroscopy of 5<z<7 galaxies in the QSO field J1148+5251,Monthly Notices of the Royal Astronomical Society479(2018) 43

  45. [45]

    Garaldi, N.Y

    E. Garaldi, N.Y. Gnedin and P. Madau,Constraining the Tail End of Reionization Using Lyα Transmission Spikes,The Astrophysical Journal876(2019) 31

  46. [46]

    Gaikwad, M

    P. Gaikwad, M. Rauch, M.G. Haehnelt, E. Puchwein, J.S. Bolton, L.C. Keating et al.,Probing the thermal state of the intergalactic medium at z>5 with the transmission spikes in high-resolution Lyαforest spectra,Monthly Notices of the Royal Astronomical Society494 (2020) 5091. – 20 –