Interpretability of deep-learning methods applied to large-scale structure surveys
Pith reviewed 2026-05-23 04:41 UTC · model grok-4.3
The pith
A convolutional neural network for large-scale structure surveys draws its predictions from a mix of Gaussian and non-Gaussian information, with emphasis on scales near the linear-to-nonlinear transition.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Training the network on degraded large-scale structure data shows that its parameter predictions rely on a mix of both Gaussian and non-Gaussian information, and that the network places particular emphasis on structures whose scales lie at the limit between the linear and nonlinear regimes.
What carries the argument
The technique of training and predicting with input maps from which targeted information has been removed, then measuring the resulting change in constraining power.
If this is right
- The network accesses information beyond what is captured by Gaussian statistics alone.
- The emphasis on transitional scales implies sensitivity to mildly nonlinear structures.
- The combination of information types may allow the network to break parameter degeneracies that affect traditional analyses.
Where Pith is reading between the lines
- If the result holds, networks could be tested on mocks engineered to suppress non-Gaussian features to confirm the claimed dependence.
- The same degradation method could be applied to other summary statistics to compare their information sources directly.
Load-bearing premise
Removing specific information from the survey data isolates the network's dependence on those features without the removal process itself creating new training dynamics or compensatory behaviors.
What would settle it
A measurement in which removing the claimed non-Gaussian or transitional-scale information leaves the network's error bars and bias unchanged.
Figures
read the original abstract
Deep learning and convolutional neural networks in particular are powerful and promising tools for cosmological analysis of large-scale structure surveys. They are already providing similar performance to classical analysis methods using fixed summary statistics, are showing potential to break key degeneracies by better probe combination and will likely improve rapidly in the coming years as progress is made in the physical modelling through both software and hardware improvement. One key issue remains: unlike classical analysis, a convolutional neural network's decision process is hidden from the user as the network optimises millions of parameters with no direct physical meaning. This prevents a clear understanding of the potential limitations and biases of the analysis, making it hard to rely on as a main analysis method. In this work, we explore the behaviour of such a convolutional neural network through a novel method. Instead of trying to analyse a network a posteriori, i.e. after training has been completed, we study the impact on the constraining power of training the network and predicting parameters with degraded data where we removed part of the information. This allows us to gain an understanding of which parts and features of a large-scale structure survey are most important in the network's prediction process. We find that the network's prediction process relies on a mix of both Gaussian and non-Gaussian information, and seems to put an emphasis on structures whose scales are at the limit between linear and non-linear regimes.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces a method to interpret convolutional neural networks for cosmological parameter inference from large-scale structure surveys. Rather than post-hoc analysis, the authors retrain networks on deliberately degraded data with targeted removal of Gaussian versus non-Gaussian information or specific scale ranges, then measure changes in constraining power to infer which features the network relies upon. The headline result is that the network draws on a combination of both Gaussian and non-Gaussian information while emphasizing scales near the linear-to-nonlinear transition.
Significance. If the degradation protocol can be shown to isolate feature dependence without confounding changes to training dynamics or data statistics, the approach would supply a practical, forward-modeling route to interpretability that is directly relevant to ongoing and future LSS analyses. The method is original in its emphasis on retraining rather than post-training attribution and could help address the black-box concern that currently limits adoption of DL methods as primary analysis tools.
major comments (2)
- [Methods (degradation protocol)] The central claim that performance degradation after targeted information removal directly reveals the network's learned reliance on Gaussian/non-Gaussian content or specific scales rests on the untested assumption that the degradation operator leaves the remaining data statistics and optimization landscape unchanged except for the excised component. No quantitative controls (power-spectrum matching, preservation of higher-order statistics, or ablation studies on the degradation operator itself) are described that would rule out compensatory training dynamics or induced artifacts.
- [Abstract and Results] The abstract and provided description supply no quantitative results, error bars, or implementation details on how data degradation is performed or on the magnitude of the reported performance changes. Without these, it is not possible to assess whether the evidence supports the stated conclusion that the network 'relies on a mix' or 'puts an emphasis' on particular scales.
minor comments (2)
- [Methods] Notation for the degradation operators and the precise definition of 'Gaussian' versus 'non-Gaussian' information should be introduced explicitly with equations or pseudocode.
- [Introduction / Data] The manuscript would benefit from a clear statement of the cosmological parameters being inferred and the survey specifications (volume, redshift range, noise model) used in the training sets.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive report. We address each major comment below and commit to revisions that strengthen the presentation of our degradation protocol and the quantitative support for our claims.
read point-by-point responses
-
Referee: [Methods (degradation protocol)] The central claim that performance degradation after targeted information removal directly reveals the network's learned reliance on Gaussian/non-Gaussian content or specific scales rests on the untested assumption that the degradation operator leaves the remaining data statistics and optimization landscape unchanged except for the excised component. No quantitative controls (power-spectrum matching, preservation of higher-order statistics, or ablation studies on the degradation operator itself) are described that would rule out compensatory training dynamics or induced artifacts.
Authors: We agree that explicit validation of the degradation operator is necessary to support the interpretability conclusions. The manuscript describes the targeted removal procedures but does not present the requested quantitative controls. We will add these in the revised methods section, including direct comparisons of the power spectrum and selected higher-order statistics before and after degradation, as well as ablation tests on the degradation parameters to check for induced artifacts or changes in training dynamics. revision: yes
-
Referee: [Abstract and Results] The abstract and provided description supply no quantitative results, error bars, or implementation details on how data degradation is performed or on the magnitude of the reported performance changes. Without these, it is not possible to assess whether the evidence supports the stated conclusion that the network 'relies on a mix' or 'puts an emphasis' on particular scales.
Authors: We accept that the current abstract is qualitative and lacks the requested numerical support. We will revise the abstract to report the magnitude of performance changes (e.g., relative increases in parameter uncertainties) when Gaussian or non-Gaussian information is removed, together with error bars obtained from multiple independent realizations. Implementation details of the degradation steps will be summarized concisely in the abstract or moved to a prominent position in the results section. revision: yes
Circularity Check
No circularity: empirical ablation method is independent of its inputs
full rationale
The paper presents an interpretability study that trains CNNs on deliberately degraded survey data (removing Gaussian/non-Gaussian content or specific scales) and measures resulting changes in parameter constraints. This diagnostic is not derived from any fitted parameter that is then re-predicted, nor does it rely on self-definitional equations, uniqueness theorems imported from the authors' prior work, or ansatzes smuggled via self-citation. No load-bearing step reduces the central claim (reliance on mixed Gaussian/non-Gaussian information at linear-to-nonlinear scales) to a tautology or to the degradation operator itself. The approach is self-contained against external benchmarks of network performance on held-out data, yielding a normal non-finding of circularity.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Abadi, M., Agarwal, A., Barham, P., et al. 2016, Tensorflow: Large-scale machine learning on heterogeneous distributed systems, publication Title: arXiv.org
work page 2016
-
[2]
Abbott, T. M. C., Aguena, M., Alarcon, A., et al. 2022, Physical Review D, 105
work page 2022
-
[3]
Aiola, S., Calabrese, E., Maurin, L., et al. 2020, J. Cosmol. Astropart. Phys., 2020, 047
work page 2020
- [4]
- [5]
-
[6]
Bernardeau, F., Waerbeke, L. V ., & Mellier, Y . 1997, Astronomy and Astro- physics, 322, 1
work page 1997
- [7]
-
[8]
Dietrich, J. P. & Hartlap, J. 2010, Monthly Notices of the Royal Astronomical Society, 402, 1049 Article number, page 8 of 11 G. Aymerich et al.: Interpretability of deep-learning methods applied to large-scale structure surveys
work page 2010
-
[9]
2022, Machine Learning and Cosmology, arXiv:2203.08056 [astro-ph, physics:hep-ph, stat]
Dvorkin, C., Mishra-Sharma, S., Nord, B., et al. 2022, Machine Learning and Cosmology, arXiv:2203.08056 [astro-ph, physics:hep-ph, stat]
-
[10]
Fluri, J., Kacprzak, T., Lucchi, A., et al. 2019, Physical Review D, 100
work page 2019
-
[11]
Fluri, J., Kacprzak, T., Lucchi, A., et al. 2022, A full wCDM analysis of KiDS- 1000 weak lensing maps using Deep Learning, publication Title: arXiv.org
work page 2022
-
[12]
Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, Publications of the Astronomical Society of the Pacific, 125, 306
work page 2013
-
[13]
Friedrich, O., Gruen, D., DeRose, J., et al. 2018, Phys. Rev. D, 98, 023508, pub- lisher: American Physical Society
work page 2018
-
[14]
Gong, Z., Halder, A., Barreira, A., Seitz, S., & Friedrich, O. 2023, J. Cosmol. Astropart. Phys., 2023, 040
work page 2023
-
[15]
Gong, Z., Halder, A., Bohrdt, A., Seitz, S., & Gebauer, D. 2024, C3NN: Cosmo- logical Correlator Convolutional Neural Network – an interpretable machine learning tool for cosmological analyses, arXiv:2402.09526 [astro-ph]
- [16]
-
[17]
2015, Deep residual learning for image recognition, publication Title: arXiv.org
He, K., Zhang, X., Ren, S., & Sun, J. 2015, Deep residual learning for image recognition, publication Title: arXiv.org
work page 2015
-
[18]
Heydenreich, S., Linke, L., Burger, P., & Schneider, P. 2023, A&A, 672, A44
work page 2023
-
[19]
Heymans, C., Tröster, T., Asgari, M., et al. 2021, A&A, 646, A140
work page 2021
-
[20]
Hirata, C. M. & Seljak, U. 2004, Phys. Rev. D, 70, 063526, publisher: American Physical Society
work page 2004
-
[21]
Joachimi, B., Mandelbaum, R., Abdalla, F. B., & Bridle, S. L. 2011, A&A, 527, A26
work page 2011
- [22]
-
[23]
Kingma, D. P. & Ba, J. 2017, Adam: A Method for Stochastic Optimization, arXiv:1412.6980 [cs]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[24]
2012, Monthly Notices of the Royal Astronomical Society, 424, 1647
Kirk, D., Rassat, A., Host, O., & Bridle, S. 2012, Monthly Notices of the Royal Astronomical Society, 424, 1647
work page 2012
- [25]
-
[26]
LeCun, Y ., Bottou, L., Bengio, Y ., & Ha, P. 1998
work page 1998
-
[27]
2023, Monthly Notices of the Royal Astronomical Society, 521, 2050
Lu, T., Haiman, Z., & Li, X. 2023, Monthly Notices of the Royal Astronomical Society, 521, 2050
work page 2023
-
[28]
V ., Pontzen, A., Nord, B., & Thiyagalingam, J
Lucie-Smith, L., Peiris, H. V ., Pontzen, A., Nord, B., & Thiyagalingam, J. 2024, Phys. Rev. D, 109, 063524
work page 2024
- [29]
- [30]
-
[31]
Piras, D. & Lombriser, L. 2024, Phys. Rev. D, 110, 023514, arXiv:2310.10717 [astro-ph] Planck Collaboration, Ade, P. A. R., Aghanim, N., et al. 2014, A&A, 571, A16 Planck Collaboration, Aghanim, N., Akrami, Y ., et al. 2020, A&A, 641, A6
-
[32]
Porredon, A., Crocce, M., Elvin-Poole, J., et al. 2022, Phys. Rev. D, 106, 103530
work page 2022
- [33]
-
[34]
Ravanbakhsh, S., Oliva, J., Fromenteau, S., et al. 2016, in Proceedings of the 33rd International Conference on International Conference on Machine Learning - V olume 48, ICML’16 (New York, NY , USA: JMLR.org), 2407– 2416
work page 2016
- [35]
-
[36]
Riess, A. G., Yuan, W., Macri, L. M., et al. 2022, ApJL, 934, L7, arXiv:2112.04510 [astro-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[37]
Samek, W., Montavon, G., Lapuschkin, S., Anders, C. J., & Müller, K.-R. 2021, Proceedings of the IEEE, 109, 247, conference Name: Proceedings of the IEEE
work page 2021
-
[38]
Seetharaman, P., Wichern, G., Pardo, B., & Roux, J. L. 2020, AutoClip: Adap- tive gradient clipping for Source Separation Networks, publication Title: arXiv.org
work page 2020
-
[39]
Sgier, R., Réfrégier, A., Amara, A., & Nicola, A. 2019, J. Cosmol. Astropart. Phys., 2019, 044
work page 2019
-
[40]
Shannon, C. E. 1948, Bell System Technical Journal, 27, 379
work page 1948
-
[41]
2007, IEEE Transactions on Image Pro- cessing, 16, 297
Starck, J.-L., Fadili, J., & Murtagh, F. 2007, IEEE Transactions on Image Pro- cessing, 16, 297
work page 2007
-
[42]
Villaescusa-Navarro, F., Wandelt, B. D., Anglés-Alcázar, D., et al. 2022, ApJ, 928, 44
work page 2022
-
[43]
Villanueva-Domingo, P. & Villaescusa-Navarro, F. 2021, ApJ, 907, 44 Zürcher, D., Fluri, J., Sgier, R., et al. 2022, Monthly Notices of the Royal Astro- nomical Society, 511, 2075 Article number, page 9 of 11 A&A proofs: manuscript no. aanda Appendix A: Results including intrinsic alignment In this appendix, we present the results obtained when mod- elling...
work page 2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.