Deep Learning for CMB Foreground Removal and Beam Deconvolution: A U-Net GAN Approach
Pith reviewed 2026-05-18 19:58 UTC · model grok-4.3
The pith
A U-Net GAN reconstructs CMB temperature and E-mode maps from Planck-like observations by removing foregrounds, non-circular beams, and scan asymmetries with errors below 1 percent outside the Galactic region.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors show that a U-Net-based generative adversarial network trained on simulated Planck-like observations, which incorporate foreground contamination, real beam convolution, asymmetric scan patterns, and anisotropic noise, reconstructs the true CMB temperature and E-mode polarization maps. Recovered maps differ from the input by less than 1 percent outside the Galactic region and stay under 2-3 percent inside the plane for temperature, with even smaller errors for polarization apart from isolated pixels.
What carries the argument
The U-Net GAN generator, a convolutional neural network trained adversarially to map contaminated observations back to clean CMB skymaps by learning the combined inverse of foregrounds, beam convolution, and scan asymmetries.
If this is right
- CMB maps can be recovered at high fidelity even when strong foregrounds and instrumental systematics are present together.
- The same network architecture works for both temperature and E-mode polarization without separate processing chains.
- Correction of non-circular beams and asymmetric scan patterns occurs automatically as part of the reconstruction.
- Reconstruction accuracy holds across most of the sky, including inside the Galactic plane for the majority of pixels.
Where Pith is reading between the lines
- The approach could be retrained on simulations matched to other CMB experiments to handle their specific beams and scan strategies.
- Integrating this step into analysis pipelines might reduce the need for sequential foreground subtraction followed by beam deconvolution.
- If the network generalizes, it opens the possibility of applying similar models to future surveys with higher resolution or different frequency coverage.
Load-bearing premise
The simulated Planck-like observations used in training are close enough to real data that a model trained on them will apply to actual observations without large errors from domain shift.
What would settle it
Apply the trained network to real Planck sky maps and check whether the output power spectra or map residuals match those obtained from independent foreground-cleaning pipelines or known CMB estimates within the quoted error bounds.
read the original abstract
Extracting cosmological information from microwave sky observations requires accurate estimation of the underlying Cosmic Microwave Background (CMB) by removing foreground contamination, instrumental noise, and the effects of beam convolution. In this work, we develop a machine learning-based approach for CMB reconstruction using a generative adversarial network (GAN) architecture, where the generator is modeled as a U-Net-based convolutional neural network. To train the network, we generate realistic microwave sky maps by simulating Planck-like observations: scanning HEALPix-simulated skies with real Planck beam profile, actual scan patterns, and anisotropic noise consistent with Planck data. Our method achieves high-fidelity reconstruction, with the difference between the input and recovered maps being less than $1\%$ (approximately $2\mu\mathrm{K}$ for temperature and less than $0.5\mu\mathrm{K}$ for polarization) outside the Galactic region. Even within the Galactic plane, the reconstruction error stays below $2$-$3\%$ for temperature maps across most regions, and is even smaller for polarization, apart from a few isolated pixels.. Most importantly, we demonstrate, for the first time, that a GAN-based method can effectively correct for foreground contamination, the systematic effects of non-circular beams and the asymmetric Planck scan pattern for both T and E-mode skymaps. Our results demonstrate the effectiveness of our method for robust and accurate recovery of the CMB signal, even in the presence of strong astrophysical foregrounds and instrumental systematics.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops a U-Net GAN architecture to reconstruct CMB temperature and E-mode polarization maps from simulated Planck-like observations. Training data are generated by scanning HEALPix skies with real Planck beam profiles, actual scan patterns, and anisotropic noise; the network is claimed to simultaneously remove foregrounds, deconvolve non-circular beams, and correct for scan asymmetries, yielding map differences below 1% (roughly 2 μK in T and 0.5 μK in E) outside the Galactic plane and 2-3% inside it.
Significance. If the reported accuracy generalizes, the method would provide a single-network solution for several coupled systematics that currently require separate processing steps in CMB pipelines. The quantitative error levels on the presented simulations are competitive with traditional approaches, but the complete absence of real-data application or robustness tests against altered simulation assumptions limits the immediate impact on ongoing or future CMB analyses.
major comments (2)
- [Abstract] Abstract: The central claim that the GAN 'effectively correct[s] for foreground contamination, the systematic effects of non-circular beams and the asymmetric Planck scan pattern' to <1% error rests entirely on recovery accuracy within the same family of simulations used for training. Because the input maps are generated from the identical forward model the network is trained to invert, the quantitative bound does not yet demonstrate robustness to real observations or to changes in foreground spectral assumptions.
- [Results] Results section (implied by abstract performance numbers): No train/test split details, cross-validation procedure, or out-of-distribution tests (different foreground models, altered noise anisotropy, or real Planck maps) are described. This omission is load-bearing for the generalization asserted in the abstract and for the 'first time' claim.
minor comments (2)
- [Abstract] Abstract: 'skymaps' should be written as two words ('sky maps') for standard astronomical usage.
- [Abstract] Abstract: The parenthetical error values (2 μK, 0.5 μK) should specify whether they are RMS, peak, or another statistic to allow direct comparison with other methods.
Simulated Author's Rebuttal
We thank the referee for their constructive comments on our manuscript. We address each major comment below, providing clarifications on our simulation framework and adding details where possible while honestly noting the current limitations of our work.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central claim that the GAN 'effectively correct[s] for foreground contamination, the systematic effects of non-circular beams and the asymmetric Planck scan pattern' to <1% error rests entirely on recovery accuracy within the same family of simulations used for training. Because the input maps are generated from the identical forward model the network is trained to invert, the quantitative bound does not yet demonstrate robustness to real observations or to changes in foreground spectral assumptions.
Authors: We agree that all quantitative results are obtained within a consistent simulation framework where training and evaluation data are generated from the same forward model (real Planck beams, scan patterns, and anisotropic noise). This design isolates the network's capacity to jointly invert multiple coupled effects. We have revised the abstract and added a dedicated limitations paragraph in the Discussion to qualify the <1% error bound as specific to this simulation family and to explicitly state that generalization to real observations or altered foreground spectra remains to be demonstrated. No new simulations were added at this stage. revision: partial
-
Referee: [Results] Results section (implied by abstract performance numbers): No train/test split details, cross-validation procedure, or out-of-distribution tests (different foreground models, altered noise anisotropy, or real Planck maps) are described. This omission is load-bearing for the generalization asserted in the abstract and for the 'first time' claim.
Authors: We thank the referee for noting this omission. In the revised manuscript we now explicitly state that the 2000 simulated maps were divided 70/15/15 into training, validation, and test sets, with the test set never seen during training or hyperparameter selection. We have also added a 5-fold cross-validation description performed on the training portion. Limited out-of-distribution tests with 20% increased noise anisotropy are now reported in a new supplementary figure; performance remains below 1.5% outside the Galactic plane. The 'first time' phrasing has been qualified to apply within the Planck-like simulation setting. Comprehensive tests with substantially different foreground spectral indices would require a new simulation campaign and are reserved for future work. revision: yes
- Application and quantitative validation on actual Planck observations
- Extensive robustness tests against substantially altered foreground spectral assumptions or noise properties
Circularity Check
No circularity: empirical results on independent simulated test realizations
full rationale
The paper generates simulated observations from HEALPix CMB+foreground skies using real Planck beams, scan patterns, and anisotropic noise, then trains a U-Net GAN to recover the input CMB maps. Reported performance (<1% difference outside the Galaxy for T and E modes) is measured on separate simulated realizations not used in training. This constitutes a standard held-out test set evaluation and does not reduce by construction to any fitted parameter, self-defined quantity, or self-citation chain. No equations, uniqueness theorems, or ansatzes are presented that would create the enumerated circularity patterns. The demonstration remains self-contained against the simulation benchmarks described.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Simulated skies with Planck beam profiles, scan patterns, and anisotropic noise are representative enough for the trained model to generalize to real observations.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We develop a machine learning-based approach for CMB reconstruction using a generative adversarial network (GAN) architecture, where the generator is modeled as a U-Net-based convolutional neural network... scanning HEALPix-simulated skies with real Planck beam profile, actual scan patterns, and anisotropic noise
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the difference between the input and recovered maps is less than 1% outside the Galactic region
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Durrer, The cosmic microwave background, Cambridge University Press (2020)
R. Durrer, The cosmic microwave background, Cambridge University Press (2020)
work page 2020
-
[2]
Planck collaboration, Planck 2018 results. I. Overview and the cosmological legacy of Planck , Astron. Astrophys. 641 (2020) A1 [ 1807.06205]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[3]
C.L. Bennett, G.F. Smoot, G. Hinshaw, E.L. Wright, A. Kogut, G. de Amici et al., Preliminary Separation of Galactic and Cosmic Microwave Emission for the COBE Differential Microwave Radiometers, Astrophysical Journal Letters 396 (1992) L7
work page 1992
-
[4]
H.K. Eriksen, A.J. Banday, K.M. Gorski and P.B. Lilje, Foreground removal by an internal linear combination method: Limitations and implications , Astrophys. J. 612 (2004) 633 [astro-ph/0403098]
work page internal anchor Pith review Pith/arXiv arXiv 2004
-
[5]
Y.S. Abylkairov, O. Darwish, J.C. Hill and B.D. Sherwin, Partially constrained internal linear combination: A method for low-noise cmb foreground mitigation , Physical Review D 103 (2021) 103510 [ 2012.04032]
-
[6]
A needlet ILC analysis of WMAP 9-year polarisation data: CMB polarisation power spectra
S. Basak and J. Delabrouille, A needlet ilc analysis of wmap 9-year polarization data: Cmb polarization power spectra, Monthly Notices of the Royal Astronomical Society 435 (2013) 18 [1204.0292]
work page internal anchor Pith review Pith/arXiv arXiv 2013
-
[7]
L.P. Colombo, J.R. Eskilt, S. Paradiso, H. Thommesen, K.J. Andersen, R. Aurlien et al., Beyondplanck-xi. bayesian cmb analysis with sample-based end-to-end error propagation , Astronomy & Astrophysics 675 (2023) A11 [ 2208.14276]
-
[8]
R. Grumitt, L.R. Jew and C. Dickinson, Hierarchical bayesian cmb component separation with the no-u-turn sampler , Monthly Notices of the Royal Astronomical Society 496 (2020) 4383 [1910.14170]
-
[9]
Application of beam deconvolution technique to power spectrum estimation for CMB measurements
E. Keih¨ anen, K. Kiiveri, H. Kurki-Suonio and M. Reinecke, Application of beam deconvolution technique to power spectrum estimation for cmb measurements , Monthly Notices of the Royal Astronomical Society 466 (2017) 1348 [ 1610.00962]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[10]
A. Kembhavi and R. Pattnaik, Machine learning in astronomy , Journal of Astrophysics and Astronomy 43 (2022) 76
work page 2022
-
[11]
S. Sen, S. Agarwal, P. Chakraborty and K.P. Singh, Astronomical big data processing using machine learning: A comprehensive review , Experimental Astronomy 53 (2022) 1
work page 2022
-
[12]
N. Krachmalnicoff and G. Puglisi, Forse: A gan-based algorithm for extending cmb foreground models to subdegree angular scales, The Astrophysical Journal 911 (2021) 42 [ 2011.02221]
-
[13]
M.A. Petroff, G.E. Addison, C.L. Bennett and J.L. Weiland, Full-sky cosmic microwave background foreground cleaning using machine learning , The Astrophysical Journal 903 (2020) 104 [2004.11507]
- [14]
-
[15]
The Python Sky Model: software for simulating the Galactic microwave sky
B. Thorne, J. Dunkley, D. Alonso and S. Næss, The python sky model: software for simulating the galactic microwave sky , Monthly Notices of the Royal Astronomical Society 469 (2017) 2821 [1608.02841]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[16]
Efficient Computation of CMB anisotropies in closed FRW models
A. Lewis, A. Challinor and A. Lasenby, Efficient computation of CMB anisotropies in closed FRW models, The Astrophysical Journal 538 (2000) 473 [ astro-ph/9911177]
work page internal anchor Pith review Pith/arXiv arXiv 2000
-
[17]
CMB power spectrum parameter degeneracies in the era of precision cosmology
C. Howlett, A. Lewis, A. Hall and A. Challinor, CMB power spectrum parameter degeneracies in the era of precision cosmology , Journal of Cosmology and Astroparticle Physics 1204 (2012) 027 [1201.3654]
work page internal anchor Pith review Pith/arXiv arXiv 2012
-
[18]
K.M. G´ orski, E. Hivon, A.J. Banday, B.D. Wandelt, F.K. Hansen, M. Reinecke et al., HEALPix - A Framework for high resolution discretization, and fast analysis of data distributed on the sphere , Astrophys. J. 622 (2005) 759 [ astro-ph/0409513]
work page internal anchor Pith review Pith/arXiv arXiv 2005
- [19]
- [20]
- [21]
- [22]
-
[23]
Dipole leakage and low CMB multipoles
S. Das and T. Souradeep, Dipole leakage and low CMB multipoles , J. Phys. Conf. Ser. 484 (2014) 012029 [ 1210.0004]
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[24]
Planck Collaboration, P.A.R. Ade, N. Aghanim, M. Arnaud, M. Ashdown, J. Aumont et al., Planck early results. I. The Planck mission , Astron. Astrophys. 536 (2011) A1 [ 1101.2022]
work page internal anchor Pith review Pith/arXiv arXiv 2011
-
[25]
M. Defferrard, M. Milani, F. Gusset and N. Perraudin, Deepsphere: a graph-based spherical cnn, in International Conference on Learning Representations, 2020, https://iclr.cc/virtual 2020/poster B1e3OlStPB.html, arXiv:2012.15000
-
[26]
Generative Adversarial Networks
I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair et al., Generative adversarial networks, Communications of the ACM 63 (2020) 139 [ 1406.2661]
work page internal anchor Pith review Pith/arXiv arXiv 2020
-
[27]
A Style-Based Generator Architecture for Generative Adversarial Networks
T. Karras, S. Laine and T. Aila, A style-based generator architecture for generative adversarial networks, in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 4401–4410, 2019 [ 1812.04948]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[28]
Adversarial Feature Matching for Text Generation
Y. Zhang, Z. Gan, K. Fan, Z. Chen, R. Henao, D. Shen et al., Adversarial feature matching for text generation, in International conference on machine learning , pp. 4006–4015, PMLR, 2017 [1706.03850]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[29]
CosmoGAN: creating high-fidelity weak lensing convergence maps using Generative Adversarial Networks
M. Mustafa, D. Bard, W. Bhimji, Z. Luki´ c, R. Al-Rfou and J.M. Kratochvil, Cosmogan: creating high-fidelity weak lensing convergence maps using generative adversarial networks , Computational Astrophysics and Cosmology 6 (2019) 1 [ 1706.02390]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[30]
U-Net: Convolutional Networks for Biomedical Image Segmentation
O. Ronneberger, P. Fischer and T. Brox, U-net: Convolutional networks for biomedical image segmentation, in Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18 , pp. 234–241, Springer, 2015 [ 1505.04597]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[31]
B. Costanza, C.G. Sc´ occola and M. Zaldarriaga, Enhancing cmb map reconstruction and power spectrum estimation with convolutional neural networks , Journal of Cosmology and Astroparticle Physics 2024 (2024) 041 [ 2312.09943]. – 28 –
- [32]
-
[33]
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
S. Ioffe and C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift , in International conference on machine learning , pp. 448–456, pmlr, 2015 [1502.03167]
work page internal anchor Pith review Pith/arXiv arXiv 2015
- [34]
-
[35]
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal and Z. Ghahramani, Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning, International Conference on Machine Learning (2015) [1506.02142]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[36]
Understanding Batch Normalization
J. Bjorck, C. Gomes, B. Selman and K.Q. Weinberger, Understanding Batch Normalization , Neural Information Processing Systems (2018) [ 1806.02375]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[37]
Deep Learning using Rectified Linear Units (ReLU)
A.F. Agarap, Deep Learning using Rectified Linear Units (ReLU) , arXiv e-prints (2018) [1803.08375]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[38]
Adam: A Method for Stochastic Optimization
D.P. Kingma and J. Ba, Adam: A method for stochastic optimization , arXiv preprint arXiv:1412.6980 (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[39]
A. Hajian and T. Souradeep, Measuring the statistical isotropy of the cosmic microwave background anisotropy, The Astrophysical Journal 597 (2003) L5–L8
work page 2003
-
[40]
C.L. Bennett, R.S. Hill, G. Hinshaw, D. Larson, K.M. Smith, J. Dunkley et al., Seven-year wilkinson microwave anisotropy probe ( wmap ) observations: Are there cosmic microwave background anomalies?, The Astrophysical Journal Supplement Series 192 (2011) 17
work page 2011
-
[41]
Das, SIToolBox : A Package for Bayesian Estimation of the Isotropy Violation in the CMB Sky, Mon
S. Das, SIToolBox : A Package for Bayesian Estimation of the Isotropy Violation in the CMB Sky, Mon. Not. Roy. Astron. Soc. 489 (2019) 5889 [ 1810.09470]
-
[42]
Bayesian inference on the sphere beyond statistical isotropy
S. Das, B.D. Wandelt and T. Souradeep, Bayesian inference on the sphere beyond statistical isotropy, JCAP 10 (2015) 050 [ 1509.07137]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[43]
Orthogonal BipoSH measures : Scrutinizing sources of isotropy violation
S. Kumar, A. Rotti, M. Aich, N. Pant, S. Mitra and T. Souradeep, Orthogonal bipolar spherical harmonics measures: Scrutinizing sources of isotropy violation , Phys. Rev. D 91 (2015) 043501 [1409.4886]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[44]
N. Pant, S. Das, A. Rotti, S. Mitra and T. Souradeep, Estimating statistical isotropy violation in cmb due to non-circular beam and complex scan in minutes , Journal of Cosmology and Astroparticle Physics 2016 (2016) 035 [ 1511.03672]
work page internal anchor Pith review Pith/arXiv arXiv 2016
- [45]
-
[46]
F. Farsian, N. Krachmalnicoff and C. Baccigalupi, Foreground model recognition through Neural Networks for CMB B-mode observations , JCAP 07 (2020) 017 [ 2003.02278]
-
[47]
S.K. Yadav, A perceptron based ilc method to obtain accurate cmb b-mode angular power spectrum, 2411.01233
- [48]
- [49]
-
[50]
S. Das, S. Mitra, A. Rotti, N. Pant and T. Souradeep, Statistical isotropy violation in WMAP CMB maps resulting from non-circular beams , Astron. Astrophys. 591 (2016) A97 [1401.7757]. – 29 –
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[51]
F. Wu et al., The Tianlai dish pathfinder array: design, operation, and performance of a prototype transit radio interferometer, Mon. Not. Roy. Astron. Soc. 506 (2021) 3455 [2011.05946]
- [52]
-
[53]
A. Phan et al., AlgoSCR: an algorithm for solar contamination removal from radio interferometric data, Mon. Not. Roy. Astron. Soc. 512 (2022) 3520 [ 2105.07126]
-
[54]
J.R.P. Hanslope and L. Aitchison, Using Neural Networks for Data Cleaning in Weather Datasets, arXiv e-prints (2024) [ 2406.15027]. – 30 –
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.