The Deep Learning-Based Dual-Branch Multimodal Fusion Model for Solar Flare Prediction
Pith reviewed 2026-05-19 22:59 UTC · model grok-4.3
The pith
Dual-branch fusion model predicts strong solar flares with TSS of 0.78 for X-class events
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The dual-branch multimodal fusion model integrates magnetograms and magnetic parameters through cross-attention mechanisms, followed by cross-scale interactions at the feature level to enhance multi-scale representation. It achieves a TSS of 0.661 and an HSS of 0.658 for binary ≥C-class prediction, while notably attaining a TSS of 0.780 and an HSS of 0.775 for X-class flares in the multi-class task, demonstrating superior performance in predicting intense X-class flares, effectively suppressing the false alarm rate, and exhibiting strong generalization capability.
What carries the argument
Dual-branch architecture using cross-attention mechanisms to fuse magnetogram images with magnetic parameters, plus cross-scale feature interactions for multi-scale representation.
If this is right
- The model provides more reliable forecasts for intense solar flares that can disrupt satellites and power systems.
- Lower false alarm rates reduce unnecessary preparations by operators.
- The approach demonstrates better ability to generalize to previously unseen solar active regions.
- Multi-class output allows for more nuanced space weather alerts tailored to flare strength.
Where Pith is reading between the lines
- Similar multimodal fusion techniques might enhance predictions for other solar phenomena such as coronal mass ejections.
- The success suggests that future models could benefit from incorporating even more data modalities like EUV images or time sequences.
- Operational deployment could support more precise real-time alerts for space weather impacts on technology.
Load-bearing premise
The splitting-before-sampling strategy based on NOAA active region numbers together with the stratified group five-fold cross-validation scheme fully prevents data leakage and ensures genuine generalization to unseen active regions.
What would settle it
If a test on randomly split data without respecting active region boundaries shows substantially lower TSS and HSS scores for X-class flares, it would suggest that the original results benefited from data leakage.
Figures
read the original abstract
Solar flares are intense eruptive events caused by the rapid release of magnetic energy, often impacting Earth's space environment through electromagnetic radiation and high-energy particles. Accurate flare prediction is critical for space weather forecasting. However, many existing deep learning approaches often rely on single-modal inputs or shallow feature fusion, limiting their ability to capture complementary information. In this study, we propose a dual-branch multimodal fusion deep learning model for predicting 24-hour solar flares. The model integrates magnetograms and magnetic parameters through cross-attention mechanisms, followed by cross-scale interactions at the feature level to enhance multi-scale representation. It is designed to perform both binary prediction of $\geqslant$ C-class flares and multi-class classification of C, M, and X-class flares. To ensure rigorous evaluation, we employ a stratified group five-fold cross-validation scheme to preserve class representativeness and adopt a splitting-before-sampling strategy based on NOAA active region numbers to prevent data leakage. Experimental results show that the model achieves a TSS of 0.661 and an HSS of 0.658 for binary $\geqslant$ C-class prediction, while notably attaining a TSS of 0.780 and an HSS of 0.775 for X-class flares in the multi-class task. Compared with existing approaches, the model demonstrates superior performance in predicting intense X-class flares, effectively suppresses the false alarm rate, and exhibits strong generalization capability.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a dual-branch multimodal deep learning model that fuses solar magnetograms and magnetic parameters via cross-attention mechanisms and cross-scale feature interactions for 24-hour ahead prediction of solar flares. It performs both binary classification of ≥C-class events and multi-class classification into C/M/X flares, reporting TSS = 0.661 and HSS = 0.658 on the binary task and notably higher skill (TSS = 0.780, HSS = 0.775) for X-class events. Evaluation relies on a stratified group 5-fold cross-validation scheme combined with a splitting-before-sampling strategy keyed to NOAA active-region numbers, which the authors state prevents data leakage and supports claims of superior performance and strong generalization relative to prior approaches.
Significance. If the reported metrics are shown to reflect genuine cross-AR generalization rather than residual intra-region correlations, the work would constitute a useful incremental advance in multimodal deep-learning methods for space-weather forecasting. The emphasis on X-class skill and false-alarm suppression addresses a practically important regime, and the adoption of group CV is a constructive step toward more rigorous evaluation standards in the field.
major comments (1)
- [Evaluation protocol] Evaluation protocol (abstract and §3): The claim that the stratified group 5-fold CV with NOAA-AR splitting-before-sampling fully prevents leakage is load-bearing for the generalization and superiority assertions. The manuscript does not specify (i) whether every sample belonging to a given AR is forced into a single fold, (ii) how overlapping 24-hour prediction windows within the same AR are assigned, or (iii) whether temporal autocorrelation of magnetic evolution across consecutive samples is mitigated. Without these details the reported TSS/HSS values cannot be confidently interpreted as cross-region generalization.
minor comments (2)
- [Figures] Figure captions and architecture diagrams should explicitly label the cross-attention blocks and cross-scale interaction modules so that the dual-branch fusion pathway is immediately traceable.
- [Results] The abstract states performance numbers but does not report the number of positive/negative samples per fold or the class-imbalance handling strategy; these statistics belong in the results section or a supplementary table.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our evaluation protocol and for recognizing the potential value of our multimodal approach for X-class flare prediction. We agree that greater specificity is required to support the no-leakage claim and will revise the manuscript to address each sub-point raised.
read point-by-point responses
-
Referee: whether every sample belonging to a given AR is forced into a single fold
Authors: Yes. Our stratified group 5-fold procedure first partitions the set of unique NOAA active-region numbers into five disjoint groups. All magnetogram and magnetic-parameter samples associated with any given AR number are then assigned exclusively to the fold corresponding to that group. This group-level assignment occurs before any instance-level sampling. We will add an explicit statement of this rule in the revised §3. revision: yes
-
Referee: how overlapping 24-hour prediction windows within the same AR are assigned
Authors: Because the split is performed at the AR-group level prior to sampling, every 24-hour window (including temporally overlapping windows) that belongs to the same active region is automatically placed in the identical fold. Consequently, no training–test leakage can arise from overlapping prediction intervals that share the same AR. We will clarify this assignment rule in the methods section of the revision. revision: yes
-
Referee: whether temporal autocorrelation of magnetic evolution across consecutive samples is mitigated
Authors: The protocol prevents cross-AR leakage but does not impose additional temporal gaps or decorrelation steps within an AR; consecutive samples from the same region therefore remain together in one fold. We acknowledge that residual intra-AR temporal autocorrelation may still be present. In the revised manuscript we will explicitly discuss this limitation and its implications for the interpretation of within-fold performance versus cross-region generalization. revision: partial
Circularity Check
No significant circularity; performance metrics are empirical evaluations on held-out data
full rationale
The paper's central claims consist of reported TSS/HSS scores obtained via stratified group 5-fold cross-validation after splitting samples by NOAA active region numbers. These metrics are computed directly from model predictions on test folds that are constructed to be disjoint from training data; they do not reduce algebraically to any fitted parameter, self-citation, or ansatz inside the model equations. The dual-branch fusion architecture, cross-attention mechanism, and training procedure are defined independently of the final numerical scores, so the generalization statements rest on external observational benchmarks rather than internal redefinition.
Axiom & Free-Parameter Ledger
free parameters (1)
- neural network weights and hyperparameters
axioms (1)
- domain assumption Magnetograms and magnetic parameters contain complementary information that improves flare prediction when fused via cross-attention
Reference graph
Works this paper leans on
-
[1]
W., Qahwaji, R., Colak, T., et al
Ahmed, O. W., Qahwaji, R., Colak, T., et al. 2013, Solar Physics, 283, 157
work page 2013
-
[2]
Barnes, G., Leka, K., Schumer, E., & Della-Rose, D. 2007, Space Weather, 5
work page 2007
-
[3]
2016, The Astrophysical Journal, 829, 89
Barnes, G., Leka, K., Schrijver, C., et al. 2016, The Astrophysical Journal, 829, 89
work page 2016
-
[4]
Gallagher, P. T. 2012, The Astrophysical Journal Letters, 747, L41
work page 2012
-
[5]
Bobra, M. G., & Couvidat, S. 2015, The Astrophysical Journal, 798, 135
work page 2015
-
[6]
Bobra, M. G., Sun, X., Hoeksema, J. T., et al. 2014, Solar Physics, 289, 3549
work page 2014
-
[7]
E., Al-Ghraibah, A., & McAteer, R
Boucheron, L. E., Al-Ghraibah, A., & McAteer, R. J. 2015, The Astrophysical Journal, 812, 51
work page 2015
-
[8]
2021, in Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)
Chen, C.-F., Fan, Q., & Panda, R. 2021, in Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)
work page 2021
-
[9]
2025, Multimedia Tools and Applications, 1
Chen, R., Gao, C., Tan, Z., et al. 2025, Multimedia Tools and Applications, 1
work page 2025
-
[10]
Chen, Y., Manchester, W. B., Hero, A. O., et al. 2019, Space weather, 17, 1404
work page 2019
-
[11]
2021, The Astrophysical Journal, 915, 38
Cicogna, D., Berrilli, F., Calchetti, D., et al. 2021, The Astrophysical Journal, 915, 38
work page 2021
-
[12]
2022, The Astrophysical Journal Supplement Series, 260, 9
Deshmukh, V., Flyer, N., Van der Sande, K., & Berger, T. 2022, The Astrophysical Journal Supplement Series, 260, 9
work page 2022
-
[13]
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. 2019, in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (Association for Computational Linguistics), 4171–4186
work page 2019
-
[14]
2021, in International Conference on Learning Representations
Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al. 2021, in International Conference on Learning Representations. https://openreview.net/forum?id=YicbFdNTTy
work page 2021
-
[15]
Falconer, D., Barghouty, A. F., Khazanov, I., & Moore, R. 2011, Space Weather, 9
work page 2011
-
[16]
Florios, K., Kontogiannis, I., Park, S.-H., et al. 2018, Solar Physics, 293, 28
work page 2018
-
[17]
Giovanelli, R. G. 1939, Astrophysical Journal, vol. 89, p. 555, 89, 555
work page 1939
-
[18]
D., Royer, A., Blankevoort, T., & Bejnordi, B
Havtorn, J. D., Royer, A., Blankevoort, T., & Bejnordi, B. E. 2023, in Proceedings of the IEEE/CVF International Conference on Computer Vision, 838–848
work page 2023
-
[19]
2018, The Astrophysical Journal, 856, 7
Huang, X., Wang, H., Xu, L., et al. 2018, The Astrophysical Journal, 856, 7
work page 2018
-
[20]
Huang, X., Yu, D., Hu, Q., Wang, H., & Cui, Y. 2010, Solar Physics, 263, 175
work page 2010
-
[21]
2020, Space weather, 18, e2020SW002440
Jiao, Z., Sun, H., Wang, X., et al. 2020, Space weather, 18, e2020SW002440
work page 2020
-
[22]
Kaneda, K., Wada, Y., Iida, T., et al. 2022, in Proceedings of the Asian Conference on Computer Vision, 1488–1503 Kors´ os, M., Ludmany, A., Erdelyi, R., & Baranyi, T. 2015, The Astrophysical Journal Letters, 802, L21
work page 2022
-
[23]
Lee, K., Moon, Y.-J., Lee, J.-Y., Lee, K.-S., & Na, H. 2012, Solar Physics, 281, 639
work page 2012
-
[24]
Lee, Y., Kim, J., Willette, J., & Hwang, S. J. 2022, in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 7287–7296
work page 2022
-
[25]
2003, The Astrophysical Journal, 595, 1296
Leka, K., & Barnes, G. 2003, The Astrophysical Journal, 595, 1296
work page 2003
-
[26]
2019, The Astrophysical Journal Supplement Series, 243, 36
Leka, K., Park, S.-H., Kusano, K., et al. 2019, The Astrophysical Journal Supplement Series, 243, 36
work page 2019
-
[27]
2021, Advances in neural information processing systems, 34, 9694
Li, J., Selvaraju, R., Gotmare, A., et al. 2021, Advances in neural information processing systems, 34, 9694
work page 2021
-
[28]
2013, Research in Astronomy and Astrophysics, 13, 1118
Li, R., & Zhu, J. 2013, Research in Astronomy and Astrophysics, 13, 1118
work page 2013
-
[29]
2020, The Astrophysical Journal, 891, 10
Li, X., Zheng, Y., Wang, X., & Wang, L. 2020, The Astrophysical Journal, 891, 10
work page 2020
-
[30]
2024, The Astrophysical Journal Supplement Series, 276, 7
Li, X., Li, X., Zheng, Y., et al. 2024, The Astrophysical Journal Supplement Series, 276, 7
work page 2024
-
[31]
Liu, C., Deng, N., Wang, J. T., & Wang, H. 2017, The Astrophysical Journal, 843, 104
work page 2017
-
[32]
Liu, H., Liu, C., Wang, J. T. L., & Wang, H. 2019, The Astrophysical Journal, 877, 121, doi: 10.3847/1538-4357/ab1b3c
-
[33]
Liu, Y. D., Luhmann, J. G., Kajdiˇ c, P., et al. 2014, Nature Communications, 5, 3481
work page 2014
-
[34]
2019, Advances in neural information processing systems, 32
Lu, J., Batra, D., Parikh, D., & Lee, S. 2019, Advances in neural information processing systems, 32
work page 2019
-
[35]
McCloskey, A. E., Gallagher, P. T., & Bloomfield, D. S. 2018, Journal of Space Weather and Space Climate, 8, A34
work page 2018
- [36]
-
[37]
2021, Earth, Planets and Space, 73, 1
Nishizuka, N., Kubo, Y., Sugiura, K., Den, M., & Ishii, M. 2021, Earth, Planets and Space, 73, 1
work page 2021
-
[38]
2018, The Astrophysical Journal, 858, 113
Nishizuka, N., Sugiura, K., Kubo, Y., Den, M., & Ishii, M. 2018, The Astrophysical Journal, 858, 113
work page 2018
-
[39]
2017, The Astrophysical Journal, 835, 156
Nishizuka, N., Sugiura, K., Kubo, Y., et al. 2017, The Astrophysical Journal, 835, 156
work page 2017
-
[40]
2024, arXiv preprint ARXIV.2406.11054
Pandey, C., Adeyeha, T., Hong, J., Angryk, R., & Aydin, B. 2024, arXiv preprint ARXIV.2406.11054
-
[41]
2018, The Astrophysical Journal, 869, 91
Park, E., Moon, Y.-J., Shin, S., et al. 2018, The Astrophysical Journal, 869, 91
work page 2018
-
[42]
2017, Space Weather, 15, 704 27
Park, J., Moon, Y.-J., Choi, S., et al. 2017, Space Weather, 15, 704 27
work page 2017
-
[43]
Raboonik, A., Safari, H., Alipour, N., & Wheatland, M. S. 2016, The Astrophysical Journal, 834, 11
work page 2016
-
[44]
Ran, H., Liu, Y. D., Guo, Y., & Wang, R. 2022, The Astrophysical Journal, 937, 43
work page 2022
-
[45]
Ribeiro, F., & Gradvohl, A. L. S. 2021, Astronomy and Computing, 35, 100468
work page 2021
-
[46]
Sadykov, V. M., & Kosovichev, A. G. 2017, The Astrophysical Journal, 849, 148
work page 2017
-
[47]
Schou, J., Scherrer, P. H., Bush, R. I., et al. 2012, Solar Physics, 275, 229
work page 2012
-
[48]
Song, H., Tan, C., Jing, J., et al. 2009, Solar Physics, 254, 101
work page 2009
-
[49]
Sun, K., Xiao, B., Liu, D., & Wang, J. 2019, in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 5693–5703
work page 2019
-
[50]
Sun, Z., Bobra, M. G., Wang, X., et al. 2022, The Astrophysical Journal, 931, 163
work page 2022
-
[51]
2021, The Astrophysical Journal Supplement Series, 257, 50
Tang, R., Liao, W., Chen, Z., et al. 2021, The Astrophysical Journal Supplement Series, 257, 50
work page 2021
-
[52]
2017, Advances in neural information processing systems, 30
Vaswani, A., Shazeer, N., Parmar, N., et al. 2017, Advances in neural information processing systems, 30
work page 2017
-
[53]
2019, The Astrophysical Journal, 884, 175
Wang, J., Liu, S., Ao, X., et al. 2019, The Astrophysical Journal, 884, 175
work page 2019
-
[54]
Wang, J., Zhang, Y., Webber, S. A. H., et al. 2020, The Astrophysical Journal, 892, 140
work page 2020
-
[55]
2020, The Astrophysical Journal, 895, 3
Wang, X., Chen, Y., Toth, G., et al. 2020, The Astrophysical Journal, 895, 3
work page 2020
-
[56]
2013, The Astrophysical Journal Letters, 774, L27
Yang, X., Lin, G., Zhang, H., & Mao, X. 2013, The Astrophysical Journal Letters, 774, L27
work page 2013
-
[57]
2023, IEEE transactions on pattern analysis and machine intelligence, 45, 10870
Yao, T., Li, Y., Pan, Y., et al. 2023, IEEE transactions on pattern analysis and machine intelligence, 45, 10870
work page 2023
-
[58]
2021, The Astrophysical Journal, 910, 8
Yi, K., Moon, Y.-J., Lim, D., Park, E., & Lee, H. 2021, The Astrophysical Journal, 910, 8
work page 2021
-
[59]
2009, The Astrophysical Journal, 709, 321
Yu, D., Huang, X., Hu, Q., et al. 2009, The Astrophysical Journal, 709, 321
work page 2009
-
[60]
2021, Monthly Notices of the Royal Astronomical Society, 507, 3519
Zheng, Y., Li, X., Si, Y., Qin, W., & Tian, H. 2021, Monthly Notices of the Royal Astronomical Society, 507, 3519
work page 2021
-
[61]
2019, The Astrophysical Journal, 885, 73
Zheng, Y., Li, X., & Wang, X. 2019, The Astrophysical Journal, 885, 73
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.