SDF2Net: Shallow to Deep Feature Fusion Network for PolSAR Image Classification
Pith reviewed 2026-05-24 03:37 UTC · model grok-4.3
The pith
A three-branch complex-valued CNN fuses shallow and deep features to improve PolSAR land cover classification accuracy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that the Shallow to Deep Feature Fusion Network (SDF2Net), by integrating feature maps from three branches of a complex-valued CNN at progressively deeper stages, produces representations that yield higher overall classification accuracy on PolSAR land-cover tasks than the state-of-the-art methods tested on the AIRSAR Flevoland, AIRSAR San Francisco, and ESAR Oberpfaffenhofen datasets.
What carries the argument
The three-branch shallow-to-deep feature fusion architecture applied to complex-valued PolSAR inputs.
If this is right
- Overall accuracy rises by 1.3 percent and 0.8 percent on the two AIRSAR datasets.
- Overall accuracy rises by 0.5 percent on the ESAR dataset.
- The model reaches 96.01 percent accuracy on Flevoland data with only a 1 percent sampling ratio.
Where Pith is reading between the lines
- The same shallow-to-deep fusion pattern could be tested on other complex-valued modalities such as full-polarimetric radar or interferometric data.
- If the gains hold under stricter cross-validation, the method would lower the labeled-data requirement for operational PolSAR mapping.
- Explicit connections between early and late layers may help capture both fine texture and broader context that single-stream complex CNNs miss.
Load-bearing premise
The three-branch fusion architecture generates feature representations that are superior to those from standard complex-valued CNN baselines on the chosen test sets.
What would settle it
A side-by-side accuracy comparison of SDF2Net against the same baselines on a new PolSAR scene, using the identical one-percent sampling ratio, would show whether the reported gains persist.
Figures
read the original abstract
Polarimetric synthetic aperture radar (PolSAR) images encompass valuable information that can facilitate extensive land cover interpretation and generate diverse output products. Extracting meaningful features from PolSAR data poses challenges distinct from those encountered in optical imagery. Deep learning (DL) methods offer effective solutions for overcoming these challenges in PolSAR feature extraction. Convolutional neural networks (CNNs) play a crucial role in capturing PolSAR image characteristics by leveraging kernel capabilities to consider local information and the complex-valued nature of PolSAR data. In this study, a novel three-branch fusion of complex-valued CNN, named the Shallow to Deep Feature Fusion Network (SDF2Net), is proposed for PolSAR image classification. To validate the performance of the proposed method, classification results are compared against multiple state-of-the-art approaches using the airborne synthetic aperture radar (AIRSAR) datasets of Flevoland and San Francisco, as well as the ESAR Oberpfaffenhofen dataset. The results indicate that the proposed approach demonstrates improvements in overallaccuracy, with a 1.3% and 0.8% enhancement for the AIRSAR datasets and a 0.5% improvement for the ESAR dataset. Analyses conducted on the Flevoland data underscore the effectiveness of the SDF2Net model, revealing a promising overall accuracy of 96.01% even with only a 1% sampling ratio.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes SDF2Net, a three-branch shallow-to-deep feature fusion network using complex-valued CNNs for PolSAR image classification. It reports overall accuracy improvements of 1.3% and 0.8% on the two AIRSAR datasets and 0.5% on the ESAR Oberpfaffenhofen dataset relative to prior SOTA methods, including a 96.01% accuracy at 1% sampling ratio on Flevoland.
Significance. If the reported gains prove robust under repeated trials and proper statistical controls, the fusion architecture could offer a practical advance for handling complex-valued PolSAR inputs in remote-sensing classification tasks. The use of standard public benchmarks and direct SOTA comparisons is a positive aspect of the evaluation design.
major comments (2)
- [Abstract and §5] Abstract and §5 (Experiments): the central performance claim rests on single-run point estimates (e.g., the 1.3% gain on AIRSAR Flevoland and 96.01% at 1% sampling) without reported standard deviations, multiple random seeds, or any statistical significance test against the complex-valued CNN baselines. At 1% sampling, label noise and initialization sensitivity are known to be high, so the observed deltas cannot be distinguished from experimental fluctuation on the basis of the presented evidence.
- [§4 and §5] §4 (Proposed Method) and §5: no ablation isolating the shallow-to-deep fusion component is provided, so it is impossible to determine whether the modest accuracy deltas arise from the three-branch architecture itself or from other unstated differences in training protocol, data augmentation, or hyper-parameter choices.
minor comments (2)
- [§3] Figure captions and §3: the complex-valued convolution and fusion operations would benefit from explicit equations rather than prose descriptions alone.
- [§5] Table captions in §5: clarify whether the listed baselines are re-implemented with the same training protocol or taken from the original papers.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback. We address each major comment below and indicate the revisions planned for the next version of the manuscript.
read point-by-point responses
-
Referee: [Abstract and §5] Abstract and §5 (Experiments): the central performance claim rests on single-run point estimates (e.g., the 1.3% gain on AIRSAR Flevoland and 96.01% at 1% sampling) without reported standard deviations, multiple random seeds, or any statistical significance test against the complex-valued CNN baselines. At 1% sampling, label noise and initialization sensitivity are known to be high, so the observed deltas cannot be distinguished from experimental fluctuation on the basis of the presented evidence.
Authors: We acknowledge that the presented results rely on single-run point estimates without standard deviations or statistical tests. While single-run reporting is prevalent in PolSAR classification literature, the concern about robustness at low sampling ratios is valid. In the revised manuscript we will rerun all experiments with multiple random seeds, report mean and standard deviation values, and add paired statistical significance tests against the complex-valued baselines. revision: yes
-
Referee: [§4 and §5] §4 (Proposed Method) and §5: no ablation isolating the shallow-to-deep fusion component is provided, so it is impossible to determine whether the modest accuracy deltas arise from the three-branch architecture itself or from other unstated differences in training protocol, data augmentation, or hyper-parameter choices.
Authors: We agree that an ablation isolating the shallow-to-deep fusion is necessary to attribute performance gains specifically to the proposed architecture. The revised manuscript will include an ablation study that compares the full three-branch SDF2Net against controlled variants (single-branch and late-fusion baselines) while holding training protocol, augmentation, and hyperparameters fixed. revision: yes
Circularity Check
No circularity: purely empirical architecture comparison on held-out test pixels
full rationale
The paper introduces SDF2Net, a three-branch complex-valued CNN fusion architecture, and evaluates it via overall accuracy on standard PolSAR benchmark scenes (Flevoland, San Francisco, Oberpfaffenhofen) at fixed sampling ratios. No equations, first-principles derivations, or predictions are presented that reduce by construction to fitted hyperparameters, self-citations, or renamed inputs; reported deltas (0.5–1.3 %) are direct empirical measurements against prior methods on the same held-out pixels. The central claim therefore rests on experimental comparison rather than any self-referential loop.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Optimal combination of polarimetric features for vegetation classification in polsar image,
Q. Yin, W. Hong, F. Zhang, and E. Pottier, “Optimal combination of polarimetric features for vegetation classification in polsar image,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 12, no. 10, pp. 3919–3931, 2019
work page 2019
-
[2]
Automatic surface water mapping using polarimetric sar data for long-term change detection,
W. Zhang, B. Hu, and G. S. Brown, “Automatic surface water mapping using polarimetric sar data for long-term change detection,” Water, vol. 12, no. 3, p. 872, 2020
work page 2020
-
[3]
Man-made target detection from polarimetric sar data via nonstationarity and asymmetry,
D. Xiang, T. Tang, Y . Ban, and Y . Su, “Man-made target detection from polarimetric sar data via nonstationarity and asymmetry,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , vol. 9, no. 4, pp. 1459–1469, 2016
work page 2016
-
[4]
Spatial feature-based convolutional neural network for polsar image classification,
R. Shang, J. Wang, L. Jiao, X. Yang, and Y . Li, “Spatial feature-based convolutional neural network for polsar image classification,” Applied Soft Computing, vol. 123, p. 108922, 2022
work page 2022
-
[5]
A new architecture of a complex-valued convolutional neural network for polsar image classification,
Y . Ren, W. Jiang, and Y . Liu, “A new architecture of a complex-valued convolutional neural network for polsar image classification,” Remote Sensing, vol. 15, no. 19, p. 4801, 2023
work page 2023
-
[6]
B. Brisco, M. Mahdianpari, and F. Mohammadimanesh, “Hybrid com- pact polarimetric sar for environmental monitoring with the radarsat constellation mission,” Remote Sensing, vol. 12, no. 20, p. 3283, 2020
work page 2020
-
[7]
Disaster monitoring by fully polarimetric sar data acquired with alos-palsar,
Y . Yamaguchi, “Disaster monitoring by fully polarimetric sar data acquired with alos-palsar,” Proceedings of the IEEE , vol. 100, no. 10, pp. 2851–2860, 2012
work page 2012
-
[8]
Semisupervised classification of polsar image incorporating labels’ semantic priors,
B. Hou, J. Guan, Q. Wu, and L. Jiao, “Semisupervised classification of polsar image incorporating labels’ semantic priors,” IEEE Geoscience and Remote Sensing Letters , vol. 17, no. 10, pp. 1737–1741, 2019
work page 2019
-
[9]
Polarimetric radar technology for european defence superiority-the polrad project,
A. Lupidi, C. Greiff, S. Br ¨uggenwirth, M. Brandfass, and M. Martorella, “Polarimetric radar technology for european defence superiority-the polrad project,” in 2020 21st International Radar Symposium (IRS) . IEEE, 2020, pp. 6–10
work page 2020
-
[10]
D. Mandal and Y . Rao, “Sasya: An integrated framework for crop biophysical parameter retrieval and within-season crop yield prediction with sar remote sensing data,” Remote Sensing Applications: Society and Environment, vol. 20, p. 100366, 2020
work page 2020
-
[11]
Multitemporal polarimetric sar change detection for crop monitoring and crop type classification,
C. Silva-Perez, A. Marino, J. M. Lopez-Sanchez, and I. Cameron, “Multitemporal polarimetric sar change detection for crop monitoring and crop type classification,”IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , vol. 14, pp. 12 361–12 374, 2021
work page 2021
-
[12]
M. Datcu, Z. Huang, A. Anghel, J. Zhao, and R. Cacoveanu, “Ex- plainable, physics-aware, trustworthy artificial intelligence: A paradigm shift for synthetic aperture radar,” IEEE Geoscience and Remote Sensing Magazine, vol. 11, no. 1, pp. 8–25, 2023
work page 2023
-
[13]
Polsar ship detection based on azimuth sublook polarimetric covariance matrix,
Z. Yang, L. Fang, B. Shen, and T. Liu, “Polsar ship detection based on azimuth sublook polarimetric covariance matrix,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , vol. 15, pp. 8506–8518, 2022
work page 2022
-
[14]
New decomposition of the radar target scattering matrix,
E. Krogager, “New decomposition of the radar target scattering matrix,” Electronics letters, vol. 18, no. 26, pp. 1525–1527, 1990
work page 1990
-
[15]
A three-component scattering model for polarimetric sar data,
A. Freeman and S. L. Durden, “A three-component scattering model for polarimetric sar data,” IEEE transactions on geoscience and remote sensing, vol. 36, no. 3, pp. 963–973, 1998
work page 1998
-
[16]
Four- component scattering model for polarimetric sar image decomposition,
Y . Yamaguchi, T. Moriyama, M. Ishido, and H. Yamada, “Four- component scattering model for polarimetric sar image decomposition,” IEEE Transactions on geoscience and remote sensing , vol. 43, no. 8, pp. 1699–1706, 2005
work page 2005
-
[17]
A review of target decomposition theorems in radar polarimetry,
S. R. Cloude and E. Pottier, “A review of target decomposition theorems in radar polarimetry,” IEEE transactions on geoscience and remote sensing, vol. 34, no. 2, pp. 498–518, 1996
work page 1996
-
[18]
J. Qin, Z. Liu, L. Ran, R. Xie, J. Tang, and Z. Guo, “A target sar image expansion method based on conditional wasserstein deep convolutional gan for automatic target recognition,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , vol. 15, pp. 7153– 7170, 2022
work page 2022
-
[19]
A novel algorithm for land use and land cover classification using radarsat-2 polarimetric sar data,
Z. Qi, A. G.-O. Yeh, X. Li, and Z. Lin, “A novel algorithm for land use and land cover classification using radarsat-2 polarimetric sar data,” Remote Sensing of Environment , vol. 118, pp. 21–39, 2012
work page 2012
-
[20]
A review of polsar image classification: From polarimetry to deep learning,
H. Wang, F. Xu, and Y .-Q. Jin, “A review of polsar image classification: From polarimetry to deep learning,” in IGARSS 2019-2019 IEEE Inter- national Geoscience and Remote Sensing Symposium . IEEE, 2019, pp. 3189–3192
work page 2019
-
[21]
Classification of sar and polsar images using deep learning: A review,
H. Parikh, S. Patel, and V . Patel, “Classification of sar and polsar images using deep learning: A review,”International Journal of Image and Data Fusion, vol. 11, no. 1, pp. 1–32, 2020
work page 2020
-
[22]
Polarimetric sar image clas- sification using deep convolutional neural networks,
Y . Zhou, H. Wang, F. Xu, and Y .-Q. Jin, “Polarimetric sar image clas- sification using deep convolutional neural networks,” IEEE Geoscience and Remote Sensing Letters , vol. 13, no. 12, pp. 1935–1939, 2016
work page 1935
-
[23]
Polsar image classification using polarimetric-feature-driven deep convolutional neural network,
S.-W. Chen and C.-S. Tao, “Polsar image classification using polarimetric-feature-driven deep convolutional neural network,” IEEE Geoscience and Remote Sensing Letters , vol. 15, no. 4, pp. 627–631, 2018
work page 2018
-
[24]
A. Radman, M. Mahdianpari, B. Brisco, B. Salehi, and F. Mohammadi- manesh, “Dual-branch fusion of convolutional neural network and graph convolutional network for polsar image classification,” Remote Sensing, vol. 15, no. 1, p. 75, 2022
work page 2022
-
[25]
Polsar image classification with lightweight 3d convolutional networks,
H. Dong, L. Zhang, and B. Zou, “Polsar image classification with lightweight 3d convolutional networks,” Remote Sensing, vol. 12, no. 3, p. 396, 2020
work page 2020
-
[26]
J. Barrachina, C. Ren, G. Vieillard, C. Morisseau, and J.-P. Ovarlez, “Real-and complex-valued neural networks for sar image segmentation through different polarimetric representations,” in 2022 IEEE Interna- tional Conference on Image Processing (ICIP). IEEE, 2022, pp. 1456– 1460. 13
work page 2022
-
[27]
Complex-valued vs. real-valued convolutional neural network for polsar data classification,
R. M. Asiyabi, M. Datcu, H. Nies, and A. Anghel, “Complex-valued vs. real-valued convolutional neural network for polsar data classification,” in IGARSS 2022-2022 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2022, pp. 421–424
work page 2022
-
[28]
Complex-valued convolutional neural net- works for object detection in polsar data,
R. H ¨ansch and O. Hellwich, “Complex-valued convolutional neural net- works for object detection in polsar data,” in 8th European Conference on Synthetic Aperture Radar . VDE, 2010, pp. 1–4
work page 2010
-
[29]
Hybrid attention- based encoder–decoder fully convolutional network for polsar image classification,
Z. Fang, G. Zhang, Q. Dai, B. Xue, and P. Wang, “Hybrid attention- based encoder–decoder fully convolutional network for polsar image classification,” Remote Sensing, vol. 15, no. 2, p. 526, 2023
work page 2023
-
[30]
Polarimetric sar terrain classification using 3d convolutional neural network,
L. Zhang, Z. Chen, B. Zou, and Y . Gao, “Polarimetric sar terrain classification using 3d convolutional neural network,” in IGARSS 2018- 2018 IEEE International Geoscience and Remote Sensing Symposium . IEEE, 2018, pp. 4551–4554
work page 2018
-
[31]
H. Dong, L. Zhang, D. Lu, and B. Zou, “Attention-based polarimetric feature selection convolutional network for polsar image classification,” IEEE Geoscience and Remote Sensing Letters , vol. 19, pp. 1–5, 2020
work page 2020
-
[32]
Squeeze-and-excitation networks,
J. Hu, L. Shen, and G. Sun, “Squeeze-and-excitation networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 7132–7141
work page 2018
-
[33]
J. Zhang, P. Ma, T. Jiang, X. Zhao, W. Tan, J. Zhang, S. Zou, X. Huang, M. Grzegorzek, and C. Li, “Sem-rcnn: a squeeze-and-excitation-based mask region convolutional neural network for multi-class environmental microorganism detection,” Applied Sciences , vol. 12, no. 19, p. 9902, 2022
work page 2022
-
[34]
Dnn-based polsar image classification on noisy labels,
J. Ni, D. Xiang, Z. Lin, C. L ´opez-Mart´ınez, W. Hu, and F. Zhang, “Dnn-based polsar image classification on noisy labels,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing , vol. 15, pp. 3697–3713, 2022
work page 2022
-
[35]
Polsar image classifica- tion using a superpixel-based composite kernel and elastic net,
Y . Cao, Y . Wu, M. Li, W. Liang, and P. Zhang, “Polsar image classifica- tion using a superpixel-based composite kernel and elastic net,” Remote Sensing, vol. 13, no. 3, p. 380, 2021
work page 2021
-
[36]
Polsf: Polsar image datasets on san francisco,
X. Liu, L. Jiao, F. Liu, D. Zhang, and X. Tang, “Polsf: Polsar image datasets on san francisco,” in International Conference on Intelligence Science. Springer, 2022, pp. 214–219
work page 2022
-
[37]
Pol-insar-island-a benchmark dataset for multi-frequency pol-insar data land cover classification,
S. Hochstuhl, N. Pfeffer, A. Thiele, S. Hinz, J. Amao-Oliva, R. Scheiber, A. Reigber, and H. Dirks, “Pol-insar-island-a benchmark dataset for multi-frequency pol-insar data land cover classification,” ISPRS Open Journal of Photogrammetry and Remote Sensing , vol. 10, p. 100047, 2023
work page 2023
-
[38]
Support vector machine for multifrequency sar polarimetric data classification,
C. Lardeux, P.-L. Frison, C. Tison, J.-C. Souyris, B. Stoll, B. Fruneau, and J.-P. Rudant, “Support vector machine for multifrequency sar polarimetric data classification,” IEEE Transactions on Geoscience and Remote Sensing, vol. 47, no. 12, pp. 4143–4152, 2009
work page 2009
-
[39]
Z. Zhang, H. Wang, F. Xu, and Y .-Q. Jin, “Complex-valued convo- lutional neural network and its application in polarimetric sar image classification,” IEEE Transactions on Geoscience and Remote Sensing , vol. 55, no. 12, pp. 7177–7188, 2017
work page 2017
-
[40]
Complex-valued 3- d convolutional neural network for polsar image classification,
X. Tan, M. Li, P. Zhang, Y . Wu, and W. Song, “Complex-valued 3- d convolutional neural network for polsar image classification,” IEEE Geoscience and Remote Sensing Letters , vol. 17, no. 6, pp. 1022–1026, 2019
work page 2019
-
[41]
A. Jamali, M. Mahdianpari, F. Mohammadimanesh, A. Bhattacharya, and S. Homayouni, “Polsar image classification based on deep convolu- tional neural networks using wavelet transformation,” IEEE Geoscience and Remote Sensing Letters , vol. 19, pp. 1–5, 2022
work page 2022
-
[42]
Polsar image classification using attention based shallow to deep convolutional neural network,
M. Q. Alkhatib, M. Al-Saad, N. Aburaed, M. S. Zitouni, and H. Al- Ahmad, “Polsar image classification using attention based shallow to deep convolutional neural network,” in IGARSS 2023-2023 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2023, pp. 8034–8037
work page 2023
-
[43]
Improved spatial-spectral super- pixel hyperspectral unmixing,
M. Q. Alkhatib and M. Velez-Reyes, “Improved spatial-spectral super- pixel hyperspectral unmixing,” Remote Sensing, vol. 11, no. 20, p. 2374, 2019. Mohammed Q. Alkhatib (S’09, M’18, SM’24) earned his B.S. degree in Telecommunications En- gineering from Yarmouk University, Irbid, Jordan, in 2008. Subsequently, he completed his M.S. and Ph.D. degrees in Ele...
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.