PolSAR Image Classification using a Hybrid Complex-Valued Network (HybridCVNet)

Mohammed Q. Alkhatib

arxiv: 2605.31137 · v1 · pith:R5W3VDZNnew · submitted 2026-05-29 · 💻 cs.CV

PolSAR Image Classification using a Hybrid Complex-Valued Network (HybridCVNet)

Mohammed Q. Alkhatib This is my paper

Pith reviewed 2026-06-28 22:50 UTC · model grok-4.3

classification 💻 cs.CV

keywords PolSAR image classificationcomplex-valued networksCNNvision transformerremote sensinghybrid architecturepolarimetric SAR

0 comments

The pith

Hybrid complex-valued network blends CNN and vision transformer for PolSAR classification

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces HybridCVNet, a network that merges complex-valued convolutional layers with a complex-valued vision transformer for classifying polarimetric synthetic aperture radar images. Real-valued networks lose the phase component of this data, while the hybrid design uses 3D and 2D complex convolutions to pull out complementary features and model their interdependencies. Tests on common PolSAR datasets show the method exceeds prior results, including strong results when only one percent of samples are available for training. This points to a practical way to retain more of the information in radar returns for tasks like terrain mapping.

Core claim

HybridCVNet efficiently combines CV 3D and 2D CNNs as feature extractors with CV-ViT to extract complementary information and leverage interdependencies within PolSAR data, resulting in superior classification performance on widely-used datasets.

What carries the argument

The hybrid architecture of complex-valued CNNs and complex-valued vision transformer that processes phase information in PolSAR data

If this is right

Overall accuracy reaches 97.39 percent on the Flevoland dataset
Classification remains reliable even at a one percent sampling ratio
Kappa coefficient of 0.972 is obtained on the San Francisco dataset
The approach exceeds results from other methods on standard PolSAR test sets

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same hybrid pattern could be tested on other complex-valued remote-sensing inputs such as interferometric data
Lower data requirements might support deployment where ground-truth labels are scarce
The architecture offers a template for preserving phase in any complex imaging pipeline

Load-bearing premise

The hybrid CV-CNN plus CV-ViT design extracts complementary information and leverages interdependencies in PolSAR data without the need for extensive post-hoc tuning or dataset-specific adjustments

What would settle it

Showing that a standard real-valued network or a non-hybrid complex network reaches equal or higher accuracy on the Flevoland and San Francisco datasets at the same sampling ratios would undermine the claimed benefit of the specific hybrid design

Figures

Figures reproduced from arXiv: 2605.31137 by Mohammed Q. Alkhatib.

**Figure 1.** Figure 1: Overall Architecture of HybridCVNet presentation of the results and analyses, and the study concludes in Section IV with a summary and conclusion remarks. II. METHODOLOGY A. Polarimetric Data of PolSAR Image The properties of how ground objects scatter electromagnetic waves can be explained using polarized scattering matrix S as defined below: S = SHH SHV SV H SV V , (1) where SAB(A, B ∈ H, V ) repre… view at source ↗

**Figure 2.** Figure 2: Classification results of the Flevoland dataset. (a) Reference Class Map; (b) 3D-CNN; (c) WaveletCNN; (d) ViT; (e) swin Transformer; (f) [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Classification results of the San Francisco dataset. (a) Reference Class Map; (b) 3D-CNN; (c) WaveletCNN; (d) ViT; (e) swin Transformer; (f) [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Zoomed area of the white box region in Flevoland region Fig. [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗

**Figure 5.** Figure 5: Classification accuracy of Flevoland dataset at different percentages of training data (a) OA (b) AA and (c) Kappa index. [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗

read the original abstract

Recently, convolutional neural networks (CNNs) have become popular for image classification due to their effectiveness in computer vision tasks. Now, researchers are exploring the potential of vision transformers (ViTs) in remote sensing and Earth observation. However, traditional Real-Valued networks often overlook important phase information in Complex-Valued (CV) data like polarimetric synthetic aperture radar (PolSAR) data. To address this, new CV deep architectures have emerged. HybridCVNet, a novel hybrid network, blends CV-CNN and CV vision transformer (CV-ViT) techniques. It efficiently combines CV 3D and 2D CNNs as feature extractors, enhancing PolSAR image classification by extracting complementary information and effectively leveraging interdependencies within the data. Experimental results from widely-used PolSAR datasets show HybridCVNet outperforms other methods, achieving an overall accuracy of 97.39% on the Flevoland dataset and showing promise even with just a 1% sampling ratio, with a Kappa value of 0.972 on the San Francisco dataset. Source code is accessible through https://github.com/mqalkhatib/HybridCVNet

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

HybridCVNet is a CV-CNN plus CV-ViT stack for PolSAR that reports strong numbers on two standard datasets and ships the code.

read the letter

The paper's core contribution is a hybrid architecture that pairs complex-valued 3D/2D CNN layers with a complex-valued ViT for PolSAR classification. It keeps the phase information that real-valued models discard and claims 97.39% overall accuracy on Flevoland plus a Kappa of 0.972 on San Francisco, with usable results even at 1% labeled samples.

The work does two things cleanly. First, it shows that the hybrid can pull complementary spatial and long-range features from the complex data without obvious post-processing tricks. Second, the authors put the code on GitHub, which lets others rerun or extend the experiments directly.

The main soft spot is that the abstract gives almost no protocol details, so the size of the reported gains over prior CV networks is hard to judge from the summary alone. If the full paper includes proper cross-validation, multiple random splits, and clear baseline descriptions, that concern shrinks; otherwise the numbers stay hard to interpret. No load-bearing math or circular claims appear.

This paper is aimed at the remote-sensing crowd that already works with PolSAR or complex-valued networks. Someone looking for an off-the-shelf accuracy bump on these datasets or a starting point for further CV-ViT hybrids will find it useful.

It deserves a serious referee because the claim is empirical, the code is public, and the task is well-defined. I would send it out for review.

Referee Report

1 major / 0 minor

Summary. The manuscript proposes HybridCVNet, a hybrid complex-valued architecture combining CV-CNN (3D and 2D) feature extractors with a CV-ViT component for PolSAR image classification. It reports empirical results showing superior performance over other methods, with 97.39% overall accuracy on the Flevoland dataset and a Kappa value of 0.972 on the San Francisco dataset, including strong results at a 1% sampling ratio. Source code is released on GitHub.

Significance. If the performance claims hold under rigorous validation, the work would contribute to PolSAR classification by demonstrating benefits of hybrid complex-valued networks that preserve phase information. The public code release is a positive factor supporting reproducibility.

major comments (1)

[Abstract] Abstract: The performance numbers (97.39% OA on Flevoland, Kappa 0.972 on San Francisco) are stated without any description of the experimental protocol, data splitting strategy, baseline methods and their implementations, cross-validation procedure, or error analysis; this prevents evaluation of the central empirical claim.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on the abstract. We address the concern point-by-point below.

read point-by-point responses

Referee: [Abstract] Abstract: The performance numbers (97.39% OA on Flevoland, Kappa 0.972 on San Francisco) are stated without any description of the experimental protocol, data splitting strategy, baseline methods and their implementations, cross-validation procedure, or error analysis; this prevents evaluation of the central empirical claim.

Authors: We agree that the abstract is concise by design and omits explicit details on the experimental protocol, data splitting (e.g., the 1% sampling ratio), baseline implementations, cross-validation, and error analysis. These elements are fully described in the Experiments and Results sections of the manuscript, including dataset descriptions, sampling strategies, baseline comparisons, and quantitative metrics. To improve clarity for readers who encounter only the abstract, we will revise the abstract to include a brief sentence summarizing the evaluation protocol on standard PolSAR datasets with the reported sampling ratios and comparisons to baselines. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper's central claim is an empirical performance result: HybridCVNet achieves 97.39% accuracy on Flevoland and Kappa 0.972 on San Francisco using a hybrid CV-CNN + CV-ViT architecture. No derivation chain, equations, fitted parameters renamed as predictions, or self-citation load-bearing steps are present. The architecture's ability to extract complementary information is presented as an observed experimental outcome rather than a formal necessity derived from its own inputs. The work is self-contained against external benchmarks via reported dataset results and released code.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no equations, parameters, or explicit assumptions; ledger cannot be populated beyond noting absence of information.

pith-pipeline@v0.9.1-grok · 5729 in / 1008 out tokens · 19022 ms · 2026-06-28T22:50:52.192705+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

22 extracted references · 1 linked inside Pith

[1]

Potential of estimating soil moisture under vegetation cover by means of polsar,

I. Hajnsek, T. Jagdhuber, H. Schon, and K. P. Papathanassiou, “Potential of estimating soil moisture under vegetation cover by means of polsar,” IEEE Transactions on Geoscience and Remote Sensing, vol. 47, no. 2, pp. 442–454, 2009

2009
[2]

A robust change detection methodology for flood events using sar images,

M. Al-Saad, N. Aburaed, M. S. Zitouni, M. Q. Alkhatib, S. Almansoori, and H. Al Ahmad, “A robust change detection methodology for flood events using sar images,” inIGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, 2023, pp. 341–344

2023
[3]

Polarimetric radar technology for european defence superiority-the polrad project,

A. Lupidi, C. Greiff, S. Br ¨uggenwirth, M. Brandfass, and M. Martorella, “Polarimetric radar technology for european defence superiority-the polrad project,” in2020 21st International Radar Symposium (IRS). IEEE, 2020, pp. 6–10

2020
[4]

Machine learning classification based on k-nearest neighbors for polsar data,

J. A. Ferreira, A. K. Rodrigues, R. Ospina, and L. Gomez, “Machine learning classification based on k-nearest neighbors for polsar data,” Anais da Academia Brasileira de Ci ˆencias, vol. 96, no. 1, p. e20230064, 2024

2024
[5]

Performance analysis of sar filtering techniques using svm and wishart classifier,

A. Masurkar, R. Daruwala, and A. Mohite, “Performance analysis of sar filtering techniques using svm and wishart classifier,”Remote Sensing Applications: Society and Environment, vol. 34, p. 101189, 2024

2024
[6]

Polarimetric sar image clas- sification using deep convolutional neural networks,

Y . Zhou, H. Wang, F. Xu, and Y .-Q. Jin, “Polarimetric sar image clas- sification using deep convolutional neural networks,”IEEE Geoscience and Remote Sensing Letters, vol. 13, no. 12, pp. 1935–1939, 2016

1935
[7]

Polarimetric sar terrain classification using 3d convolutional neural network,

L. Zhang, Z. Chen, B. Zou, and Y . Gao, “Polarimetric sar terrain classification using 3d convolutional neural network,” inIGARSS 2018- 2018 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2018, pp. 4551–4554

2018
[8]

Complex-valued convo- lutional neural network and its application in polarimetric sar image classification,

Z. Zhang, H. Wang, F. Xu, and Y .-Q. Jin, “Complex-valued convo- lutional neural network and its application in polarimetric sar image classification,”IEEE Transactions on Geoscience and Remote Sensing, vol. 55, no. 12, pp. 7177–7188, 2017

2017
[9]

Complex-valued 3- d convolutional neural network for polsar image classification,

X. Tan, M. Li, P. Zhang, Y . Wu, and W. Song, “Complex-valued 3- d convolutional neural network for polsar image classification,”IEEE Geoscience and Remote Sensing Letters, vol. 17, no. 6, pp. 1022–1026, 2019

2019
[10]

An image is worth 16x16 words: Transformers for image recognition at scale,

A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gellyet al., “An image is worth 16x16 words: Transformers for image recognition at scale,”arXiv preprint arXiv:2010.11929, 2020

Pith/arXiv arXiv 2010
[11]

Exploring vision transformers for polarimetric sar image classification,

H. Dong, L. Zhang, and B. Zou, “Exploring vision transformers for polarimetric sar image classification,”IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–15, 2021

2021
[12]

Multimodal fusion transformer for remote sensing image classification,

S. K. Roy, A. Deria, D. Hong, B. Rasti, A. Plaza, and J. Chanussot, “Multimodal fusion transformer for remote sensing image classification,” IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1– 20, 2023

2023
[13]

Local window attention transformer for polarimetric sar image classification,

A. Jamali, S. K. Roy, A. Bhattacharya, and P. Ghamisi, “Local window attention transformer for polarimetric sar image classification,”IEEE Geoscience and Remote Sensing Letters, vol. 20, pp. 1–5, 2023

2023
[14]

A new architecture of a complex-valued convolutional neural network for polsar image classification,

Y . Ren, W. Jiang, and Y . Liu, “A new architecture of a complex-valued convolutional neural network for polsar image classification,”Remote Sensing, vol. 15, no. 19, p. 4801, 2023

2023
[15]

Polsar image classification using attention based shallow to deep convolutional neural network,

M. Q. Alkhatib, M. Al-Saad, N. Aburaed, M. S. Zitouni, and H. Al- Ahmad, “Polsar image classification using attention based shallow to deep convolutional neural network,” inIGARSS 2023-2023 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2023, pp. 8034–8037

2023
[16]

Hybridsn: Exploring 3-d–2-d cnn feature hierarchy for hyperspectral image classi- fication,

S. K. Roy, G. Krishna, S. R. Dubey, and B. B. Chaudhuri, “Hybridsn: Exploring 3-d–2-d cnn feature hierarchy for hyperspectral image classi- fication,”IEEE Geoscience and Remote Sensing Letters, vol. 17, no. 2, pp. 277–281, 2019

2019
[17]

Multimodal fusion transformer for remote sensing image classification,

S. K. Roy, A. Deria, D. Hong, B. Rasti, A. Plaza, and J. Chanussot, “Multimodal fusion transformer for remote sensing image classification,” IEEE Transactions on Geoscience and Remote Sensing, 2023

2023
[18]

Complex-valued vs. real-valued neural networks for classification perspectives: An example on non-circular data,

J. A. Barrachina, C. Ren, C. Morisseau, G. Vieillard, and J.-P. Ovar- lez, “Complex-valued vs. real-valued neural networks for classification perspectives: An example on non-circular data,” inICASSP 2021- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021, pp. 2990–2994

2021
[19]

Polsar image classifica- tion using a superpixel-based composite kernel and elastic net,

Y . Cao, Y . Wu, M. Li, W. Liang, and P. Zhang, “Polsar image classifica- tion using a superpixel-based composite kernel and elastic net,”Remote Sensing, vol. 13, no. 3, p. 380, 2021

2021
[20]

Polsf: Polsar image datasets on san francisco,

X. Liu, L. Jiao, F. Liu, D. Zhang, and X. Tang, “Polsf: Polsar image datasets on san francisco,” inInternational Conference on Intelligence Science. Springer, 2022, pp. 214–219

2022
[21]

Polsar image classification based on deep convolu- tional neural networks using wavelet transformation,

A. Jamali, M. Mahdianpari, F. Mohammadimanesh, A. Bhattacharya, and S. Homayouni, “Polsar image classification based on deep convolu- tional neural networks using wavelet transformation,”IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1–5, 2022

2022
[22]

Swin transformer: Hierarchical vision transformer using shifted windows,

Z. Liu, Y . Lin, Y . Cao, H. Hu, Y . Wei, Z. Zhang, S. Lin, and B. Guo, “Swin transformer: Hierarchical vision transformer using shifted windows,” inProceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 10 012–10 022

2021

[1] [1]

Potential of estimating soil moisture under vegetation cover by means of polsar,

I. Hajnsek, T. Jagdhuber, H. Schon, and K. P. Papathanassiou, “Potential of estimating soil moisture under vegetation cover by means of polsar,” IEEE Transactions on Geoscience and Remote Sensing, vol. 47, no. 2, pp. 442–454, 2009

2009

[2] [2]

A robust change detection methodology for flood events using sar images,

M. Al-Saad, N. Aburaed, M. S. Zitouni, M. Q. Alkhatib, S. Almansoori, and H. Al Ahmad, “A robust change detection methodology for flood events using sar images,” inIGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, 2023, pp. 341–344

2023

[3] [3]

Polarimetric radar technology for european defence superiority-the polrad project,

A. Lupidi, C. Greiff, S. Br ¨uggenwirth, M. Brandfass, and M. Martorella, “Polarimetric radar technology for european defence superiority-the polrad project,” in2020 21st International Radar Symposium (IRS). IEEE, 2020, pp. 6–10

2020

[4] [4]

Machine learning classification based on k-nearest neighbors for polsar data,

J. A. Ferreira, A. K. Rodrigues, R. Ospina, and L. Gomez, “Machine learning classification based on k-nearest neighbors for polsar data,” Anais da Academia Brasileira de Ci ˆencias, vol. 96, no. 1, p. e20230064, 2024

2024

[5] [5]

Performance analysis of sar filtering techniques using svm and wishart classifier,

A. Masurkar, R. Daruwala, and A. Mohite, “Performance analysis of sar filtering techniques using svm and wishart classifier,”Remote Sensing Applications: Society and Environment, vol. 34, p. 101189, 2024

2024

[6] [6]

Polarimetric sar image clas- sification using deep convolutional neural networks,

Y . Zhou, H. Wang, F. Xu, and Y .-Q. Jin, “Polarimetric sar image clas- sification using deep convolutional neural networks,”IEEE Geoscience and Remote Sensing Letters, vol. 13, no. 12, pp. 1935–1939, 2016

1935

[7] [7]

Polarimetric sar terrain classification using 3d convolutional neural network,

L. Zhang, Z. Chen, B. Zou, and Y . Gao, “Polarimetric sar terrain classification using 3d convolutional neural network,” inIGARSS 2018- 2018 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2018, pp. 4551–4554

2018

[8] [8]

Complex-valued convo- lutional neural network and its application in polarimetric sar image classification,

Z. Zhang, H. Wang, F. Xu, and Y .-Q. Jin, “Complex-valued convo- lutional neural network and its application in polarimetric sar image classification,”IEEE Transactions on Geoscience and Remote Sensing, vol. 55, no. 12, pp. 7177–7188, 2017

2017

[9] [9]

Complex-valued 3- d convolutional neural network for polsar image classification,

X. Tan, M. Li, P. Zhang, Y . Wu, and W. Song, “Complex-valued 3- d convolutional neural network for polsar image classification,”IEEE Geoscience and Remote Sensing Letters, vol. 17, no. 6, pp. 1022–1026, 2019

2019

[10] [10]

An image is worth 16x16 words: Transformers for image recognition at scale,

A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gellyet al., “An image is worth 16x16 words: Transformers for image recognition at scale,”arXiv preprint arXiv:2010.11929, 2020

Pith/arXiv arXiv 2010

[11] [11]

Exploring vision transformers for polarimetric sar image classification,

H. Dong, L. Zhang, and B. Zou, “Exploring vision transformers for polarimetric sar image classification,”IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–15, 2021

2021

[12] [12]

Multimodal fusion transformer for remote sensing image classification,

S. K. Roy, A. Deria, D. Hong, B. Rasti, A. Plaza, and J. Chanussot, “Multimodal fusion transformer for remote sensing image classification,” IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1– 20, 2023

2023

[13] [13]

Local window attention transformer for polarimetric sar image classification,

A. Jamali, S. K. Roy, A. Bhattacharya, and P. Ghamisi, “Local window attention transformer for polarimetric sar image classification,”IEEE Geoscience and Remote Sensing Letters, vol. 20, pp. 1–5, 2023

2023

[14] [14]

A new architecture of a complex-valued convolutional neural network for polsar image classification,

Y . Ren, W. Jiang, and Y . Liu, “A new architecture of a complex-valued convolutional neural network for polsar image classification,”Remote Sensing, vol. 15, no. 19, p. 4801, 2023

2023

[15] [15]

Polsar image classification using attention based shallow to deep convolutional neural network,

M. Q. Alkhatib, M. Al-Saad, N. Aburaed, M. S. Zitouni, and H. Al- Ahmad, “Polsar image classification using attention based shallow to deep convolutional neural network,” inIGARSS 2023-2023 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2023, pp. 8034–8037

2023

[16] [16]

Hybridsn: Exploring 3-d–2-d cnn feature hierarchy for hyperspectral image classi- fication,

S. K. Roy, G. Krishna, S. R. Dubey, and B. B. Chaudhuri, “Hybridsn: Exploring 3-d–2-d cnn feature hierarchy for hyperspectral image classi- fication,”IEEE Geoscience and Remote Sensing Letters, vol. 17, no. 2, pp. 277–281, 2019

2019

[17] [17]

Multimodal fusion transformer for remote sensing image classification,

S. K. Roy, A. Deria, D. Hong, B. Rasti, A. Plaza, and J. Chanussot, “Multimodal fusion transformer for remote sensing image classification,” IEEE Transactions on Geoscience and Remote Sensing, 2023

2023

[18] [18]

Complex-valued vs. real-valued neural networks for classification perspectives: An example on non-circular data,

J. A. Barrachina, C. Ren, C. Morisseau, G. Vieillard, and J.-P. Ovar- lez, “Complex-valued vs. real-valued neural networks for classification perspectives: An example on non-circular data,” inICASSP 2021- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021, pp. 2990–2994

2021

[19] [19]

Polsar image classifica- tion using a superpixel-based composite kernel and elastic net,

Y . Cao, Y . Wu, M. Li, W. Liang, and P. Zhang, “Polsar image classifica- tion using a superpixel-based composite kernel and elastic net,”Remote Sensing, vol. 13, no. 3, p. 380, 2021

2021

[20] [20]

Polsf: Polsar image datasets on san francisco,

X. Liu, L. Jiao, F. Liu, D. Zhang, and X. Tang, “Polsf: Polsar image datasets on san francisco,” inInternational Conference on Intelligence Science. Springer, 2022, pp. 214–219

2022

[21] [21]

Polsar image classification based on deep convolu- tional neural networks using wavelet transformation,

A. Jamali, M. Mahdianpari, F. Mohammadimanesh, A. Bhattacharya, and S. Homayouni, “Polsar image classification based on deep convolu- tional neural networks using wavelet transformation,”IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1–5, 2022

2022

[22] [22]

Swin transformer: Hierarchical vision transformer using shifted windows,

Z. Liu, Y . Lin, Y . Cao, H. Hu, Y . Wei, Z. Zhang, S. Lin, and B. Guo, “Swin transformer: Hierarchical vision transformer using shifted windows,” inProceedings of the IEEE/CVF international conference on computer vision, 2021, pp. 10 012–10 022

2021