SpectralTrain: A Universal Framework for Hyperspectral Image Classification

Jiarui Zhao; Liping Yu; Meihua Zhou; Nan Wan; Ruiguo Hu; Wai Kin Fung; Wenzhuo Liu; Xinyu Tong

arxiv: 2511.16084 · v2 · pith:5EGDK5I2new · submitted 2025-11-20 · 💻 cs.CV · cs.AI

SpectralTrain: A Universal Framework for Hyperspectral Image Classification

Meihua Zhou , Liping Yu , Xinyu Tong , Wai Kin Fung , Ruiguo Hu , Jiarui Zhao , Wenzhuo Liu , Nan Wan This is my paper

Pith reviewed 2026-05-17 20:54 UTC · model grok-4.3

classification 💻 cs.CV cs.AI

keywords hyperspectral image classificationcurriculum learningprincipal component analysistraining efficiencyremote sensingdeep learningcloud classification

0 comments

The pith

SpectralTrain speeds hyperspectral image model training by 2-7x using curriculum learning and PCA spectral reduction while keeping accuracy close to full training.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents SpectralTrain as a training framework for hyperspectral image classification that pairs a gradual curriculum schedule with principal component analysis to lower spectral dimensions step by step. This combination lets models learn spectral-spatial patterns on reduced data early in training before adding back complexity, cutting overall computation. The approach is designed to work with any backbone architecture, optimizer, or loss function and was tested on Indian Pines, Salinas-A, and a new CloudPatch-7 dataset for cloud classification. Results show training time reductions of 2-7 times with only small to moderate accuracy changes depending on the model. The work positions training strategy as a practical way to improve efficiency without changing model design.

Core claim

SpectralTrain integrates curriculum learning with PCA-based spectral downsampling to gradually expose models to increasing spectral complexity, enabling them to acquire essential spectral-spatial features at lower computational cost than standard full-spectrum training from the outset.

What carries the argument

The SpectralTrain framework that applies PCA to progressively reduce spectral bands while advancing a curriculum schedule that increases learning difficulty over training epochs.

If this is right

Models can reach usable performance on hyperspectral tasks with substantially less GPU time.
The same training schedule works across classical and state-of-the-art networks without modification.
Cloud classification in remote-sensing data becomes more practical for repeated or large-scale runs.
Training optimization serves as a complement to architectural improvements in hyperspectral models.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method may transfer to other high-dimensional imaging domains where spectral or channel count drives cost.
Combining PCA reduction with curriculum learning could be tested on temporal sequences or multi-modal sensor data.
If the speedup holds on newer architectures, it would reduce the barrier to experimenting with larger hyperspectral models.

Load-bearing premise

That reducing spectral dimensions with PCA and increasing task complexity gradually will retain the information needed for accurate classification and yield better efficiency than training directly on the original high-dimensional data.

What would settle it

Running the same backbone on one of the benchmark datasets with and without SpectralTrain and finding either no training-time reduction or a large accuracy drop that exceeds the reported small-to-moderate deltas.

read the original abstract

Hyperspectral image (HSI) classification typically involves large-scale data and computationally intensive training, which limits the practical deployment of deep learning models in real-world remote sensing tasks. This study introduces SpectralTrain, a universal, architecture-agnostic training framework that enhances learning efficiency by integrating curriculum learning (CL) with principal component analysis (PCA)-based spectral downsampling. By gradually introducing spectral complexity while preserving essential information, SpectralTrain enables efficient learning of spectral -- spatial patterns at significantly reduced computational costs. The framework is independent of specific architectures, optimizers, or loss functions and is compatible with both classical and state-of-the-art (SOTA) models. Extensive experiments on three benchmark datasets -- Indian Pines, Salinas-A, and the newly introduced CloudPatch-7 -- demonstrate strong generalization across spatial scales, spectral characteristics, and application domains. The results indicate consistent reductions in training time by 2-7x speedups with small-to-moderate accuracy deltas depending on backbone. Its application to cloud classification further reveals potential in climate-related remote sensing, emphasizing training strategy optimization as an effective complement to architectural design in HSI models. Code is available at https://github.com/mh-zhou/SpectralTrain.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The manuscript introduces SpectralTrain, a universal, architecture-agnostic training framework for hyperspectral image classification that integrates curriculum learning with PCA-based spectral downsampling. It claims to enable efficient learning of spectral-spatial patterns at reduced computational cost, delivering consistent 2-7x training speedups with small-to-moderate accuracy deltas across backbones on the Indian Pines, Salinas-A, and newly introduced CloudPatch-7 datasets, while remaining independent of specific architectures, optimizers, or loss functions.

Significance. If the reported efficiency gains hold after proper controls, the framework could have practical significance for real-world remote sensing deployments by lowering training costs without large accuracy penalties. The architecture-agnostic design, code release at the cited GitHub repository, and introduction of the CloudPatch-7 dataset for cloud classification are positive elements that support reproducibility and broader applicability in climate-related tasks.

major comments (1)

[Experiments] The central efficiency claim attributes 2-7x speedups to the integrated SpectralTrain framework (PCA downsampling plus curriculum schedule). However, the experimental section provides no ablation that holds the final PCA dimensionality fixed while comparing the curriculum schedule against standard training with constant low-dimensional input from epoch 1 onward. Without this control, the contribution of the gradual complexity increase cannot be isolated from ordinary dimensionality reduction, which directly affects the load-bearing claim that the curriculum component reliably improves learning efficiency.

minor comments (2)

[Abstract] The abstract and results summary mention accuracy deltas and speedups but do not report error bars, standard deviations across runs, or a complete experimental protocol (e.g., hardware, exact hyperparameter settings, number of trials).
Notation for the curriculum schedule parameters (downsampling levels, transition epochs) should be defined more explicitly with a table or pseudocode to aid reproducibility.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address the major comment below.

read point-by-point responses

Referee: The central efficiency claim attributes 2-7x speedups to the integrated SpectralTrain framework (PCA downsampling plus curriculum schedule). However, the experimental section provides no ablation that holds the final PCA dimensionality fixed while comparing the curriculum schedule against standard training with constant low-dimensional input from epoch 1 onward. Without this control, the contribution of the gradual complexity increase cannot be isolated from ordinary dimensionality reduction, which directly affects the load-bearing claim that the curriculum component reliably improves learning efficiency.

Authors: We appreciate the referee's observation. The manuscript reports overall speedups and accuracy for the full SpectralTrain framework versus standard training on full-dimensional inputs, and includes PCA downsampling results in several tables to illustrate the effect of dimensionality reduction. However, we acknowledge that an explicit ablation holding the final PCA dimensionality fixed and directly comparing the curriculum schedule (gradual spectral complexity increase) against constant low-dimensional training from epoch 1 is not presented. This control would help isolate the curriculum contribution. We will add this ablation, including training time and accuracy metrics on the primary datasets, in the revised manuscript. revision: yes

Circularity Check

0 steps flagged

Empirical training framework with no circular derivations

full rationale

The paper presents SpectralTrain as an empirical recipe that combines PCA-based spectral downsampling with a curriculum schedule for hyperspectral image classification. It reports measured speedups and accuracy on public benchmarks (Indian Pines, Salinas-A, CloudPatch-7) across multiple backbones. No mathematical derivation chain, first-principles prediction, or fitted parameter is claimed; the work contains no equations that reduce to their own inputs by construction, no self-citation load-bearing uniqueness theorems, and no renaming of known results as novel organization. The central claims rest on experimental outcomes rather than self-referential logic, making the contribution self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

Framework rests on standard machine-learning assumptions about curriculum learning and PCA information preservation; no new physical entities or heavily fitted constants are introduced in the abstract.

free parameters (1)

Curriculum schedule and downsampling levels
Specific rates for gradually increasing spectral complexity and PCA component counts are tunable parameters required to implement the method.

axioms (2)

domain assumption PCA downsampling preserves essential spectral information for downstream classification
Core premise allowing computational reduction without loss of key patterns.
domain assumption Curriculum learning improves training efficiency and final performance when spectral complexity is introduced gradually
Standard ML assumption applied to the HSI setting.

pith-pipeline@v0.9.0 · 5535 in / 1381 out tokens · 55439 ms · 2026-05-17T20:54:41.007292+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages

[1]

IEEE Transactions on Geoscience and Remote Sensing (2025)

He, Y., Tu, B., Liu, B., Li, J., Plaza, A.: Hsi-mformer: Integrating mamba and transformer experts for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing (2025)

work page 2025
[2]

El-Gabri, A.R., Aly, H.A., Ghoniemy, T.S.,et al.: DLRA-Net: Deep local residual attention network with contextual refinement for spectral super- resolution. Int. J. Comput. Vis.133, 1499–1531 (2025) https://doi.org/10.1007/ s11263-024-02238-w

work page 2025
[3]

Remote Sens

Plaza, A., Benediktsson, J.A., Boardman, J.W., Brazile, J., Bruzzone, L., Camps- Valls, G., Chanussot, J., Fauvel, M., Gamba, P., Gualtieri, A., Marconcini, M., Tilton, J.C., Trianni, G.: Recent advances in techniques for hyperspectral image processing. Remote Sens. Environ.113(Supplement 1), 110–122 (2009) https: //doi.org/10.1016/j.rse.2007.07.028

work page doi:10.1016/j.rse.2007.07.028 2009
[4]

IEEE Signal processing magazine19(1), 17–28 (2002)

Landgrebe, D.: Hyperspectral image data analysis. IEEE Signal processing magazine19(1), 17–28 (2002)

work page 2002
[5]

ISPRS Journal of Photogrammetry and Remote Sensing158, 279–317 (2019) 29

Paoletti, M.E., Haut, J.M., Plaza, J., Plaza, A.: Deep learning classifiers for hyperspectral imaging: A review. ISPRS Journal of Photogrammetry and Remote Sensing158, 279–317 (2019) 29

work page 2019
[6]

Wang, T., Yan, Z., Li, J.,et al.: Hyperspectral and multispectral image fusion with arbitrary resolution through self-supervised representations. Int. J. Comput. Vis. (2025) https://doi.org/10.1007/s11263-025-02540-1

work page doi:10.1007/s11263-025-02540-1 2025
[7]

Liu, Y., Dian, R., Li, S.: Low-rank transformer for high-resolution hyperspectral computational imaging. Int. J. Comput. Vis.133, 809–824 (2025) https://doi. org/10.1007/s11263-024-02203-7

work page doi:10.1007/s11263-024-02203-7 2025
[8]

IEEE Transactions on Pattern Analysis and Machine Intelligence46(12), 8036–8055 (2024)

Wang, Y., Yue, Y., Lu, R., Han, Y., Song, S., Huang, G.: Efficienttrain++: Generalized curriculum learning for efficient visual backbone training. IEEE Transactions on Pattern Analysis and Machine Intelligence46(12), 8036–8055 (2024)

work page 2024
[9]

Advances in neural information processing systems33, 1877–1901 (2020)

Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A.,et al.: Language models are few-shot learners. Advances in neural information processing systems33, 1877–1901 (2020)

work page 1901
[10]

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (long and Short Papers), pp. 4171–4186 (2019)

work page 2019
[11]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp

He, K., Chen, X., Xie, S., Li, Y., Doll´ ar, P., Girshick, R.: Masked autoencoders are scalable vision learners. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 16000–16009 (2022)

work page 2022
[12]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp

Li, F., Zhang, H., Liu, S., Guo, J., Ni, L.M., Zhang, L.: Dn-detr: Accelerate detr training by introducing query denoising. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13619–13627 (2022)

work page 2022
[13]

Advances in neural information processing systems 33, 3833–3845 (2020)

Zoph, B., Ghiasi, G., Lin, T.-Y., Cui, Y., Liu, H., Cubuk, E.D., Le, Q.: Rethinking pre-training and self-training. Advances in neural information processing systems 33, 3833–3845 (2020)

work page 2020
[14]

In: 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pp

Romero, J., Yin, J., Laanait, N., Xie, B., Young, M.T., Treichler, S., Starchenko, V., Borisevich, A., Sergeev, A., Matheson, M.: Accelerating collective communica- tion in data parallel training across deep learning frameworks. In: 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pp. 1027–1040 (2022)

work page 2022
[15]

IEEE/ACM Transactions on Audio, Speech, and Language Processing24(4), 796–806 (2016) 30

Wang, Z.-Q., Wang, D.: A joint training framework for robust automatic speech recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing24(4), 796–806 (2016) 30

work page 2016
[16]

Journal of Power Sources601, 234292 (2024)

Su, Q., Huang, R., He, H.: Heterogeneous multi-agent deep reinforcement learning for eco-driving of hybrid electric tracked vehicles: A heuristic training framework. Journal of Power Sources601, 234292 (2024)

work page 2024
[17]

Future Generation Computer Systems163, 107528 (2025)

Yu, X., Gao, Z., Xiong, Z., Zhao, C., Yang, Y.: Ddpg-adaptconfig: A deep reinforcement learning framework for adaptive device selection and training configuration in heterogeneity federated learning. Future Generation Computer Systems163, 107528 (2025)

work page 2025
[18]

IEEE transactions on pattern analysis and machine intelligence44(9), 4555–4576 (2021)

Wang, X., Chen, Y., Zhu, W.: A survey on curriculum learning. IEEE transactions on pattern analysis and machine intelligence44(9), 4555–4576 (2021)

work page 2021
[19]

Neural Computing and Applications36(9), 4709–4725 (2024)

Zhu, H., Xie, W., Mu, Y., Xu, J., Wang, F.L., Qu, Y., Hao, T.: A new semi- supervised fuzzy k-means clustering method with dynamic adjustment and label discrimination. Neural Computing and Applications36(9), 4709–4725 (2024)

work page 2024
[20]

In: Proceedings of the 2021 International Conference on Multimodal Interaction, pp

Hirano, Y., Okada, S., Komatani, K.: Recognizing social signals with weakly supervised multitask learning for multimodal dialogue systems. In: Proceedings of the 2021 International Conference on Multimodal Interaction, pp. 141–149 (2021)

work page 2021
[21]

IEEE Transactions on Circuits and Systems for Video Technology34(8), 7165–7175 (2024)

Xu, J., Ma, X., Zhang, L., Zhang, B., Chen, T.: Push-and-pull: A general training framework with differential augmentor for domain generalized point cloud classi- fication. IEEE Transactions on Circuits and Systems for Video Technology34(8), 7165–7175 (2024)

work page 2024
[22]

Applied Spectroscopy Reviews56(4), 289–323 (2021)

Calin, M.A., Calin, A.C., Nicolae, D.N.: Application of airborne and spaceborne hyperspectral imaging techniques for atmospheric research: Past, present, and future. Applied Spectroscopy Reviews56(4), 289–323 (2021)

work page 2021
[23]

Yan, H.Y., Zheng, R.Z., Boehm, B.B., Shaga, S.S., Black, D.B., Russell, R.R., Kursun, O.K.: Cloudpatch-7 hyperspectral dataset (2024)

work page 2024
[24]

In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recogni- tion. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

work page 2016
[25]

In: 2023 3rd International Conference on Innovative Sustainable Computational Technologies (CISCT), pp

Todi, A., Narula, N., Sharma, M., Gupta, U.: Convnext: A contemporary archi- tecture for convolutional neural networks for image classification. In: 2023 3rd International Conference on Innovative Sustainable Computational Technologies (CISCT), pp. 1–6 (2023). IEEE

work page 2023
[26]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp

Hampali, S., Sarkar, S.D., Rad, M., Lepetit, V.: Keypoint transformer: Solving joint identification in challenging hands and object interactions for accurate 3d pose estimation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11090–11100 (2022)

work page 2022
[27]

IEEE Transactions on Geoscience and Remote Sensing59(6), 5040–5053 (2020)

Wang, J., Gao, F., Dong, J., Du, Q.: Adaptive dropblock-enhanced generative 31 adversarial networks for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing59(6), 5040–5053 (2020)

work page 2020
[28]

In: 2024 IEEE Conference on Artificial Intelligence (CAI), pp

Varahagiri, S., Sinha, A., Dubey, S.R., Singh, S.K.: 3d-convolution guided spectral-spatial transformer for hyperspectral image classification. In: 2024 IEEE Conference on Artificial Intelligence (CAI), pp. 8–14 (2024). IEEE

work page 2024
[29]

IEEE Transactions on Geoscience and Remote Sensing62, 1–14 (2024)

Chen, N., Fang, L., Xia, Y., Xia, S., Liu, H., Yue, J.: Spectral query spatial: Revis- iting the role of center pixel in transformer for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing62, 1–14 (2024)

work page 2024
[30]

Neural Networks187, 107311 (2025)

Xu, Y., Wang, D., Zhang, L., Zhang, L.: Dual selective fusion transformer network for hyperspectral image classification. Neural Networks187, 107311 (2025)

work page 2025
[31]

In: 2022 International Conference on Data Science, Agents & Artificial Intelligence (ICDSAAI), vol

Ravikumar, A., Rohit, P., Nair, M.K., Bhatia, V.: Hyperspectral image classi- fication using deep matrix capsules. In: 2022 International Conference on Data Science, Agents & Artificial Intelligence (ICDSAAI), vol. 1, pp. 1–7 (2022). IEEE

work page 2022
[32]

IEEE transactions on signal processing54(8), 2910–2921 (2006)

Wu, Y., Hu, D., Wu, M., Hu, X.: A numerical-integration perspective on gaussian filters. IEEE transactions on signal processing54(8), 2910–2921 (2006)

work page 2006
[33]

IEEE transactions on signal processing61(2), 380–391 (2012)

Krishnan, S.R., Seelamantula, C.S.: On the selection of optimum savitzky-golay filters. IEEE transactions on signal processing61(2), 380–391 (2012)

work page 2012
[34]

IEEE Transactions on Image processing10(2), 231–241 (2001)

Chan, T.F., Osher, S., Shen, J.: The digital tv filter and nonlinear denoising. IEEE Transactions on Image processing10(2), 231–241 (2001)

work page 2001
[35]

In: International Conference on Machine Learning, pp

Mao, A., Mohri, M., Zhong, Y.: Cross-entropy loss functions: Theoretical analysis and applications. In: International Conference on Machine Learning, pp. 23803– 23828 (2023). pmlr

work page 2023
[36]

IEEE transactions on neural networks and learning systems29(7), 2896–2908 (2017)

Shi, W., Gong, Y., Tao, X., Zheng, N.: Training dcnn by combining max- margin, max-correlation objectives, and correntropy loss for multilabel image classification. IEEE transactions on neural networks and learning systems29(7), 2896–2908 (2017)

work page 2017
[37]

In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp

Liu, L., Qi, H.: Learning effective binary descriptors via cross entropy. In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1251– 1258 (2017). IEEE

work page 2017
[38]

In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp

Guan, L.: Weight prediction boosts the convergence of adamw. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 329–340 (2023). Springer

work page 2023
[39]

Adam-mini: Use fewer learning rates to gain more.arXiv preprint arXiv:2406.16793,

Zhang, Y., Chen, C., Li, Z., Ding, T., Wu, C., Kingma, D.P., Ye, Y., Luo, Z.- Q., Sun, R.: Adam-mini: Use fewer learning rates to gain more. arXiv preprint arXiv:2406.16793 (2024) 32

work page arXiv 2024
[40]

In: International Conference on Machine Learning, pp

Gower, R.M., Loizou, N., Qian, X., Sailanbayev, A., Shulgin, E., Richt´ arik, P.: Sgd: General analysis and improved rates. In: International Conference on Machine Learning, pp. 5200–5209 (2019). PMLR

work page 2019
[41]

Wang, Y., Hu, X., Hu, Y.,et al.: Boosting domain generalization in remote sensing image segmentation via style mapping and general prototypical contrast. Int. J. Comput. Vis. (2025) https://doi.org/10.1007/s11263-025-02568-3

work page doi:10.1007/s11263-025-02568-3 2025
[42]

Fu, Y., Lam, A., Sato, I.,et al.: Adaptive spatial-spectral dictionary learning for hyperspectral image restoration. Int. J. Comput. Vis.122, 228–245 (2017) https://doi.org/10.1007/s11263-016-0921-6

work page doi:10.1007/s11263-016-0921-6 2017
[43]

Prentice Hall, Upper Saddle River, NJ (2009)

Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Prentice Hall, Upper Saddle River, NJ (2009)

work page 2009
[44]

McGraw- Hill, New York (1999)

Bracewell, R.N.: The Fourier Transform and Its Applications, 3rd edn. McGraw- Hill, New York (1999)

work page 1999
[45]

Springer, New York (2002)

Jolliffe, I.T.: Principal Component Analysis, 2nd edn. Springer, New York (2002)

work page 2002
[46]

Shannon , title =

Shannon, C.E.: Communication in the presence of noise. Proceedings of the IRE 37(1), 10–21 (1949) https://doi.org/10.1109/JRPROC.1949.232969

work page doi:10.1109/jrproc.1949.232969 1949
[47]

Kluwer Academic / Plenum Publishers, New York (2003)

Chang, C.-I.: Hyperspectral Imaging: Techniques for Spectral Detection and Classification. Kluwer Academic / Plenum Publishers, New York (2003)

work page 2003
[48]

Academic Press, Burlington, MA (2006) 33

Schowengerdt, R.A.: Remote Sensing: Models and Methods for Image Processing, 3rd edn. Academic Press, Burlington, MA (2006) 33

work page 2006

[1] [1]

IEEE Transactions on Geoscience and Remote Sensing (2025)

He, Y., Tu, B., Liu, B., Li, J., Plaza, A.: Hsi-mformer: Integrating mamba and transformer experts for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing (2025)

work page 2025

[2] [2]

El-Gabri, A.R., Aly, H.A., Ghoniemy, T.S.,et al.: DLRA-Net: Deep local residual attention network with contextual refinement for spectral super- resolution. Int. J. Comput. Vis.133, 1499–1531 (2025) https://doi.org/10.1007/ s11263-024-02238-w

work page 2025

[3] [3]

Remote Sens

Plaza, A., Benediktsson, J.A., Boardman, J.W., Brazile, J., Bruzzone, L., Camps- Valls, G., Chanussot, J., Fauvel, M., Gamba, P., Gualtieri, A., Marconcini, M., Tilton, J.C., Trianni, G.: Recent advances in techniques for hyperspectral image processing. Remote Sens. Environ.113(Supplement 1), 110–122 (2009) https: //doi.org/10.1016/j.rse.2007.07.028

work page doi:10.1016/j.rse.2007.07.028 2009

[4] [4]

IEEE Signal processing magazine19(1), 17–28 (2002)

Landgrebe, D.: Hyperspectral image data analysis. IEEE Signal processing magazine19(1), 17–28 (2002)

work page 2002

[5] [5]

ISPRS Journal of Photogrammetry and Remote Sensing158, 279–317 (2019) 29

Paoletti, M.E., Haut, J.M., Plaza, J., Plaza, A.: Deep learning classifiers for hyperspectral imaging: A review. ISPRS Journal of Photogrammetry and Remote Sensing158, 279–317 (2019) 29

work page 2019

[6] [6]

Wang, T., Yan, Z., Li, J.,et al.: Hyperspectral and multispectral image fusion with arbitrary resolution through self-supervised representations. Int. J. Comput. Vis. (2025) https://doi.org/10.1007/s11263-025-02540-1

work page doi:10.1007/s11263-025-02540-1 2025

[7] [7]

Liu, Y., Dian, R., Li, S.: Low-rank transformer for high-resolution hyperspectral computational imaging. Int. J. Comput. Vis.133, 809–824 (2025) https://doi. org/10.1007/s11263-024-02203-7

work page doi:10.1007/s11263-024-02203-7 2025

[8] [8]

IEEE Transactions on Pattern Analysis and Machine Intelligence46(12), 8036–8055 (2024)

Wang, Y., Yue, Y., Lu, R., Han, Y., Song, S., Huang, G.: Efficienttrain++: Generalized curriculum learning for efficient visual backbone training. IEEE Transactions on Pattern Analysis and Machine Intelligence46(12), 8036–8055 (2024)

work page 2024

[9] [9]

Advances in neural information processing systems33, 1877–1901 (2020)

Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A.,et al.: Language models are few-shot learners. Advances in neural information processing systems33, 1877–1901 (2020)

work page 1901

[10] [10]

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (long and Short Papers), pp. 4171–4186 (2019)

work page 2019

[11] [11]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp

He, K., Chen, X., Xie, S., Li, Y., Doll´ ar, P., Girshick, R.: Masked autoencoders are scalable vision learners. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 16000–16009 (2022)

work page 2022

[12] [12]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp

Li, F., Zhang, H., Liu, S., Guo, J., Ni, L.M., Zhang, L.: Dn-detr: Accelerate detr training by introducing query denoising. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13619–13627 (2022)

work page 2022

[13] [13]

Advances in neural information processing systems 33, 3833–3845 (2020)

Zoph, B., Ghiasi, G., Lin, T.-Y., Cui, Y., Liu, H., Cubuk, E.D., Le, Q.: Rethinking pre-training and self-training. Advances in neural information processing systems 33, 3833–3845 (2020)

work page 2020

[14] [14]

In: 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pp

Romero, J., Yin, J., Laanait, N., Xie, B., Young, M.T., Treichler, S., Starchenko, V., Borisevich, A., Sergeev, A., Matheson, M.: Accelerating collective communica- tion in data parallel training across deep learning frameworks. In: 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pp. 1027–1040 (2022)

work page 2022

[15] [15]

IEEE/ACM Transactions on Audio, Speech, and Language Processing24(4), 796–806 (2016) 30

Wang, Z.-Q., Wang, D.: A joint training framework for robust automatic speech recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing24(4), 796–806 (2016) 30

work page 2016

[16] [16]

Journal of Power Sources601, 234292 (2024)

Su, Q., Huang, R., He, H.: Heterogeneous multi-agent deep reinforcement learning for eco-driving of hybrid electric tracked vehicles: A heuristic training framework. Journal of Power Sources601, 234292 (2024)

work page 2024

[17] [17]

Future Generation Computer Systems163, 107528 (2025)

Yu, X., Gao, Z., Xiong, Z., Zhao, C., Yang, Y.: Ddpg-adaptconfig: A deep reinforcement learning framework for adaptive device selection and training configuration in heterogeneity federated learning. Future Generation Computer Systems163, 107528 (2025)

work page 2025

[18] [18]

IEEE transactions on pattern analysis and machine intelligence44(9), 4555–4576 (2021)

Wang, X., Chen, Y., Zhu, W.: A survey on curriculum learning. IEEE transactions on pattern analysis and machine intelligence44(9), 4555–4576 (2021)

work page 2021

[19] [19]

Neural Computing and Applications36(9), 4709–4725 (2024)

Zhu, H., Xie, W., Mu, Y., Xu, J., Wang, F.L., Qu, Y., Hao, T.: A new semi- supervised fuzzy k-means clustering method with dynamic adjustment and label discrimination. Neural Computing and Applications36(9), 4709–4725 (2024)

work page 2024

[20] [20]

In: Proceedings of the 2021 International Conference on Multimodal Interaction, pp

Hirano, Y., Okada, S., Komatani, K.: Recognizing social signals with weakly supervised multitask learning for multimodal dialogue systems. In: Proceedings of the 2021 International Conference on Multimodal Interaction, pp. 141–149 (2021)

work page 2021

[21] [21]

IEEE Transactions on Circuits and Systems for Video Technology34(8), 7165–7175 (2024)

Xu, J., Ma, X., Zhang, L., Zhang, B., Chen, T.: Push-and-pull: A general training framework with differential augmentor for domain generalized point cloud classi- fication. IEEE Transactions on Circuits and Systems for Video Technology34(8), 7165–7175 (2024)

work page 2024

[22] [22]

Applied Spectroscopy Reviews56(4), 289–323 (2021)

Calin, M.A., Calin, A.C., Nicolae, D.N.: Application of airborne and spaceborne hyperspectral imaging techniques for atmospheric research: Past, present, and future. Applied Spectroscopy Reviews56(4), 289–323 (2021)

work page 2021

[23] [23]

Yan, H.Y., Zheng, R.Z., Boehm, B.B., Shaga, S.S., Black, D.B., Russell, R.R., Kursun, O.K.: Cloudpatch-7 hyperspectral dataset (2024)

work page 2024

[24] [24]

In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recogni- tion. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

work page 2016

[25] [25]

In: 2023 3rd International Conference on Innovative Sustainable Computational Technologies (CISCT), pp

Todi, A., Narula, N., Sharma, M., Gupta, U.: Convnext: A contemporary archi- tecture for convolutional neural networks for image classification. In: 2023 3rd International Conference on Innovative Sustainable Computational Technologies (CISCT), pp. 1–6 (2023). IEEE

work page 2023

[26] [26]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp

Hampali, S., Sarkar, S.D., Rad, M., Lepetit, V.: Keypoint transformer: Solving joint identification in challenging hands and object interactions for accurate 3d pose estimation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11090–11100 (2022)

work page 2022

[27] [27]

IEEE Transactions on Geoscience and Remote Sensing59(6), 5040–5053 (2020)

Wang, J., Gao, F., Dong, J., Du, Q.: Adaptive dropblock-enhanced generative 31 adversarial networks for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing59(6), 5040–5053 (2020)

work page 2020

[28] [28]

In: 2024 IEEE Conference on Artificial Intelligence (CAI), pp

Varahagiri, S., Sinha, A., Dubey, S.R., Singh, S.K.: 3d-convolution guided spectral-spatial transformer for hyperspectral image classification. In: 2024 IEEE Conference on Artificial Intelligence (CAI), pp. 8–14 (2024). IEEE

work page 2024

[29] [29]

IEEE Transactions on Geoscience and Remote Sensing62, 1–14 (2024)

Chen, N., Fang, L., Xia, Y., Xia, S., Liu, H., Yue, J.: Spectral query spatial: Revis- iting the role of center pixel in transformer for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing62, 1–14 (2024)

work page 2024

[30] [30]

Neural Networks187, 107311 (2025)

Xu, Y., Wang, D., Zhang, L., Zhang, L.: Dual selective fusion transformer network for hyperspectral image classification. Neural Networks187, 107311 (2025)

work page 2025

[31] [31]

In: 2022 International Conference on Data Science, Agents & Artificial Intelligence (ICDSAAI), vol

Ravikumar, A., Rohit, P., Nair, M.K., Bhatia, V.: Hyperspectral image classi- fication using deep matrix capsules. In: 2022 International Conference on Data Science, Agents & Artificial Intelligence (ICDSAAI), vol. 1, pp. 1–7 (2022). IEEE

work page 2022

[32] [32]

IEEE transactions on signal processing54(8), 2910–2921 (2006)

Wu, Y., Hu, D., Wu, M., Hu, X.: A numerical-integration perspective on gaussian filters. IEEE transactions on signal processing54(8), 2910–2921 (2006)

work page 2006

[33] [33]

IEEE transactions on signal processing61(2), 380–391 (2012)

Krishnan, S.R., Seelamantula, C.S.: On the selection of optimum savitzky-golay filters. IEEE transactions on signal processing61(2), 380–391 (2012)

work page 2012

[34] [34]

IEEE Transactions on Image processing10(2), 231–241 (2001)

Chan, T.F., Osher, S., Shen, J.: The digital tv filter and nonlinear denoising. IEEE Transactions on Image processing10(2), 231–241 (2001)

work page 2001

[35] [35]

In: International Conference on Machine Learning, pp

Mao, A., Mohri, M., Zhong, Y.: Cross-entropy loss functions: Theoretical analysis and applications. In: International Conference on Machine Learning, pp. 23803– 23828 (2023). pmlr

work page 2023

[36] [36]

IEEE transactions on neural networks and learning systems29(7), 2896–2908 (2017)

Shi, W., Gong, Y., Tao, X., Zheng, N.: Training dcnn by combining max- margin, max-correlation objectives, and correntropy loss for multilabel image classification. IEEE transactions on neural networks and learning systems29(7), 2896–2908 (2017)

work page 2017

[37] [37]

In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp

Liu, L., Qi, H.: Learning effective binary descriptors via cross entropy. In: 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1251– 1258 (2017). IEEE

work page 2017

[38] [38]

In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp

Guan, L.: Weight prediction boosts the convergence of adamw. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 329–340 (2023). Springer

work page 2023

[39] [39]

Adam-mini: Use fewer learning rates to gain more.arXiv preprint arXiv:2406.16793,

Zhang, Y., Chen, C., Li, Z., Ding, T., Wu, C., Kingma, D.P., Ye, Y., Luo, Z.- Q., Sun, R.: Adam-mini: Use fewer learning rates to gain more. arXiv preprint arXiv:2406.16793 (2024) 32

work page arXiv 2024

[40] [40]

In: International Conference on Machine Learning, pp

Gower, R.M., Loizou, N., Qian, X., Sailanbayev, A., Shulgin, E., Richt´ arik, P.: Sgd: General analysis and improved rates. In: International Conference on Machine Learning, pp. 5200–5209 (2019). PMLR

work page 2019

[41] [41]

Wang, Y., Hu, X., Hu, Y.,et al.: Boosting domain generalization in remote sensing image segmentation via style mapping and general prototypical contrast. Int. J. Comput. Vis. (2025) https://doi.org/10.1007/s11263-025-02568-3

work page doi:10.1007/s11263-025-02568-3 2025

[42] [42]

Fu, Y., Lam, A., Sato, I.,et al.: Adaptive spatial-spectral dictionary learning for hyperspectral image restoration. Int. J. Comput. Vis.122, 228–245 (2017) https://doi.org/10.1007/s11263-016-0921-6

work page doi:10.1007/s11263-016-0921-6 2017

[43] [43]

Prentice Hall, Upper Saddle River, NJ (2009)

Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Prentice Hall, Upper Saddle River, NJ (2009)

work page 2009

[44] [44]

McGraw- Hill, New York (1999)

Bracewell, R.N.: The Fourier Transform and Its Applications, 3rd edn. McGraw- Hill, New York (1999)

work page 1999

[45] [45]

Springer, New York (2002)

Jolliffe, I.T.: Principal Component Analysis, 2nd edn. Springer, New York (2002)

work page 2002

[46] [46]

Shannon , title =

Shannon, C.E.: Communication in the presence of noise. Proceedings of the IRE 37(1), 10–21 (1949) https://doi.org/10.1109/JRPROC.1949.232969

work page doi:10.1109/jrproc.1949.232969 1949

[47] [47]

Kluwer Academic / Plenum Publishers, New York (2003)

Chang, C.-I.: Hyperspectral Imaging: Techniques for Spectral Detection and Classification. Kluwer Academic / Plenum Publishers, New York (2003)

work page 2003

[48] [48]

Academic Press, Burlington, MA (2006) 33

Schowengerdt, R.A.: Remote Sensing: Models and Methods for Image Processing, 3rd edn. Academic Press, Burlington, MA (2006) 33

work page 2006