pith. sign in

arxiv: 2605.00310 · v1 · submitted 2026-05-01 · 💻 cs.CV · cs.AI· cs.LG

Beyond Visual Fidelity: Benchmarking Super-Resolution Models for Large-Scale Remote Sensing Imagery via Downstream Task Integration

Pith reviewed 2026-05-09 20:21 UTC · model grok-4.3

classification 💻 cs.CV cs.AIcs.LG
keywords super-resolutionremote sensingbenchmarkdownstream tasksfidelity metricsland cover classificationEarth observationimage quality
0
0 comments X

The pith

Super-resolution models picked by PSNR or SSIM often deliver worse results on actual remote sensing tasks than lower-scoring alternatives.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Super-resolution methods reconstruct sharper satellite images from coarse inputs, promising better support for monitoring applications in agriculture, urban planning, and disaster response. Standard benchmarks judge success only by how closely the output matches a high-resolution reference using pixel-wise scores like PSNR and SSIM. This paper introduces GeoSR-Bench, a dataset of 36,000 spatially matched image pairs at multiple resolution scales, and runs 270 controlled experiments that feed the super-resolved outputs into five real downstream tasks per scenario. The measured correlations between fidelity scores and task accuracy are frequently weak or negative. The finding indicates that visual quality metrics alone give unreliable guidance when the end goal is accurate Earth observation outputs rather than prettier pictures.

Core claim

GeoSR-Bench supplies the first large-scale, task-integrated benchmark for super-resolution in remote sensing. It pairs low- and high-resolution imagery across diverse land covers and directly measures how nine different SR models affect performance on land cover segmentation, infrastructure mapping, and biophysical variable estimation. Results across GAN, transformer, neural operator, and diffusion architectures show that gains on traditional fidelity metrics do not reliably produce gains on these tasks and can even reduce task accuracy.

What carries the argument

GeoSR-Bench, a collection of spatially co-located, temporally aligned, and quality-controlled image pairs from 36,000 locations that links SR outputs to five downstream task models per scenario.

If this is right

  • SR model rankings for remote sensing shift when evaluation uses task performance instead of fidelity metrics.
  • Developers should optimize or fine-tune SR networks directly on task losses rather than generic reconstruction objectives.
  • Diffusion and transformer SR models may outperform GANs on task utility even when trailing on PSNR or SSIM.
  • New benchmarks must include downstream task integration to guide SR progress for Earth observation.
  • Operational monitoring systems may achieve higher accuracy by selecting SR models according to task results rather than visual scores.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Incorporating auxiliary task losses during SR training could close the gap between visual fidelity and functional utility.
  • The same evaluation mismatch likely exists in other domains such as medical or astronomical imaging where tasks matter more than pixel accuracy.
  • Future SR research could test whether certain artifact types introduced by diffusion models systematically hurt change detection more than segmentation.
  • Agencies running large-scale Earth observation pipelines might adopt task-integrated benchmarks to replace current fidelity-only leaderboards.

Load-bearing premise

The five chosen downstream tasks and the 36,000 image locations represent the practical value of super-resolved imagery in real Earth monitoring workflows.

What would settle it

A replication of the 270-setting experiments on a fresh geographic sample or with different task models that instead finds consistently strong positive correlations between PSNR/SSIM gains and downstream accuracy.

Figures

Figures reproduced from arXiv: 2605.00310 by Dinesh Manocha, Gengchen Mai, Kangyang Chai, Sergii Skakun, Xiaowei Jia, Yanhua Li, Yiqun Xie, Zhihao Wang, Zhili Li.

Figure 1
Figure 1. Figure 1: GeoSR benchmark focuses on two cross-platform SR tasks: MODIS [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: GeoSR Dataset Construction Process Overview. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 1
Figure 1. Figure 1: These two SR tasks also cover diverse utilities. For [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 3
Figure 3. Figure 3: Spatial distributions of image pairs in (a) MODIS-to-Landsat-8 and (b) Sentinel-2-to-NAIP SR coincident datasets. [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: PSNR and SSIM do not always align with visual perception in SR. [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Relative performance comparison across SR models on the MODIS-to-Landsat-8 downstream tasks using SegFormer as the downstream model. Relative [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Relative performance comparison across SR models on the Sentinel-2-to-NAIP downstream tasks using SegFormer as the downstream model. Relative [PITH_FULL_IMAGE:figures/full_fig_p013_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Pearson correlation between visual fidelity metrics (PSNR/SSIM) and downstream task performance computed within the top- [PITH_FULL_IMAGE:figures/full_fig_p014_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Spearman’s rank correlation between visual fidelity metrics (PSNR/SSIM) and downstream task performance computed within the top- [PITH_FULL_IMAGE:figures/full_fig_p015_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Downstream task performance comparison on the TreeFinder dataset [PITH_FULL_IMAGE:figures/full_fig_p015_9.png] view at source ↗
read the original abstract

Super-resolution (SR) techniques have made major advances in reconstructing high-resolution images from low-resolution inputs. The increased resolution provides visual enhancement and utility for monitoring tasks. In particular, SR has been increasingly developed for satellite-based Earth observation, with applications in urban planning, agriculture, ecology, and disaster response. However, existing SR studies and benchmarks typically use fidelity metrics such as PSNR or SSIM, whereas the true utility of super-resolved images lies in supporting downstream tasks such as land cover classification, biomass estimation, and change detection. To bridge this gap, we introduce GeoSR-Bench, a downstream task-integrated SR benchmark dataset to evaluate SR models beyond fidelity metrics. GeoSR-Bench comprises spatially co-located, temporally aligned, and quality-controlled image pairs from about 36,000 locations across diverse land covers, spanning resolutions from 500m to 0.6m. To the best of our knowledge, GeoSR-Bench is the first SR benchmark that directly connects improved image resolution from SR models with downstream Earth monitoring tasks, including land cover segmentation, infrastructure mapping, and biophysical variable estimation. Using GeoSR-Bench, we benchmark GAN, transformer, neural operator, and diffusion-based SR models on perceptual quality and downstream task performance. We conduct experiments with 270 settings, covering 2 cross-platform SR tasks, 9 SR models, 3 downstream task models, and 5 downstream tasks for each SR task. The results show that improvements in traditional SR metrics often do not correlate with gains in task performance, and the correlations can be negative, indicating that these metrics provide limited guidance for selecting superior models for downstream tasks. This reveals the need to integrate downstream tasks into SR model development and evaluation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces GeoSR-Bench, a large-scale benchmark with ~36,000 quality-controlled, co-located image pairs spanning resolutions from 500m to 0.6m across diverse land covers. It evaluates 9 SR models (including GAN, transformer, neural operator, and diffusion-based) in 2 cross-platform scenarios through 270 experimental settings with 3 downstream task models and 5 tasks per scenario (land cover segmentation, infrastructure mapping, biophysical variable estimation). The primary result is that traditional SR fidelity metrics (PSNR, SSIM) exhibit weak or negative correlations with downstream task performance gains.

Significance. The scale of the benchmark and the direct integration of downstream tasks represent a valuable contribution to the field of remote sensing image processing. If the observed lack of positive correlation between fidelity metrics and task utility is robust, this could significantly influence SR model development by prioritizing task performance over visual metrics, leading to more practical models for applications in urban planning, agriculture, and disaster response. The empirical benchmarking approach with held-out tasks is a strength.

major comments (2)
  1. [§3] §3 (Dataset): The description of the 36,000 locations and quality control process lacks explicit details on selection criteria, data exclusion rules, and assessment of potential biases or representativeness across land cover types, which is critical to support the generalizability of the negative correlation findings.
  2. [§5] §5 (Experimental Results): The analysis of correlations between SR metrics and downstream task performance across the 270 settings does not include statistical significance tests, error bars, or confidence intervals; this weakens the central claim that correlations 'often do not correlate' or 'can be negative'.
minor comments (1)
  1. [Abstract] Abstract: A supplementary table breaking down the 270 settings by SR model, task, and platform would improve clarity of the experimental design.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thorough review and constructive comments on our manuscript. We appreciate the recognition of the benchmark's scale and the importance of integrating downstream tasks. Below, we provide point-by-point responses to the major comments.

read point-by-point responses
  1. Referee: [§3] §3 (Dataset): The description of the 36,000 locations and quality control process lacks explicit details on selection criteria, data exclusion rules, and assessment of potential biases or representativeness across land cover types, which is critical to support the generalizability of the negative correlation findings.

    Authors: We agree that more detailed information on the dataset curation process is necessary to bolster the generalizability of our results. In the revised version of the manuscript, we will expand the description in §3 to explicitly detail the selection criteria for the approximately 36,000 locations, the specific data exclusion rules applied during quality control, and an assessment of potential biases along with the representativeness across various land cover types. This addition will help substantiate the robustness of the observed correlations. revision: yes

  2. Referee: [§5] §5 (Experimental Results): The analysis of correlations between SR metrics and downstream task performance across the 270 settings does not include statistical significance tests, error bars, or confidence intervals; this weakens the central claim that correlations 'often do not correlate' or 'can be negative'.

    Authors: We concur that incorporating statistical rigor would strengthen the presentation of our findings. Accordingly, in the revised manuscript, we will augment the analysis in §5 by including statistical significance tests for the correlations, as well as error bars and confidence intervals where appropriate, across the 270 experimental settings. These additions will provide quantitative support for our conclusions regarding the weak or negative correlations between traditional SR fidelity metrics and downstream task performance. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical benchmarking study with independent experimental results

full rationale

The paper introduces GeoSR-Bench as a new dataset and performs direct empirical comparisons of 9 SR models across 2 scenarios, 3 task models, and 5 downstream tasks using 36,000 image pairs. No derivations, equations, or fitted parameters are presented as predictions; results are computed from held-out evaluations. No self-citation chains or ansatzes underpin the central claim of weak/negative correlations between fidelity metrics and task performance. The study is self-contained against external benchmarks and does not reduce its findings to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work rests on domain assumptions about data alignment and task relevance rather than new mathematical axioms, free parameters, or invented entities.

axioms (1)
  • domain assumption Spatially co-located, temporally aligned, and quality-controlled image pairs from diverse land covers form a fair basis for SR benchmarking.
    Invoked in the construction of GeoSR-Bench from 36,000 locations.

pith-pipeline@v0.9.0 · 5656 in / 1332 out tokens · 49802 ms · 2026-05-09T20:21:25.530642+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

85 extracted references · 85 canonical work pages

  1. [1]

    Ntire 2017 challenge on single image super-resolution: Dataset and study

    Eirikur Agustsson and Radu Timofte. Ntire 2017 challenge on single image super-resolution: Dataset and study. InProceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 126–135, 2017

  2. [2]

    Improving component substitution pansharpening through multivariate regression of ms + pan data.IEEE Transactions on Geoscience and Remote Sensing, 45(10):3230– 3239, 2007

    Bruno Aiazzi, Stefano Baronti, and Massimo Selva. Improving component substitution pansharpening through multivariate regression of ms + pan data.IEEE Transactions on Geoscience and Remote Sensing, 45(10):3230– 3239, 2007

  3. [3]

    Canopy height model and naip imagery pairs across conus.Scientific Data, 12(1):322, 2025

    Brady W Allred, Sarah E McCord, and Scott L Morford. Canopy height model and naip imagery pairs across conus.Scientific Data, 12(1):322, 2025

  4. [4]

    Apache Sedona

    Apache. Apache Sedona. https://sedona.apache.org/1.6.0/, 2025. Ac- cessed: 2025-11-05

  5. [5]

    Cnn-based super-resolution of hyperspectral images.IEEE Transactions on Geoscience and Remote Sensing, 58(9):6106–6121, 2020

    Pattathal V Arun, Krishna Mohan Buddhiraju, Alok Porwal, and Jocelyn Chanussot. Cnn-based super-resolution of hyperspectral images.IEEE Transactions on Geoscience and Remote Sensing, 58(9):6106–6121, 2020

  6. [6]

    Toward real-world single image super-resolution: A new benchmark and a new model

    Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, and Lei Zhang. Toward real-world single image super-resolution: A new benchmark and a new model. InProceedings of the IEEE/CVF international conference on computer vision, pages 3086–3095, 2019. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 16

  7. [7]

    Large-scale individual building extraction from open-source satellite imagery via super-resolution-based instance segmentation approach

    Shenglong Chen, Yoshiki Ogawa, Chenbo Zhao, and Yoshihide Sekimoto. Large-scale individual building extraction from open-source satellite imagery via super-resolution-based instance segmentation approach. ISPRS Journal of Photogrammetry and Remote Sensing, 195:129–152, 2023

  8. [8]

    Activating more pixels in image super-resolution transformer

    Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, and Chao Dong. Activating more pixels in image super-resolution transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 22367–22377, 2023

  9. [9]

    Binarized diffusion model for image super- resolution.Advances in Neural Information Processing Systems, 37:30651–30669, 2024

    Zheng Chen, Haotong Qin, Yong Guo, Xiongfei Su, Xin Yuan, Linghe Kong, and Yulun Zhang. Binarized diffusion model for image super- resolution.Advances in Neural Information Processing Systems, 37:30651–30669, 2024

  10. [10]

    Recursive generalization transformer for image super-resolution.arXiv preprint arXiv:2303.06373, 2023

    Zheng Chen, Yulun Zhang, Jinjin Gu, Linghe Kong, and Xiaokang Yang. Recursive generalization transformer for image super-resolution.arXiv preprint arXiv:2303.06373, 2023

  11. [11]

    A solar panel dataset of very high resolution satellite imagery to support the sustainable development goals

    Cecilia N Clark and Fabio Pacifici. A solar panel dataset of very high resolution satellite imagery to support the sustainable development goals. Scientific Data, 10(1):636, 2023

  12. [12]

    Open high- resolution satellite imagery: The worldstrat dataset–with application to super-resolution.Advances in Neural Information Processing Systems, 35:25979–25991, 2022

    Julien Cornebise, Ivan Oršoli ´c, and Freddie Kalaitzis. Open high- resolution satellite imagery: The worldstrat dataset–with application to super-resolution.Advances in Neural Information Processing Systems, 35:25979–25991, 2022

  13. [13]

    National land cover database (nlcd) 2021 products, 2023

    Jon Dewitz. National land cover database (nlcd) 2021 products, 2023. U.S. Geological Survey data release

  14. [14]

    Learning a deep convolutional network for image super-resolution

    Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. Learning a deep convolutional network for image super-resolution. InComputer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part IV 13, pages 184–199. Springer, 2014

  15. [15]

    Esa worldcover global land cover service

    European Space Agency (ESA) WorldCover. Esa worldcover global land cover service. https://esa-worldcover.org/en/data-access, 2021. Accessed: 2025-12-11

  16. [16]

    Remote sensing time series analysis: A review of data and applications

    Yingchun Fu, Zhe Zhu, Liangyun Liu, Wenfeng Zhan, Tao He, Huanfeng Shen, Jun Zhao, Yongxue Liu, Hongsheng Zhang, Zihan Liu, et al. Remote sensing time series analysis: A review of data and applications. Journal of Remote Sensing, 4:0285, 2024

  17. [17]

    Copernicus_s2_cloud_probability: Sentinel -2 cloud probability

    Google Earth Engine. Copernicus_s2_cloud_probability: Sentinel -2 cloud probability. https://developers.google.com/earth-engine/datasets/catalog/ COPERNICUS_S2_CLOUD_PROBABILITY, 2025. Accessed: 2025- 12-11

  18. [18]

    Spatial-temporal super-resolution of satellite imagery via conditional pixel synthesis

    Yutong He, Dingjie Wang, Nicholas Lai, William Zhang, Chenlin Meng, Marshall Burke, David Lobell, and Stefano Ermon. Spatial-temporal super-resolution of satellite imagery via conditional pixel synthesis. Advances in Neural Information Processing Systems, 34:27903–27915, 2021

  19. [19]

    Tracking urbanization in developing regions with remote sensing spatial-temporal super-resolution.arXiv preprint arXiv:2204.01736, 2022

    Yutong He, William Zhang, Chenlin Meng, Marshall Burke, David B Lobell, and Stefano Ermon. Tracking urbanization in developing regions with remote sensing spatial-temporal super-resolution.arXiv preprint arXiv:2204.01736, 2022

  20. [20]

    Cascaded diffusion models for high fidelity image generation.Journal of Machine Learning Research, 23(47):1–33, 2022

    Jonathan Ho, Chitwan Saharia, William Chan, David J Fleet, Mohammad Norouzi, and Tim Salimans. Cascaded diffusion models for high fidelity image generation.Journal of Machine Learning Research, 23(47):1–33, 2022

  21. [21]

    Efficient swin transformer for remote sensing image super-resolution.IEEE Transactions on Image Processing, 2024

    Xudong Kang, Puhong Duan, Jier Li, and Shutao Li. Efficient swin transformer for remote sensing image super-resolution.IEEE Transactions on Image Processing, 2024

  22. [22]

    Lobell, and Stefano Ermon

    Samar Khanna, Patrick Liu, Linqi Zhou, Chenlin Meng, Robin Rombach, Marshall Burke, David B. Lobell, and Stefano Ermon. Diffusionsat: A generative foundation model for satellite imagery. InThe Twelfth International Conference on Learning Representations, 2024

  23. [23]

    Accurate image super-resolution using very deep convolutional networks

    Jiwon Kim, Jung Kwon Lee, and Kyoung Mu Lee. Accurate image super-resolution using very deep convolutional networks. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 1646–1654, 2016

  24. [24]

    Single-image super-resolution using sparse regression and natural image prior.IEEE transactions on pattern analysis and machine intelligence, 32(6):1127–1133, 2010

    Kwang In Kim and Younghee Kwon. Single-image super-resolution using sparse regression and natural image prior.IEEE transactions on pattern analysis and machine intelligence, 32(6):1127–1133, 2010

  25. [25]

    Toward bridging the simulated-to-real gap: Benchmarking super-resolution on real data.IEEE transactions on pattern analysis and machine intelligence, 42(11):2944–2959, 2019

    Thomas Köhler, Michel Bätz, Farzad Naderi, André Kaup, Andreas Maier, and Christian Riess. Toward bridging the simulated-to-real gap: Benchmarking super-resolution on real data.IEEE transactions on pattern analysis and machine intelligence, 42(11):2944–2959, 2019

  26. [26]

    A real-world benchmark for sentinel-2 multi-image super-resolution.Scientific Data, 10(1):644, 2023

    Pawel Kowaleczko, Tomasz Tarasiewicz, Maciej Ziaja, Daniel Kostrzewa, Jakub Nalepa, Przemyslaw Rokita, and Michal Kawulok. A real-world benchmark for sentinel-2 multi-image super-resolution.Scientific Data, 10(1):644, 2023

  27. [27]

    A high-resolution canopy height model of the earth.Nature Ecology & Evolution, 7(11):1778–1789, 2023

    Nico Lang, Walter Jetz, Konrad Schindler, and Jan Dirk Wegner. A high-resolution canopy height model of the earth.Nature Ecology & Evolution, 7(11):1778–1789, 2023

  28. [28]

    Photo-realistic single image super- resolution using a generative adversarial network

    Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Jo- hannes Totz, Zehan Wang, et al. Photo-realistic single image super- resolution using a generative adversarial network. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 4681–4690, 2017

  29. [29]

    Transformer-based multistage enhancement for remote sensing image super-resolution.IEEE Transac- tions on Geoscience and Remote Sensing, 60:1–11, 2021

    Sen Lei, Zhenwei Shi, and Wenjing Mo. Transformer-based multistage enhancement for remote sensing image super-resolution.IEEE Transac- tions on Geoscience and Remote Sensing, 60:1–11, 2021

  30. [30]

    Sed: Semantic-aware discriminator for image super-resolution

    Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, and Zhibo Chen. Sed: Semantic-aware discriminator for image super-resolution. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 25784–25795, 2024

  31. [31]

    A global analysis of sentinel-2a, sentinel-2b and landsat-8 data revisit intervals and implications for terrestrial monitoring

    Jian Li and David P Roy. A global analysis of sentinel-2a, sentinel-2b and landsat-8 data revisit intervals and implications for terrestrial monitoring. Remote Sensing, 9(9):902, 2017

  32. [32]

    Lsdir: A large scale dataset for image restoration

    Yawei Li, Kai Zhang, Jingyun Liang, Jiezhang Cao, Ce Liu, Rui Gong, Yulun Zhang, Hao Tang, Yun Liu, Denis Demandolx, et al. Lsdir: A large scale dataset for image restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1775– 1787, 2023

  33. [33]

    A new sensor bias-driven spatio-temporal fusion model based on convolutional neural networks.Science China Information Sciences, 63:1–16, 2020

    Yunfei Li, Jun Li, Lin He, Jin Chen, and Antonio Plaza. A new sensor bias-driven spatio-temporal fusion model based on convolutional neural networks.Science China Information Sciences, 63:1–16, 2020

  34. [34]

    Swinir: Image restoration using swin transformer

    Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration using swin transformer. InProceedings of the IEEE/CVF international conference on computer vision, pages 1833–1844, 2021

  35. [35]

    Enhanced deep residual networks for single image super- resolution

    Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. Enhanced deep residual networks for single image super- resolution. InProceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 136–144, 2017

  36. [36]

    Text2earth: Unlocking text-driven remote sensing image generation with a global-scale dataset and a foundation model.arXiv preprint arXiv:2501.00895, 2025

    Chenyang Liu, Keyan Chen, Rui Zhao, Zhengxia Zou, and Zhenwei Shi. Text2earth: Unlocking text-driven remote sensing image generation with a global-scale dataset and a foundation model.arXiv preprint arXiv:2501.00895, 2025

  37. [37]

    JG Liu. Smoothing filter-based intensity modulation: A spectral preserve image fusion technique for improving spatial details.International Journal of remote sensing, 21(18):3461–3472, 2000

  38. [38]

    Difffno: Diffusion fourier neural operator

    Xiaoyi Liu and Hao Tang. Difffno: Diffusion fourier neural operator. In Proceedings of the Computer Vision and Pattern Recognition Conference, pages 150–160, 2025

  39. [39]

    Swin transformer: Hierarchical vision transformer using shifted windows

    Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin transformer: Hierarchical vision transformer using shifted windows. InProceedings of the IEEE/CVF international conference on computer vision, pages 10012–10022, 2021

  40. [40]

    Zeping Liu, Hong Tang, Lin Feng, and Siqing Lyu. China building rooftop area: the first multi-annual (2016–2021) and high-resolution (2.5 m) building rooftop area dataset in china derived with super-resolution segmentation from sentinel-2 imagery.Earth System Science Data, 15(8):3547–3572, 2023

  41. [41]

    Transformer for single image super-resolution

    Zhisheng Lu, Juncheng Li, Hong Liu, Chaoyan Huang, Linlin Zhang, and Tieyong Zeng. Transformer for single image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 457–466, 2022

  42. [42]

    Super- resolution of proba-v images using convolutional neural networks

    Marcus Märtens, Dario Izzo, Andrej Krzic, and Daniël Cox. Super- resolution of proba-v images using convolutional neural networks. Astrodynamics, 3:387–402, 2019

  43. [43]

    A conditional diffusion model with fast sampling strategy for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 2024

    Fanen Meng, Yijun Chen, Haoyu Jing, Laifu Zhang, Yiming Yan, Yingchao Ren, Sensen Wu, Tian Feng, Renyi Liu, and Zhenhong Du. A conditional diffusion model with fast sampling strategy for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 2024

  44. [44]

    Global urban areas: High-resolution population density and settlement data

    Meta Data for Good. Global urban areas: High-resolution population density and settlement data. https://dataforgood.facebook.com/dfg/tools/ globalurbanareas, 2023. Accessed: 2025-12-16

  45. [45]

    Sen2venµs, a dataset for the training of sentinel-2 super-resolution algorithms.Data, 7(7):96, 2022

    Julien Michel, Juan Vinasco-Salinas, Jordi Inglada, and Olivier Hagolle. Sen2venµs, a dataset for the training of sentinel-2 super-resolution algorithms.Data, 7(7):96, 2022

  46. [46]

    USRoadDetections: U.S

    Microsoft. USRoadDetections: U.S. Road Network Detection Dataset. https://github.com/microsoft/USRoadDetections, 2020. Accessed: 2025- 12-11

  47. [47]

    Ntsg landsat gross primary production (gpp) v2

    Numerical Terradynamic Simulation Group (NTSG). Ntsg landsat gross primary production (gpp) v2. https://developers.google.com/earth-engine/ JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 17 datasets/catalog/UMT_NTSG_v2_LANDSAT_GPP, 2021. Accessed: 2025-12-16

  48. [48]

    Landland- cov_baselc2022.tif, 2024

    University of Vermont Spatial Analysis Laboratory. Landland- cov_baselc2022.tif, 2024

  49. [49]

    Advancing image super-resolution techniques in remote sensing: A comprehensive survey.ISPRS Journal of Photogrammetry and Remote Sensing, 231:68–100, 2026

    Yunliang Qi, Meng Lou, Yimin Liu, Lu Li, Zhen Yang, and Wen Nie. Advancing image super-resolution techniques in remote sensing: A comprehensive survey.ISPRS Journal of Photogrammetry and Remote Sensing, 231:68–100, 2026

  50. [50]

    Cfat: Un- leashing triangular windows for image super-resolution

    Abhisek Ray, Gaurav Kumar, and Maheshkumar H Kolekar. Cfat: Un- leashing triangular windows for image super-resolution. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 26120–26129, 2024

  51. [51]

    Lavista Ferres, and Peyman Najafirad

    Caleb Robinson, Isaac Corley, Anthony Ortiz, Rahul Dodhia, Juan M. Lavista Ferres, and Peyman Najafirad. Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery, 2024

  52. [52]

    Image super-resolution via iterative refinement.IEEE transactions on pattern analysis and machine intelligence, 45(4):4713–4726, 2022

    Chitwan Saharia, Jonathan Ho, William Chan, Tim Salimans, David J Fleet, and Mohammad Norouzi. Image super-resolution via iterative refinement.IEEE transactions on pattern analysis and machine intelligence, 45(4):4713–4726, 2022

  53. [53]

    Rethinking image evaluation in super- resolution.arXiv preprint arXiv:2503.13074, 2025

    Shaolin Su, Josep M Rocafort, Danna Xue, David Serrano-Lozano, Lei Sun, and Javier Vazquez-Corral. Rethinking image evaluation in super- resolution.arXiv preprint arXiv:2503.13074, 2025

  54. [54]

    Deriving high spatiotemporal remote sensing images using deep convolutional network

    Zhenyu Tan, Peng Yue, Liping Di, and Junmei Tang. Deriving high spatiotemporal remote sensing images using deep convolutional network. Remote Sensing, 10(7):1066, 2018

  55. [55]

    Computer generated building footprints for the united states, 2018

    Bing Maps Team. Computer generated building footprints for the united states, 2018

  56. [56]

    Geological Survey

    U.S. Geological Survey. Landsat dynamic surface water extent (dswe) science products. https://www.usgs.gov/landsat-missions/ landsat-dynamic-surface-water-extent-science-products, 2022. Accessed: 2025-12-16

  57. [57]

    Naip seam lines by state

    USDA Farm Service Agency Aerial Photography Field Office. Naip seam lines by state. https://gdg.sc.egov.usda.gov/Catalog/ProductDescription/ NAIPSL.html, 2022. Accessed: 2025-12-11

  58. [58]

    Cropland data layer

    USDA National Agricultural Statistics Service. Cropland data layer. https://nassgeodata.gmu.edu/CropScape/, 2021. Published crop-specific data layer. Accessed: 12/16/2025

  59. [59]

    A cnn-based sentinel- 2 image super-resolution method using multiobjective training.IEEE Transactions on Geoscience and Remote Sensing, 61:1–14, 2023

    Vlad Vasilescu, Mihai Datcu, and Daniela Faur. A cnn-based sentinel- 2 image super-resolution method using multiobjective training.IEEE Transactions on Geoscience and Remote Sensing, 61:1–14, 2023

  60. [60]

    Towards real-world remote sensing image super- resolution: A new benchmark and an efficient model.IEEE Transactions on Geoscience and Remote Sensing, 2024

    Jia Wang, Liuyu Xiang, Lei Liu, Jiaochong Xu, Peipei Li, Qizhi Xu, and Zhaofeng He. Towards real-world remote sensing image super- resolution: A new benchmark and an efficient model.IEEE Transactions on Geoscience and Remote Sensing, 2024

  61. [61]

    Virtual image pair-based spatio-temporal fusion.Remote Sensing of Environment, 249:112009, 2020

    Qunming Wang, Yijie Tang, Xiaohua Tong, and Peter M Atkinson. Virtual image pair-based spatio-temporal fusion.Remote Sensing of Environment, 249:112009, 2020

  62. [62]

    Real-esrgan: Training real-world blind super-resolution with pure synthetic data

    Xintao Wang, Liangbin Xie, Chao Dong, and Ying Shan. Real-esrgan: Training real-world blind super-resolution with pure synthetic data. In Proceedings of the IEEE/CVF international conference on computer vision, pages 1905–1914, 2021

  63. [63]

    Esrgan: Enhanced super-resolution generative adversarial networks

    Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. Esrgan: Enhanced super-resolution generative adversarial networks. InProceedings of the European conference on computer vision (ECCV) workshops, pages 0–0, 2018

  64. [64]

    attention

    Yan Wang, Yi Liu, Shijie Zhao, Junlin Li, and Li Zhang. Camixersr: Only details need more" attention". InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 25837– 25846, 2024

  65. [65]

    Treefinder: A us-scale benchmark dataset for individual tree mortality monitoring using high- resolution aerial imagery

    Zhihao Wang, Cooper Li, Ruichen Wang, Lei Ma, George Hurtt, Xiaowei Jia, Gengchen Mai, Zhili Li, and Yiqun Xie. Treefinder: A us-scale benchmark dataset for individual tree mortality monitoring using high- resolution aerial imagery. InThe Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track

  66. [66]

    Image quality assessment: from error visibility to structural similarity

    Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4):600–612, 2004

  67. [67]

    Super-resolution neural operator

    Min Wei and Xuesong Zhang. Super-resolution neural operator. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 18247–18256, 2023

  68. [68]

    Xiao, Y., Yuan, Q., Jiang, K., He, J., Jin, X., Zhang, L.,

    Piper Wolters, Favyen Bastani, and Aniruddha Kembhavi. Zooming out on zooming in: Advancing super-resolution for remote sensing.arXiv preprint arXiv:2311.18082, 2023

  69. [69]

    Free-flowing rivers - wwf hydrosheds v1

    WWF HydroSHEDS. Free-flowing rivers - wwf hydrosheds v1. https://developers.google.com/earth-engine/datasets/catalog/WWF_ HydroSHEDS_v1_FreeFlowingRivers, 2000. Accessed: 2025-12-16

  70. [70]

    Ediffsr: An efficient diffusion probabilistic model for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 62:1–14, 2023

    Yi Xiao, Qiangqiang Yuan, Kui Jiang, Jiang He, Xianyu Jin, and Liangpei Zhang. Ediffsr: An efficient diffusion probabilistic model for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 62:1–14, 2023

  71. [71]

    Segformer: Simple and efficient design for semantic segmentation with transformers.Advances in neural information processing systems, 34:12077–12090, 2021

    Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M Alvarez, and Ping Luo. Segformer: Simple and efficient design for semantic segmentation with transformers.Advances in neural information processing systems, 34:12077–12090, 2021

  72. [72]

    Neurop-diff: Continuous remote sensing image super-resolution via neural operator diffusion.arXiv preprint arXiv:2501.09054, 2025

    Zihao Xu, Yuzhi Tang, Bowen Xu, and Qingquan Li. Neurop-diff: Continuous remote sensing image super-resolution via neural operator diffusion.arXiv preprint arXiv:2501.09054, 2025

  73. [73]

    Learning texture transformer network for image super-resolution

    Fuzhi Yang, Huan Yang, Jianlong Fu, Hongtao Lu, and Baining Guo. Learning texture transformer network for image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5791–5800, 2020

  74. [74]

    On single image scale- up using sparse-representations

    Roman Zeyde, Michael Elad, and Matan Protter. On single image scale- up using sparse-representations. InInternational conference on curves and surfaces, pages 711–730. Springer, 2010

  75. [75]

    Transcending the limit of local window: Advanced super-resolution transformer with adaptive token dictionary

    Leheng Zhang, Yawei Li, Xingyu Zhou, Xiaorui Zhao, and Shuhang Gu. Transcending the limit of local window: Advanced super-resolution transformer with adaptive token dictionary. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2856–2865, 2024

  76. [76]

    Uncertainty- guided perturbation for image super-resolution diffusion model

    Leheng Zhang, Weiyi You, Kexuan Shi, and Shuhang Gu. Uncertainty- guided perturbation for image super-resolution diffusion model. In Proceedings of the Computer Vision and Pattern Recognition Conference, pages 17980–17989, 2025

  77. [77]

    Adding conditional control to text-to-image diffusion models

    Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. InProceedings of the IEEE/CVF international conference on computer vision, pages 3836– 3847, 2023

  78. [78]

    The unreasonable effectiveness of deep features as a perceptual metric

    Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual metric. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 586–595, 2018

  79. [79]

    Image super-resolution using very deep residual channel attention networks

    Yulun Zhang, Kunpeng Li, Kai Li, Lichen Wang, Bineng Zhong, and Yun Fu. Image super-resolution using very deep residual channel attention networks. InProceedings of the European conference on computer vision (ECCV), pages 286–301, 2018

  80. [80]

    Single-image super-resolution based on rational fractal inter- polation.IEEE Transactions on Image Processing, 27(8):3782–3797, 2018

    Yunfeng Zhang, Qinglan Fan, Fangxun Bao, Yifang Liu, and Caiming Zhang. Single-image super-resolution based on rational fractal inter- polation.IEEE Transactions on Image Processing, 27(8):3782–3797, 2018

Showing first 80 references.