Beyond Visual Fidelity: Benchmarking Super-Resolution Models for Large-Scale Remote Sensing Imagery via Downstream Task Integration

Dinesh Manocha; Gengchen Mai; Kangyang Chai; Sergii Skakun; Xiaowei Jia; Yanhua Li; Yiqun Xie; Zhihao Wang; Zhili Li

arXiv: 2605.00310 · v1 · submitted 2026-05-01 · cs.CV · cs.AI · cs.LG


Pith reviewed 2026-05-09 20:21 UTC · model grok-4.3


keywords: super-resolution · remote sensing · benchmark · downstream tasks · fidelity metrics · land cover classification · Earth observation · image quality


The pith

Super-resolution models picked by PSNR or SSIM often deliver worse results on actual remote sensing tasks than lower-scoring alternatives.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Super-resolution methods reconstruct sharper satellite images from coarse inputs, promising better support for monitoring applications in agriculture, urban planning, and disaster response. Standard benchmarks judge success by how closely the output matches a high-resolution reference, using fidelity scores such as PSNR and SSIM. This paper introduces GeoSR-Bench, a benchmark of spatially co-located, temporally aligned image pairs from about 36,000 locations, spanning resolutions from 500 m to 0.6 m. It reports 270 experimental settings (2 cross-platform SR tasks, 9 SR models, 3 downstream task models, and 5 downstream tasks per SR task) in which super-resolved outputs are fed into downstream task models. The measured correlations between fidelity scores and task performance are often weak and sometimes negative. The finding indicates that visual quality metrics alone give unreliable guidance when the end goal is accurate Earth observation outputs rather than prettier pictures.
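For readers less familiar with the fidelity side of the comparison, the following is a minimal sketch of PSNR, the standard definition 10·log10(MAX² / MSE). The toy 2×2 images and the `max_value=255.0` default are assumptions made for illustration; this page does not state the data range or band handling the paper uses.

```python
import math


def mse(reference, estimate):
    """Mean squared error between two equally sized 2-D images (lists of rows)."""
    total = 0.0
    count = 0
    for ref_row, est_row in zip(reference, estimate):
        for ref_px, est_px in zip(ref_row, est_row):
            total += (ref_px - est_px) ** 2
            count += 1
    return total / count


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB; infinite when the images are identical."""
    err = mse(reference, estimate)
    if err == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / err)


# Hypothetical pixel values, not taken from the paper.
reference = [[10, 20], [30, 40]]
estimate = [[12, 18], [32, 38]]  # every pixel is off by 2, so MSE = 4

print(round(psnr(reference, estimate), 2))  # 42.11
```

PSNR depends only on the average squared pixel error, so two outputs with the same score can distribute that error very differently across the features a downstream model relies on.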

Core claim

GeoSR-Bench is, to the authors' knowledge, the first SR benchmark that directly connects SR outputs to downstream Earth monitoring tasks. It pairs low- and high-resolution imagery across diverse land covers and measures how nine SR models affect performance on land cover segmentation, infrastructure mapping, and biophysical variable estimation. Results across GAN, transformer, neural operator, and diffusion architectures show that gains on traditional fidelity metrics do not reliably produce gains on these tasks and can even coincide with lower task performance.

What carries the argument

GeoSR-Bench, a collection of spatially co-located, temporally aligned, and quality-controlled image pairs from about 36,000 locations that links SR outputs to five downstream tasks per SR scenario, each evaluated with three downstream task models.

If this is right

SR model rankings for remote sensing shift when evaluation uses task performance instead of fidelity metrics.
Developers should optimize or fine-tune SR networks directly on task losses rather than generic reconstruction objectives.
Diffusion and transformer SR models may outperform GANs on task utility even when trailing on PSNR or SSIM.
New benchmarks must include downstream task integration to guide SR progress for Earth observation.
Operational monitoring systems may achieve higher accuracy by selecting SR models according to task results rather than visual scores.
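To make the ranking-shift point concrete, here is a small sketch of Spearman's rank correlation, computed as Pearson correlation on average ranks. The five PSNR and task scores are invented for illustration and are not results from the paper; the function names are likewise hypothetical.

```python
def average_ranks(values):
    """Rank values from 1 (smallest) upward; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman's rho: Pearson correlation applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Made-up scores for five hypothetical SR models (same model order in both lists).
psnr_db = [31.2, 30.5, 29.8, 28.9, 28.1]
task_score = [0.52, 0.61, 0.58, 0.66, 0.63]

print(average_ranks(psnr_db))     # [5.0, 4.0, 3.0, 2.0, 1.0]
print(average_ranks(task_score))  # [1.0, 3.0, 2.0, 5.0, 4.0]
print(round(spearman(psnr_db, task_score), 3))  # -0.8
```

In this toy case the model with the best PSNR has the worst task score, which is the kind of reversal the bullets above describe.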

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Incorporating auxiliary task losses during SR training could close the gap between visual fidelity and functional utility.
The same evaluation mismatch likely exists in other domains such as medical or astronomical imaging where tasks matter more than pixel accuracy.
Future SR research could test whether certain artifact types introduced by diffusion models systematically hurt change detection more than segmentation.
Agencies running large-scale Earth observation pipelines might adopt task-integrated benchmarks to replace current fidelity-only leaderboards.

Load-bearing premise

The five chosen downstream tasks and the 36,000 image locations represent the practical value of super-resolved imagery in real Earth monitoring workflows.

What would settle it

A replication of the 270-setting experiments on a fresh geographic sample or with different task models that instead finds consistently strong positive correlations between PSNR/SSIM gains and downstream accuracy.

Figures

Figures reproduced from arXiv: 2605.00310 by Dinesh Manocha, Gengchen Mai, Kangyang Chai, Sergii Skakun, Xiaowei Jia, Yanhua Li, Yiqun Xie, Zhihao Wang, Zhili Li.

**Figure 1.** These two SR tasks also cover diverse utilities. For …

**Figure 2.** GeoSR Dataset Construction Process Overview.

**Figure 3.** Spatial distributions of image pairs in (a) MODIS-to-Landsat-8 and (b) Sentinel-2-to-NAIP SR coincident datasets.

**Figure 4.** PSNR and SSIM do not always align with visual perception in SR.

**Figure 5.** Relative performance comparison across SR models on the MODIS-to-Landsat-8 downstream tasks using SegFormer as the downstream model. Relative …

**Figure 6.** Relative performance comparison across SR models on the Sentinel-2-to-NAIP downstream tasks using SegFormer as the downstream model. Relative …

**Figure 7.** Pearson correlation between visual fidelity metrics (PSNR/SSIM) and downstream task performance computed within the top- …

**Figure 8.** Spearman's rank correlation between visual fidelity metrics (PSNR/SSIM) and downstream task performance computed within the top- …

**Figure 9.** Downstream task performance comparison on the TreeFinder dataset

read the original abstract

Super-resolution (SR) techniques have made major advances in reconstructing high-resolution images from low-resolution inputs. The increased resolution provides visual enhancement and utility for monitoring tasks. In particular, SR has been increasingly developed for satellite-based Earth observation, with applications in urban planning, agriculture, ecology, and disaster response. However, existing SR studies and benchmarks typically use fidelity metrics such as PSNR or SSIM, whereas the true utility of super-resolved images lies in supporting downstream tasks such as land cover classification, biomass estimation, and change detection. To bridge this gap, we introduce GeoSR-Bench, a downstream task-integrated SR benchmark dataset to evaluate SR models beyond fidelity metrics. GeoSR-Bench comprises spatially co-located, temporally aligned, and quality-controlled image pairs from about 36,000 locations across diverse land covers, spanning resolutions from 500m to 0.6m. To the best of our knowledge, GeoSR-Bench is the first SR benchmark that directly connects improved image resolution from SR models with downstream Earth monitoring tasks, including land cover segmentation, infrastructure mapping, and biophysical variable estimation. Using GeoSR-Bench, we benchmark GAN, transformer, neural operator, and diffusion-based SR models on perceptual quality and downstream task performance. We conduct experiments with 270 settings, covering 2 cross-platform SR tasks, 9 SR models, 3 downstream task models, and 5 downstream tasks for each SR task. The results show that improvements in traditional SR metrics often do not correlate with gains in task performance, and the correlations can be negative, indicating that these metrics provide limited guidance for selecting superior models for downstream tasks. This reveals the need to integrate downstream tasks into SR model development and evaluation.
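The 270 settings in the abstract follow from a full factorial design: 2 × 9 × 3 × 5 = 270. The sketch below enumerates that grid. The two SR task names come from the Figure 3 caption; the model and task labels are placeholders, since this page does not list the individual names.

```python
from itertools import product

# SR task names as given in the Figure 3 caption.
sr_tasks = ["MODIS-to-Landsat-8", "Sentinel-2-to-NAIP"]

# Placeholder labels: only the counts (9, 3, 5) come from the abstract.
sr_models = [f"sr_model_{i}" for i in range(1, 10)]
task_models = [f"task_model_{i}" for i in range(1, 4)]
downstream_tasks = [f"downstream_task_{i}" for i in range(1, 6)]

# One setting = (SR task, SR model, downstream task model, downstream task).
settings = list(product(sr_tasks, sr_models, task_models, downstream_tasks))

print(len(settings))  # 270
```

Each SR task has its own five downstream tasks in the paper, so a faithful table would use a separate task list per SR task; the count is unchanged.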

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note: private letter to a colleague

GeoSR-Bench gives a large-scale empirical check on whether PSNR/SSIM gains actually help remote sensing tasks, and the answer is often no or even the reverse.


The main point is that this paper builds GeoSR-Bench from quality-controlled, co-located pairs at roughly 36,000 locations and runs it through nine SR models and five downstream tasks to test whether better fidelity metrics lead to better real-world performance. The central result is that they frequently do not, and the correlations can go negative across the 270 settings they report. That is a useful data point for anyone who has been using PSNR or SSIM to pick SR models for satellite work.

The scale and the direct tie to tasks like land cover segmentation, infrastructure mapping, and biophysical estimation are what the paper actually contributes. They cover two cross-platform scenarios, multiple model families including GANs, transformers, neural operators, and diffusion, and they keep the downstream models fixed so the comparison stays clean. That setup lets them show the disconnect without obvious circularity. The empirical pattern holds up in the numbers they present.

The softer spot is whether the five chosen tasks and the specific task models capture enough of the variation that matters in practice. If SR artifacts affect change detection under atmospheric noise or fine-scale agriculture metrics differently than they affect the selected tasks, the claim that fidelity metrics give limited guidance would need qualification. The paper does not appear to include extensive sensitivity checks on task choice, so that remains an open question rather than a settled one.

This is for researchers who develop or evaluate super-resolution for Earth observation and want evidence that goes past visual quality. Anyone running benchmarks or setting evaluation standards in remote sensing will find the dataset and the correlation results worth looking at. It has enough scale and a clear practical question to deserve peer review, though the referees should press on how far the decorrelation generalizes beyond the tasks tested.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces GeoSR-Bench, a large-scale benchmark with ~36,000 quality-controlled, co-located image pairs spanning resolutions from 500m to 0.6m across diverse land covers. It evaluates 9 SR models (including GAN, transformer, neural operator, and diffusion-based) in 2 cross-platform scenarios through 270 experimental settings with 3 downstream task models and 5 tasks per scenario (land cover segmentation, infrastructure mapping, biophysical variable estimation). The primary result is that traditional SR fidelity metrics (PSNR, SSIM) exhibit weak or negative correlations with downstream task performance gains.

Significance. The scale of the benchmark and the direct integration of downstream tasks represent a valuable contribution to the field of remote sensing image processing. If the observed lack of positive correlation between fidelity metrics and task utility is robust, this could significantly influence SR model development by prioritizing task performance over visual metrics, leading to more practical models for applications in urban planning, agriculture, and disaster response. The empirical benchmarking approach with held-out tasks is a strength.

major comments (2)

[§3] §3 (Dataset): The description of the 36,000 locations and quality control process lacks explicit details on selection criteria, data exclusion rules, and assessment of potential biases or representativeness across land cover types, which is critical to support the generalizability of the negative correlation findings.
[§5] §5 (Experimental Results): The analysis of correlations between SR metrics and downstream task performance across the 270 settings does not include statistical significance tests, error bars, or confidence intervals; this weakens the central claim that improvements in SR metrics 'often do not correlate' with task gains and that the correlations 'can be negative'.
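As an illustration of the kind of check the second comment asks for, the sketch below attaches a percentile bootstrap confidence interval and a two-sided permutation p-value to a Pearson correlation. This is one possible approach under stated assumptions, not the authors' analysis: the nine score pairs are invented, and the function names, resample counts, and seed are hypothetical choices.

```python
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def bootstrap_ci(x, y, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for Pearson r, resampling (x, y) pairs."""
    rng = random.Random(seed)
    n = len(x)
    stats = []
    while len(stats) < n_resamples:
        idx = [rng.randrange(n) for _ in range(n)]
        xs = [x[i] for i in idx]
        ys = [y[i] for i in idx]
        if len(set(xs)) < 2 or len(set(ys)) < 2:
            continue  # correlation is undefined for a constant resample
        stats.append(pearson(xs, ys))
    stats.sort()
    lower = stats[int((alpha / 2) * n_resamples)]
    upper = stats[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


def permutation_p_value(x, y, n_permutations=2000, seed=0):
    """Two-sided p-value: how often shuffled pairings match or exceed |r|."""
    rng = random.Random(seed)
    observed = abs(pearson(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs(pearson(x, shuffled)) >= observed:
            hits += 1
    return (hits + 1) / (n_permutations + 1)


# Invented scores for nine hypothetical SR models on one downstream task.
psnr_db = [28.1, 28.9, 29.4, 29.8, 30.2, 30.5, 30.9, 31.2, 31.6]
task_score = [0.63, 0.66, 0.60, 0.58, 0.62, 0.61, 0.55, 0.52, 0.57]

r = pearson(psnr_db, task_score)
low, high = bootstrap_ci(psnr_db, task_score)
p = permutation_p_value(psnr_db, task_score)
print(f"r = {r:.3f}, 95% CI = [{low:.3f}, {high:.3f}], p = {p:.4f}")
```

With only nine models per setting, such intervals are likely to be wide, which is part of why reporting them matters for the strength of the correlation claim.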

minor comments (1)

[Abstract] Abstract: A supplementary table breaking down the 270 settings by SR model, task, and platform would improve clarity of the experimental design.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thorough review and constructive comments on our manuscript. We appreciate the recognition of the benchmark's scale and the importance of integrating downstream tasks. Below, we provide point-by-point responses to the major comments.


Referee: [§3] §3 (Dataset): The description of the 36,000 locations and quality control process lacks explicit details on selection criteria, data exclusion rules, and assessment of potential biases or representativeness across land cover types, which is critical to support the generalizability of the negative correlation findings.

Authors: We agree that more detailed information on the dataset curation process is necessary to bolster the generalizability of our results. In the revised version of the manuscript, we will expand the description in §3 to explicitly detail the selection criteria for the approximately 36,000 locations, the specific data exclusion rules applied during quality control, and an assessment of potential biases along with the representativeness across various land cover types. This addition will help substantiate the robustness of the observed correlations. revision: yes
Referee: [§5] §5 (Experimental Results): The analysis of correlations between SR metrics and downstream task performance across the 270 settings does not include statistical significance tests, error bars, or confidence intervals; this weakens the central claim that improvements in SR metrics 'often do not correlate' with task gains and that the correlations 'can be negative'.

Authors: We concur that incorporating statistical rigor would strengthen the presentation of our findings. Accordingly, in the revised manuscript, we will augment the analysis in §5 by including statistical significance tests for the correlations, as well as error bars and confidence intervals where appropriate, across the 270 experimental settings. These additions will provide quantitative support for our conclusions regarding the weak or negative correlations between traditional SR fidelity metrics and downstream task performance. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical benchmarking study with independent experimental results


The paper introduces GeoSR-Bench as a new dataset and performs direct empirical comparisons of 9 SR models across 2 scenarios, 3 task models, and 5 downstream tasks per scenario, using image pairs from about 36,000 locations. No derivations, equations, or fitted parameters are presented as predictions; results are computed from held-out evaluations. No self-citation chains or ansatzes underpin the central claim of weak or negative correlations between fidelity metrics and task performance. The study is self-contained against external benchmarks and does not reduce its findings to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 0 invented entities

The work rests on domain assumptions about data alignment and task relevance rather than new mathematical axioms, free parameters, or invented entities.

axioms (1)

domain assumption: Spatially co-located, temporally aligned, and quality-controlled image pairs from diverse land covers form a fair basis for SR benchmarking.
Invoked in the construction of GeoSR-Bench from about 36,000 locations.


Reference graph

Works this paper leans on

85 extracted references · 85 canonical work pages

[1]

Ntire 2017 challenge on single image super-resolution: Dataset and study

Eirikur Agustsson and Radu Timofte. Ntire 2017 challenge on single image super-resolution: Dataset and study. InProceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 126–135, 2017

work page 2017
[2]

Improving component substitution pansharpening through multivariate regression of ms + pan data.IEEE Transactions on Geoscience and Remote Sensing, 45(10):3230– 3239, 2007

Bruno Aiazzi, Stefano Baronti, and Massimo Selva. Improving component substitution pansharpening through multivariate regression of ms + pan data.IEEE Transactions on Geoscience and Remote Sensing, 45(10):3230– 3239, 2007

work page 2007
[3]

Canopy height model and naip imagery pairs across conus.Scientific Data, 12(1):322, 2025

Brady W Allred, Sarah E McCord, and Scott L Morford. Canopy height model and naip imagery pairs across conus.Scientific Data, 12(1):322, 2025

work page 2025
[4]

Apache Sedona

Apache. Apache Sedona. https://sedona.apache.org/1.6.0/, 2025. Ac- cessed: 2025-11-05

work page 2025
[5]

Cnn-based super-resolution of hyperspectral images.IEEE Transactions on Geoscience and Remote Sensing, 58(9):6106–6121, 2020

Pattathal V Arun, Krishna Mohan Buddhiraju, Alok Porwal, and Jocelyn Chanussot. Cnn-based super-resolution of hyperspectral images.IEEE Transactions on Geoscience and Remote Sensing, 58(9):6106–6121, 2020

work page 2020
[6]

Toward real-world single image super-resolution: A new benchmark and a new model

Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, and Lei Zhang. Toward real-world single image super-resolution: A new benchmark and a new model. InProceedings of the IEEE/CVF international conference on computer vision, pages 3086–3095, 2019. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 16

work page 2019
[7]

Large-scale individual building extraction from open-source satellite imagery via super-resolution-based instance segmentation approach

Shenglong Chen, Yoshiki Ogawa, Chenbo Zhao, and Yoshihide Sekimoto. Large-scale individual building extraction from open-source satellite imagery via super-resolution-based instance segmentation approach. ISPRS Journal of Photogrammetry and Remote Sensing, 195:129–152, 2023

work page 2023
[8]

Activating more pixels in image super-resolution transformer

Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, and Chao Dong. Activating more pixels in image super-resolution transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 22367–22377, 2023

work page 2023
[9]

Binarized diffusion model for image super- resolution.Advances in Neural Information Processing Systems, 37:30651–30669, 2024

Zheng Chen, Haotong Qin, Yong Guo, Xiongfei Su, Xin Yuan, Linghe Kong, and Yulun Zhang. Binarized diffusion model for image super- resolution.Advances in Neural Information Processing Systems, 37:30651–30669, 2024

work page 2024
[10]

Recursive generalization transformer for image super-resolution.arXiv preprint arXiv:2303.06373, 2023

Zheng Chen, Yulun Zhang, Jinjin Gu, Linghe Kong, and Xiaokang Yang. Recursive generalization transformer for image super-resolution.arXiv preprint arXiv:2303.06373, 2023

work page arXiv 2023
[11]

A solar panel dataset of very high resolution satellite imagery to support the sustainable development goals

Cecilia N Clark and Fabio Pacifici. A solar panel dataset of very high resolution satellite imagery to support the sustainable development goals. Scientific Data, 10(1):636, 2023

work page 2023
[12]

Open high- resolution satellite imagery: The worldstrat dataset–with application to super-resolution.Advances in Neural Information Processing Systems, 35:25979–25991, 2022

Julien Cornebise, Ivan Oršoli ´c, and Freddie Kalaitzis. Open high- resolution satellite imagery: The worldstrat dataset–with application to super-resolution.Advances in Neural Information Processing Systems, 35:25979–25991, 2022

work page 2022
[13]

National land cover database (nlcd) 2021 products, 2023

Jon Dewitz. National land cover database (nlcd) 2021 products, 2023. U.S. Geological Survey data release

work page 2021
[14]

Learning a deep convolutional network for image super-resolution

Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. Learning a deep convolutional network for image super-resolution. InComputer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part IV 13, pages 184–199. Springer, 2014

work page 2014
[15]

Esa worldcover global land cover service

European Space Agency (ESA) WorldCover. Esa worldcover global land cover service. https://esa-worldcover.org/en/data-access, 2021. Accessed: 2025-12-11

work page 2021
[16]

Remote sensing time series analysis: A review of data and applications

Yingchun Fu, Zhe Zhu, Liangyun Liu, Wenfeng Zhan, Tao He, Huanfeng Shen, Jun Zhao, Yongxue Liu, Hongsheng Zhang, Zihan Liu, et al. Remote sensing time series analysis: A review of data and applications. Journal of Remote Sensing, 4:0285, 2024

work page 2024
[17]

Copernicus_s2_cloud_probability: Sentinel -2 cloud probability

Google Earth Engine. Copernicus_s2_cloud_probability: Sentinel -2 cloud probability. https://developers.google.com/earth-engine/datasets/catalog/ COPERNICUS_S2_CLOUD_PROBABILITY, 2025. Accessed: 2025- 12-11

work page 2025
[18]

Spatial-temporal super-resolution of satellite imagery via conditional pixel synthesis

Yutong He, Dingjie Wang, Nicholas Lai, William Zhang, Chenlin Meng, Marshall Burke, David Lobell, and Stefano Ermon. Spatial-temporal super-resolution of satellite imagery via conditional pixel synthesis. Advances in Neural Information Processing Systems, 34:27903–27915, 2021

work page 2021
[19]

Tracking urbanization in developing regions with remote sensing spatial-temporal super-resolution.arXiv preprint arXiv:2204.01736, 2022

Yutong He, William Zhang, Chenlin Meng, Marshall Burke, David B Lobell, and Stefano Ermon. Tracking urbanization in developing regions with remote sensing spatial-temporal super-resolution.arXiv preprint arXiv:2204.01736, 2022

work page arXiv 2022
[20]

Cascaded diffusion models for high fidelity image generation.Journal of Machine Learning Research, 23(47):1–33, 2022

Jonathan Ho, Chitwan Saharia, William Chan, David J Fleet, Mohammad Norouzi, and Tim Salimans. Cascaded diffusion models for high fidelity image generation.Journal of Machine Learning Research, 23(47):1–33, 2022

work page 2022
[21]

Efficient swin transformer for remote sensing image super-resolution.IEEE Transactions on Image Processing, 2024

Xudong Kang, Puhong Duan, Jier Li, and Shutao Li. Efficient swin transformer for remote sensing image super-resolution.IEEE Transactions on Image Processing, 2024

work page 2024
[22]

Lobell, and Stefano Ermon

Samar Khanna, Patrick Liu, Linqi Zhou, Chenlin Meng, Robin Rombach, Marshall Burke, David B. Lobell, and Stefano Ermon. Diffusionsat: A generative foundation model for satellite imagery. InThe Twelfth International Conference on Learning Representations, 2024

work page 2024
[23]

Accurate image super-resolution using very deep convolutional networks

Jiwon Kim, Jung Kwon Lee, and Kyoung Mu Lee. Accurate image super-resolution using very deep convolutional networks. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 1646–1654, 2016

work page 2016
[24]

Single-image super-resolution using sparse regression and natural image prior.IEEE transactions on pattern analysis and machine intelligence, 32(6):1127–1133, 2010

Kwang In Kim and Younghee Kwon. Single-image super-resolution using sparse regression and natural image prior.IEEE transactions on pattern analysis and machine intelligence, 32(6):1127–1133, 2010

work page 2010
[25]

Toward bridging the simulated-to-real gap: Benchmarking super-resolution on real data.IEEE transactions on pattern analysis and machine intelligence, 42(11):2944–2959, 2019

Thomas Köhler, Michel Bätz, Farzad Naderi, André Kaup, Andreas Maier, and Christian Riess. Toward bridging the simulated-to-real gap: Benchmarking super-resolution on real data.IEEE transactions on pattern analysis and machine intelligence, 42(11):2944–2959, 2019

work page 2019
[26]

A real-world benchmark for sentinel-2 multi-image super-resolution.Scientific Data, 10(1):644, 2023

Pawel Kowaleczko, Tomasz Tarasiewicz, Maciej Ziaja, Daniel Kostrzewa, Jakub Nalepa, Przemyslaw Rokita, and Michal Kawulok. A real-world benchmark for sentinel-2 multi-image super-resolution.Scientific Data, 10(1):644, 2023

work page 2023
[27]

A high-resolution canopy height model of the earth.Nature Ecology & Evolution, 7(11):1778–1789, 2023

Nico Lang, Walter Jetz, Konrad Schindler, and Jan Dirk Wegner. A high-resolution canopy height model of the earth.Nature Ecology & Evolution, 7(11):1778–1789, 2023

work page 2023
[28]

Photo-realistic single image super- resolution using a generative adversarial network

Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Jo- hannes Totz, Zehan Wang, et al. Photo-realistic single image super- resolution using a generative adversarial network. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 4681–4690, 2017

work page 2017
[29]

Transformer-based multistage enhancement for remote sensing image super-resolution.IEEE Transac- tions on Geoscience and Remote Sensing, 60:1–11, 2021

Sen Lei, Zhenwei Shi, and Wenjing Mo. Transformer-based multistage enhancement for remote sensing image super-resolution.IEEE Transac- tions on Geoscience and Remote Sensing, 60:1–11, 2021

work page 2021
[30]

Sed: Semantic-aware discriminator for image super-resolution

Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, and Zhibo Chen. Sed: Semantic-aware discriminator for image super-resolution. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 25784–25795, 2024

work page 2024
[31]

A global analysis of sentinel-2a, sentinel-2b and landsat-8 data revisit intervals and implications for terrestrial monitoring

Jian Li and David P Roy. A global analysis of sentinel-2a, sentinel-2b and landsat-8 data revisit intervals and implications for terrestrial monitoring. Remote Sensing, 9(9):902, 2017

work page 2017
[32]

Lsdir: A large scale dataset for image restoration

Yawei Li, Kai Zhang, Jingyun Liang, Jiezhang Cao, Ce Liu, Rui Gong, Yulun Zhang, Hao Tang, Yun Liu, Denis Demandolx, et al. Lsdir: A large scale dataset for image restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1775– 1787, 2023

work page 2023
[33]

A new sensor bias-driven spatio-temporal fusion model based on convolutional neural networks.Science China Information Sciences, 63:1–16, 2020

Yunfei Li, Jun Li, Lin He, Jin Chen, and Antonio Plaza. A new sensor bias-driven spatio-temporal fusion model based on convolutional neural networks.Science China Information Sciences, 63:1–16, 2020

work page 2020
[34]

Swinir: Image restoration using swin transformer

Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration using swin transformer. InProceedings of the IEEE/CVF international conference on computer vision, pages 1833–1844, 2021

work page 2021
[35]

Enhanced deep residual networks for single image super- resolution

Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. Enhanced deep residual networks for single image super- resolution. InProceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 136–144, 2017

work page 2017
[36]

Text2earth: Unlocking text-driven remote sensing image generation with a global-scale dataset and a foundation model.arXiv preprint arXiv:2501.00895, 2025

Chenyang Liu, Keyan Chen, Rui Zhao, Zhengxia Zou, and Zhenwei Shi. Text2earth: Unlocking text-driven remote sensing image generation with a global-scale dataset and a foundation model.arXiv preprint arXiv:2501.00895, 2025

work page arXiv 2025
[37]

JG Liu. Smoothing filter-based intensity modulation: A spectral preserve image fusion technique for improving spatial details.International Journal of remote sensing, 21(18):3461–3472, 2000

work page 2000
[38]

Difffno: Diffusion fourier neural operator

Xiaoyi Liu and Hao Tang. Difffno: Diffusion fourier neural operator. In Proceedings of the Computer Vision and Pattern Recognition Conference, pages 150–160, 2025

work page 2025
[39]

Swin transformer: Hierarchical vision transformer using shifted windows

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin transformer: Hierarchical vision transformer using shifted windows. InProceedings of the IEEE/CVF international conference on computer vision, pages 10012–10022, 2021

work page 2021
[40]

Zeping Liu, Hong Tang, Lin Feng, and Siqing Lyu. China building rooftop area: the first multi-annual (2016–2021) and high-resolution (2.5 m) building rooftop area dataset in china derived with super-resolution segmentation from sentinel-2 imagery.Earth System Science Data, 15(8):3547–3572, 2023

work page 2016
[41]

Transformer for single image super-resolution

Zhisheng Lu, Juncheng Li, Hong Liu, Chaoyan Huang, Linlin Zhang, and Tieyong Zeng. Transformer for single image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 457–466, 2022

work page 2022
[42]

Super- resolution of proba-v images using convolutional neural networks

Marcus Märtens, Dario Izzo, Andrej Krzic, and Daniël Cox. Super- resolution of proba-v images using convolutional neural networks. Astrodynamics, 3:387–402, 2019

work page 2019
[43]

A conditional diffusion model with fast sampling strategy for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 2024

Fanen Meng, Yijun Chen, Haoyu Jing, Laifu Zhang, Yiming Yan, Yingchao Ren, Sensen Wu, Tian Feng, Renyi Liu, and Zhenhong Du. A conditional diffusion model with fast sampling strategy for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 2024

work page 2024
[44]

Global urban areas: High-resolution population density and settlement data

Meta Data for Good. Global urban areas: High-resolution population density and settlement data. https://dataforgood.facebook.com/dfg/tools/ globalurbanareas, 2023. Accessed: 2025-12-16

work page 2023
[45]

Sen2venµs, a dataset for the training of sentinel-2 super-resolution algorithms.Data, 7(7):96, 2022

Julien Michel, Juan Vinasco-Salinas, Jordi Inglada, and Olivier Hagolle. Sen2venµs, a dataset for the training of sentinel-2 super-resolution algorithms.Data, 7(7):96, 2022

work page 2022
[46]

USRoadDetections: U.S

Microsoft. USRoadDetections: U.S. Road Network Detection Dataset. https://github.com/microsoft/USRoadDetections, 2020. Accessed: 2025- 12-11

work page 2020
[47]

Ntsg landsat gross primary production (gpp) v2

Numerical Terradynamic Simulation Group (NTSG). Ntsg landsat gross primary production (gpp) v2. https://developers.google.com/earth-engine/ JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 17 datasets/catalog/UMT_NTSG_v2_LANDSAT_GPP, 2021. Accessed: 2025-12-16

work page 2021
[48]

Landland- cov_baselc2022.tif, 2024

University of Vermont Spatial Analysis Laboratory. Landland- cov_baselc2022.tif, 2024

[49] Yunliang Qi, Meng Lou, Yimin Liu, Lu Li, Zhen Yang, and Wen Nie. Advancing image super-resolution techniques in remote sensing: A comprehensive survey. ISPRS Journal of Photogrammetry and Remote Sensing, 231:68–100, 2026

[50] Abhisek Ray, Gaurav Kumar, and Maheshkumar H Kolekar. Cfat: Unleashing triangular windows for image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 26120–26129, 2024

[51] Caleb Robinson, Isaac Corley, Anthony Ortiz, Rahul Dodhia, Juan M. Lavista Ferres, and Peyman Najafirad. Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery, 2024

[52] Chitwan Saharia, Jonathan Ho, William Chan, Tim Salimans, David J Fleet, and Mohammad Norouzi. Image super-resolution via iterative refinement. IEEE transactions on pattern analysis and machine intelligence, 45(4):4713–4726, 2022

[53] Shaolin Su, Josep M Rocafort, Danna Xue, David Serrano-Lozano, Lei Sun, and Javier Vazquez-Corral. Rethinking image evaluation in super-resolution. arXiv preprint arXiv:2503.13074, 2025

[54] Zhenyu Tan, Peng Yue, Liping Di, and Junmei Tang. Deriving high spatiotemporal remote sensing images using deep convolutional network. Remote Sensing, 10(7):1066, 2018

[55] Bing Maps Team. Computer generated building footprints for the united states, 2018

[56] U.S. Geological Survey. Landsat dynamic surface water extent (dswe) science products. https://www.usgs.gov/landsat-missions/landsat-dynamic-surface-water-extent-science-products, 2022. Accessed: 2025-12-16

[57] USDA Farm Service Agency Aerial Photography Field Office. Naip seam lines by state. https://gdg.sc.egov.usda.gov/Catalog/ProductDescription/NAIPSL.html, 2022. Accessed: 2025-12-11

[58] USDA National Agricultural Statistics Service. Cropland data layer. https://nassgeodata.gmu.edu/CropScape/, 2021. Published crop-specific data layer. Accessed: 2025-12-16

[59] Vlad Vasilescu, Mihai Datcu, and Daniela Faur. A cnn-based sentinel-2 image super-resolution method using multiobjective training. IEEE Transactions on Geoscience and Remote Sensing, 61:1–14, 2023

[60] Jia Wang, Liuyu Xiang, Lei Liu, Jiaochong Xu, Peipei Li, Qizhi Xu, and Zhaofeng He. Towards real-world remote sensing image super-resolution: A new benchmark and an efficient model. IEEE Transactions on Geoscience and Remote Sensing, 2024

[61] Qunming Wang, Yijie Tang, Xiaohua Tong, and Peter M Atkinson. Virtual image pair-based spatio-temporal fusion. Remote Sensing of Environment, 249:112009, 2020

[62] Xintao Wang, Liangbin Xie, Chao Dong, and Ying Shan. Real-esrgan: Training real-world blind super-resolution with pure synthetic data. In Proceedings of the IEEE/CVF international conference on computer vision, pages 1905–1914, 2021

[63] Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. Esrgan: Enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops, pages 0–0, 2018

[64] Yan Wang, Yi Liu, Shijie Zhao, Junlin Li, and Li Zhang. Camixersr: Only details need more "attention". In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 25837–25846, 2024

[65] Zhihao Wang, Cooper Li, Ruichen Wang, Lei Ma, George Hurtt, Xiaowei Jia, Gengchen Mai, Zhili Li, and Yiqun Xie. Treefinder: A us-scale benchmark dataset for individual tree mortality monitoring using high-resolution aerial imagery. In The Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track

[66] Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4):600–612, 2004

[67] Min Wei and Xuesong Zhang. Super-resolution neural operator. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 18247–18256, 2023

[68] Piper Wolters, Favyen Bastani, and Aniruddha Kembhavi. Zooming out on zooming in: Advancing super-resolution for remote sensing. arXiv preprint arXiv:2311.18082, 2023

[69] WWF HydroSHEDS. Free-flowing rivers - wwf hydrosheds v1. https://developers.google.com/earth-engine/datasets/catalog/WWF_HydroSHEDS_v1_FreeFlowingRivers, 2000. Accessed: 2025-12-16

[70] Yi Xiao, Qiangqiang Yuan, Kui Jiang, Jiang He, Xianyu Jin, and Liangpei Zhang. Ediffsr: An efficient diffusion probabilistic model for remote sensing image super-resolution. IEEE Transactions on Geoscience and Remote Sensing, 62:1–14, 2023

[71] Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M Alvarez, and Ping Luo. Segformer: Simple and efficient design for semantic segmentation with transformers. Advances in neural information processing systems, 34:12077–12090, 2021

[72] Zihao Xu, Yuzhi Tang, Bowen Xu, and Qingquan Li. Neurop-diff: Continuous remote sensing image super-resolution via neural operator diffusion. arXiv preprint arXiv:2501.09054, 2025

[73] Fuzhi Yang, Huan Yang, Jianlong Fu, Hongtao Lu, and Baining Guo. Learning texture transformer network for image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5791–5800, 2020

[74] Roman Zeyde, Michael Elad, and Matan Protter. On single image scale-up using sparse-representations. In International conference on curves and surfaces, pages 711–730. Springer, 2010

[75] Leheng Zhang, Yawei Li, Xingyu Zhou, Xiaorui Zhao, and Shuhang Gu. Transcending the limit of local window: Advanced super-resolution transformer with adaptive token dictionary. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2856–2865, 2024

[76] Leheng Zhang, Weiyi You, Kexuan Shi, and Shuhang Gu. Uncertainty-guided perturbation for image super-resolution diffusion model. In Proceedings of the Computer Vision and Pattern Recognition Conference, pages 17980–17989, 2025

[77] Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. In Proceedings of the IEEE/CVF international conference on computer vision, pages 3836–3847, 2023

[78] Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 586–595, 2018

[79] Yulun Zhang, Kunpeng Li, Kai Li, Lichen Wang, Bineng Zhong, and Yun Fu. Image super-resolution using very deep residual channel attention networks. In Proceedings of the European conference on computer vision (ECCV), pages 286–301, 2018

[80] Yunfeng Zhang, Qinglan Fan, Fangxun Bao, Yifang Liu, and Caiming Zhang. Single-image super-resolution based on rational fractal interpolation. IEEE Transactions on Image Processing, 27(8):3782–3797, 2018


[1] [1]

Ntire 2017 challenge on single image super-resolution: Dataset and study

Eirikur Agustsson and Radu Timofte. Ntire 2017 challenge on single image super-resolution: Dataset and study. InProceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 126–135, 2017

work page 2017

[2] [2]

Improving component substitution pansharpening through multivariate regression of ms + pan data.IEEE Transactions on Geoscience and Remote Sensing, 45(10):3230– 3239, 2007

Bruno Aiazzi, Stefano Baronti, and Massimo Selva. Improving component substitution pansharpening through multivariate regression of ms + pan data.IEEE Transactions on Geoscience and Remote Sensing, 45(10):3230– 3239, 2007

work page 2007

[3] [3]

Canopy height model and naip imagery pairs across conus.Scientific Data, 12(1):322, 2025

Brady W Allred, Sarah E McCord, and Scott L Morford. Canopy height model and naip imagery pairs across conus.Scientific Data, 12(1):322, 2025

work page 2025

[4] [4]

Apache Sedona

Apache. Apache Sedona. https://sedona.apache.org/1.6.0/, 2025. Ac- cessed: 2025-11-05

work page 2025

[5] [5]

Cnn-based super-resolution of hyperspectral images.IEEE Transactions on Geoscience and Remote Sensing, 58(9):6106–6121, 2020

Pattathal V Arun, Krishna Mohan Buddhiraju, Alok Porwal, and Jocelyn Chanussot. Cnn-based super-resolution of hyperspectral images.IEEE Transactions on Geoscience and Remote Sensing, 58(9):6106–6121, 2020

work page 2020

[6] [6]

Toward real-world single image super-resolution: A new benchmark and a new model

Jianrui Cai, Hui Zeng, Hongwei Yong, Zisheng Cao, and Lei Zhang. Toward real-world single image super-resolution: A new benchmark and a new model. InProceedings of the IEEE/CVF international conference on computer vision, pages 3086–3095, 2019. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 16

work page 2019

[7] [7]

Large-scale individual building extraction from open-source satellite imagery via super-resolution-based instance segmentation approach

Shenglong Chen, Yoshiki Ogawa, Chenbo Zhao, and Yoshihide Sekimoto. Large-scale individual building extraction from open-source satellite imagery via super-resolution-based instance segmentation approach. ISPRS Journal of Photogrammetry and Remote Sensing, 195:129–152, 2023

work page 2023

[8] [8]

Activating more pixels in image super-resolution transformer

Xiangyu Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, and Chao Dong. Activating more pixels in image super-resolution transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 22367–22377, 2023

work page 2023

[9] [9]

Binarized diffusion model for image super- resolution.Advances in Neural Information Processing Systems, 37:30651–30669, 2024

Zheng Chen, Haotong Qin, Yong Guo, Xiongfei Su, Xin Yuan, Linghe Kong, and Yulun Zhang. Binarized diffusion model for image super- resolution.Advances in Neural Information Processing Systems, 37:30651–30669, 2024

work page 2024

[10] [10]

Recursive generalization transformer for image super-resolution.arXiv preprint arXiv:2303.06373, 2023

Zheng Chen, Yulun Zhang, Jinjin Gu, Linghe Kong, and Xiaokang Yang. Recursive generalization transformer for image super-resolution.arXiv preprint arXiv:2303.06373, 2023

work page arXiv 2023

[11] [11]

A solar panel dataset of very high resolution satellite imagery to support the sustainable development goals

Cecilia N Clark and Fabio Pacifici. A solar panel dataset of very high resolution satellite imagery to support the sustainable development goals. Scientific Data, 10(1):636, 2023

work page 2023

[12] [12]

Open high- resolution satellite imagery: The worldstrat dataset–with application to super-resolution.Advances in Neural Information Processing Systems, 35:25979–25991, 2022

Julien Cornebise, Ivan Oršoli ´c, and Freddie Kalaitzis. Open high- resolution satellite imagery: The worldstrat dataset–with application to super-resolution.Advances in Neural Information Processing Systems, 35:25979–25991, 2022

work page 2022

[13] [13]

National land cover database (nlcd) 2021 products, 2023

Jon Dewitz. National land cover database (nlcd) 2021 products, 2023. U.S. Geological Survey data release

work page 2021

[14] [14]

Learning a deep convolutional network for image super-resolution

Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. Learning a deep convolutional network for image super-resolution. InComputer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part IV 13, pages 184–199. Springer, 2014

work page 2014

[15] [15]

Esa worldcover global land cover service

European Space Agency (ESA) WorldCover. Esa worldcover global land cover service. https://esa-worldcover.org/en/data-access, 2021. Accessed: 2025-12-11

work page 2021

[16] [16]

Remote sensing time series analysis: A review of data and applications

Yingchun Fu, Zhe Zhu, Liangyun Liu, Wenfeng Zhan, Tao He, Huanfeng Shen, Jun Zhao, Yongxue Liu, Hongsheng Zhang, Zihan Liu, et al. Remote sensing time series analysis: A review of data and applications. Journal of Remote Sensing, 4:0285, 2024

work page 2024

[17] [17]

Copernicus_s2_cloud_probability: Sentinel -2 cloud probability

Google Earth Engine. Copernicus_s2_cloud_probability: Sentinel -2 cloud probability. https://developers.google.com/earth-engine/datasets/catalog/ COPERNICUS_S2_CLOUD_PROBABILITY, 2025. Accessed: 2025- 12-11

work page 2025

[18] [18]

Spatial-temporal super-resolution of satellite imagery via conditional pixel synthesis

Yutong He, Dingjie Wang, Nicholas Lai, William Zhang, Chenlin Meng, Marshall Burke, David Lobell, and Stefano Ermon. Spatial-temporal super-resolution of satellite imagery via conditional pixel synthesis. Advances in Neural Information Processing Systems, 34:27903–27915, 2021

work page 2021

[19] [19]

Tracking urbanization in developing regions with remote sensing spatial-temporal super-resolution.arXiv preprint arXiv:2204.01736, 2022

Yutong He, William Zhang, Chenlin Meng, Marshall Burke, David B Lobell, and Stefano Ermon. Tracking urbanization in developing regions with remote sensing spatial-temporal super-resolution.arXiv preprint arXiv:2204.01736, 2022

work page arXiv 2022

[20] [20]

Cascaded diffusion models for high fidelity image generation.Journal of Machine Learning Research, 23(47):1–33, 2022

Jonathan Ho, Chitwan Saharia, William Chan, David J Fleet, Mohammad Norouzi, and Tim Salimans. Cascaded diffusion models for high fidelity image generation.Journal of Machine Learning Research, 23(47):1–33, 2022

work page 2022

[21] [21]

Efficient swin transformer for remote sensing image super-resolution.IEEE Transactions on Image Processing, 2024

Xudong Kang, Puhong Duan, Jier Li, and Shutao Li. Efficient swin transformer for remote sensing image super-resolution.IEEE Transactions on Image Processing, 2024

work page 2024

[22] [22]

Lobell, and Stefano Ermon

Samar Khanna, Patrick Liu, Linqi Zhou, Chenlin Meng, Robin Rombach, Marshall Burke, David B. Lobell, and Stefano Ermon. Diffusionsat: A generative foundation model for satellite imagery. InThe Twelfth International Conference on Learning Representations, 2024

work page 2024

[23] [23]

Accurate image super-resolution using very deep convolutional networks

Jiwon Kim, Jung Kwon Lee, and Kyoung Mu Lee. Accurate image super-resolution using very deep convolutional networks. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 1646–1654, 2016

work page 2016

[24] [24]

Single-image super-resolution using sparse regression and natural image prior.IEEE transactions on pattern analysis and machine intelligence, 32(6):1127–1133, 2010

Kwang In Kim and Younghee Kwon. Single-image super-resolution using sparse regression and natural image prior.IEEE transactions on pattern analysis and machine intelligence, 32(6):1127–1133, 2010

work page 2010

[25] [25]

Toward bridging the simulated-to-real gap: Benchmarking super-resolution on real data.IEEE transactions on pattern analysis and machine intelligence, 42(11):2944–2959, 2019

Thomas Köhler, Michel Bätz, Farzad Naderi, André Kaup, Andreas Maier, and Christian Riess. Toward bridging the simulated-to-real gap: Benchmarking super-resolution on real data.IEEE transactions on pattern analysis and machine intelligence, 42(11):2944–2959, 2019

work page 2019

[26] [26]

A real-world benchmark for sentinel-2 multi-image super-resolution.Scientific Data, 10(1):644, 2023

Pawel Kowaleczko, Tomasz Tarasiewicz, Maciej Ziaja, Daniel Kostrzewa, Jakub Nalepa, Przemyslaw Rokita, and Michal Kawulok. A real-world benchmark for sentinel-2 multi-image super-resolution.Scientific Data, 10(1):644, 2023

work page 2023

[27] [27]

A high-resolution canopy height model of the earth.Nature Ecology & Evolution, 7(11):1778–1789, 2023

Nico Lang, Walter Jetz, Konrad Schindler, and Jan Dirk Wegner. A high-resolution canopy height model of the earth.Nature Ecology & Evolution, 7(11):1778–1789, 2023

work page 2023

[28] [28]

Photo-realistic single image super- resolution using a generative adversarial network

Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Jo- hannes Totz, Zehan Wang, et al. Photo-realistic single image super- resolution using a generative adversarial network. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 4681–4690, 2017

work page 2017

[29] [29]

Transformer-based multistage enhancement for remote sensing image super-resolution.IEEE Transac- tions on Geoscience and Remote Sensing, 60:1–11, 2021

Sen Lei, Zhenwei Shi, and Wenjing Mo. Transformer-based multistage enhancement for remote sensing image super-resolution.IEEE Transac- tions on Geoscience and Remote Sensing, 60:1–11, 2021

work page 2021

[30] [30]

Sed: Semantic-aware discriminator for image super-resolution

Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, and Zhibo Chen. Sed: Semantic-aware discriminator for image super-resolution. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 25784–25795, 2024

work page 2024

[31] [31]

A global analysis of sentinel-2a, sentinel-2b and landsat-8 data revisit intervals and implications for terrestrial monitoring

Jian Li and David P Roy. A global analysis of sentinel-2a, sentinel-2b and landsat-8 data revisit intervals and implications for terrestrial monitoring. Remote Sensing, 9(9):902, 2017

work page 2017

[32] [32]

Lsdir: A large scale dataset for image restoration

Yawei Li, Kai Zhang, Jingyun Liang, Jiezhang Cao, Ce Liu, Rui Gong, Yulun Zhang, Hao Tang, Yun Liu, Denis Demandolx, et al. Lsdir: A large scale dataset for image restoration. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1775– 1787, 2023

work page 2023

[33] [33]

A new sensor bias-driven spatio-temporal fusion model based on convolutional neural networks.Science China Information Sciences, 63:1–16, 2020

Yunfei Li, Jun Li, Lin He, Jin Chen, and Antonio Plaza. A new sensor bias-driven spatio-temporal fusion model based on convolutional neural networks.Science China Information Sciences, 63:1–16, 2020

work page 2020

[34] [34]

Swinir: Image restoration using swin transformer

Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, and Radu Timofte. Swinir: Image restoration using swin transformer. InProceedings of the IEEE/CVF international conference on computer vision, pages 1833–1844, 2021

work page 2021

[35] [35]

Enhanced deep residual networks for single image super- resolution

Bee Lim, Sanghyun Son, Heewon Kim, Seungjun Nah, and Kyoung Mu Lee. Enhanced deep residual networks for single image super- resolution. InProceedings of the IEEE conference on computer vision and pattern recognition workshops, pages 136–144, 2017

work page 2017

[36] [36]

Text2earth: Unlocking text-driven remote sensing image generation with a global-scale dataset and a foundation model.arXiv preprint arXiv:2501.00895, 2025

Chenyang Liu, Keyan Chen, Rui Zhao, Zhengxia Zou, and Zhenwei Shi. Text2earth: Unlocking text-driven remote sensing image generation with a global-scale dataset and a foundation model.arXiv preprint arXiv:2501.00895, 2025

work page arXiv 2025

[37] [37]

JG Liu. Smoothing filter-based intensity modulation: A spectral preserve image fusion technique for improving spatial details.International Journal of remote sensing, 21(18):3461–3472, 2000

work page 2000

[38] [38]

Difffno: Diffusion fourier neural operator

Xiaoyi Liu and Hao Tang. Difffno: Diffusion fourier neural operator. In Proceedings of the Computer Vision and Pattern Recognition Conference, pages 150–160, 2025

work page 2025

[39] [39]

Swin transformer: Hierarchical vision transformer using shifted windows

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin transformer: Hierarchical vision transformer using shifted windows. InProceedings of the IEEE/CVF international conference on computer vision, pages 10012–10022, 2021

work page 2021

[40] [40]

Zeping Liu, Hong Tang, Lin Feng, and Siqing Lyu. China building rooftop area: the first multi-annual (2016–2021) and high-resolution (2.5 m) building rooftop area dataset in china derived with super-resolution segmentation from sentinel-2 imagery.Earth System Science Data, 15(8):3547–3572, 2023

work page 2016

[41] [41]

Transformer for single image super-resolution

Zhisheng Lu, Juncheng Li, Hong Liu, Chaoyan Huang, Linlin Zhang, and Tieyong Zeng. Transformer for single image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 457–466, 2022

work page 2022

[42] [42]

Super- resolution of proba-v images using convolutional neural networks

Marcus Märtens, Dario Izzo, Andrej Krzic, and Daniël Cox. Super- resolution of proba-v images using convolutional neural networks. Astrodynamics, 3:387–402, 2019

work page 2019

[43] [43]

A conditional diffusion model with fast sampling strategy for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 2024

Fanen Meng, Yijun Chen, Haoyu Jing, Laifu Zhang, Yiming Yan, Yingchao Ren, Sensen Wu, Tian Feng, Renyi Liu, and Zhenhong Du. A conditional diffusion model with fast sampling strategy for remote sensing image super-resolution.IEEE Transactions on Geoscience and Remote Sensing, 2024

work page 2024

[44] [44]

Global urban areas: High-resolution population density and settlement data

Meta Data for Good. Global urban areas: High-resolution population density and settlement data. https://dataforgood.facebook.com/dfg/tools/ globalurbanareas, 2023. Accessed: 2025-12-16

work page 2023

[45] [45]

Sen2venµs, a dataset for the training of sentinel-2 super-resolution algorithms.Data, 7(7):96, 2022

Julien Michel, Juan Vinasco-Salinas, Jordi Inglada, and Olivier Hagolle. Sen2venµs, a dataset for the training of sentinel-2 super-resolution algorithms.Data, 7(7):96, 2022

work page 2022

[46] [46]

USRoadDetections: U.S

Microsoft. USRoadDetections: U.S. Road Network Detection Dataset. https://github.com/microsoft/USRoadDetections, 2020. Accessed: 2025- 12-11

work page 2020

[47] [47]

Ntsg landsat gross primary production (gpp) v2

Numerical Terradynamic Simulation Group (NTSG). Ntsg landsat gross primary production (gpp) v2. https://developers.google.com/earth-engine/ JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 17 datasets/catalog/UMT_NTSG_v2_LANDSAT_GPP, 2021. Accessed: 2025-12-16

work page 2021

[48] [48]

Landland- cov_baselc2022.tif, 2024

University of Vermont Spatial Analysis Laboratory. Landland- cov_baselc2022.tif, 2024

work page 2024

[49] [49]

Advancing image super-resolution techniques in remote sensing: A comprehensive survey.ISPRS Journal of Photogrammetry and Remote Sensing, 231:68–100, 2026

Yunliang Qi, Meng Lou, Yimin Liu, Lu Li, Zhen Yang, and Wen Nie. Advancing image super-resolution techniques in remote sensing: A comprehensive survey.ISPRS Journal of Photogrammetry and Remote Sensing, 231:68–100, 2026

work page 2026

[50] [50]

Cfat: Un- leashing triangular windows for image super-resolution

Abhisek Ray, Gaurav Kumar, and Maheshkumar H Kolekar. Cfat: Un- leashing triangular windows for image super-resolution. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 26120–26129, 2024

work page 2024

[51] [51]

Lavista Ferres, and Peyman Najafirad

Caleb Robinson, Isaac Corley, Anthony Ortiz, Rahul Dodhia, Juan M. Lavista Ferres, and Peyman Najafirad. Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery, 2024

work page 2024

[52] [52]

Image super-resolution via iterative refinement.IEEE transactions on pattern analysis and machine intelligence, 45(4):4713–4726, 2022

Chitwan Saharia, Jonathan Ho, William Chan, Tim Salimans, David J Fleet, and Mohammad Norouzi. Image super-resolution via iterative refinement.IEEE transactions on pattern analysis and machine intelligence, 45(4):4713–4726, 2022

work page 2022

[53] [53]

Rethinking image evaluation in super- resolution.arXiv preprint arXiv:2503.13074, 2025

Shaolin Su, Josep M Rocafort, Danna Xue, David Serrano-Lozano, Lei Sun, and Javier Vazquez-Corral. Rethinking image evaluation in super- resolution.arXiv preprint arXiv:2503.13074, 2025

work page arXiv 2025

[54] [54]

Deriving high spatiotemporal remote sensing images using deep convolutional network

Zhenyu Tan, Peng Yue, Liping Di, and Junmei Tang. Deriving high spatiotemporal remote sensing images using deep convolutional network. Remote Sensing, 10(7):1066, 2018

work page 2018

[55] [55]

Computer generated building footprints for the united states, 2018

Bing Maps Team. Computer generated building footprints for the united states, 2018

work page 2018

[56] [56]

Geological Survey

U.S. Geological Survey. Landsat dynamic surface water extent (dswe) science products. https://www.usgs.gov/landsat-missions/ landsat-dynamic-surface-water-extent-science-products, 2022. Accessed: 2025-12-16

work page 2022

[57] [57]

Naip seam lines by state

USDA Farm Service Agency Aerial Photography Field Office. Naip seam lines by state. https://gdg.sc.egov.usda.gov/Catalog/ProductDescription/ NAIPSL.html, 2022. Accessed: 2025-12-11

work page 2022

[58] [58]

Cropland data layer

USDA National Agricultural Statistics Service. Cropland data layer. https://nassgeodata.gmu.edu/CropScape/, 2021. Published crop-specific data layer. Accessed: 12/16/2025

work page 2021

[59] [59]

A cnn-based sentinel- 2 image super-resolution method using multiobjective training.IEEE Transactions on Geoscience and Remote Sensing, 61:1–14, 2023

Vlad Vasilescu, Mihai Datcu, and Daniela Faur. A cnn-based sentinel- 2 image super-resolution method using multiobjective training.IEEE Transactions on Geoscience and Remote Sensing, 61:1–14, 2023

work page 2023

[60] [60]

Towards real-world remote sensing image super- resolution: A new benchmark and an efficient model.IEEE Transactions on Geoscience and Remote Sensing, 2024

Jia Wang, Liuyu Xiang, Lei Liu, Jiaochong Xu, Peipei Li, Qizhi Xu, and Zhaofeng He. Towards real-world remote sensing image super- resolution: A new benchmark and an efficient model.IEEE Transactions on Geoscience and Remote Sensing, 2024

work page 2024

[61] [61]

Virtual image pair-based spatio-temporal fusion.Remote Sensing of Environment, 249:112009, 2020

Qunming Wang, Yijie Tang, Xiaohua Tong, and Peter M Atkinson. Virtual image pair-based spatio-temporal fusion.Remote Sensing of Environment, 249:112009, 2020

work page 2020

[62] [62]

Real-esrgan: Training real-world blind super-resolution with pure synthetic data

Xintao Wang, Liangbin Xie, Chao Dong, and Ying Shan. Real-esrgan: Training real-world blind super-resolution with pure synthetic data. In Proceedings of the IEEE/CVF international conference on computer vision, pages 1905–1914, 2021

work page 1905

[63] [63]

Esrgan: Enhanced super-resolution generative adversarial networks

Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, and Chen Change Loy. Esrgan: Enhanced super-resolution generative adversarial networks. InProceedings of the European conference on computer vision (ECCV) workshops, pages 0–0, 2018

work page 2018

[64] [64]

attention

Yan Wang, Yi Liu, Shijie Zhao, Junlin Li, and Li Zhang. Camixersr: Only details need more" attention". InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 25837– 25846, 2024

work page 2024

[65] [65]

Treefinder: A us-scale benchmark dataset for individual tree mortality monitoring using high- resolution aerial imagery

Zhihao Wang, Cooper Li, Ruichen Wang, Lei Ma, George Hurtt, Xiaowei Jia, Gengchen Mai, Zhili Li, and Yiqun Xie. Treefinder: A us-scale benchmark dataset for individual tree mortality monitoring using high- resolution aerial imagery. InThe Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track

work page

[66] [66]

Image quality assessment: from error visibility to structural similarity

Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4):600–612, 2004

work page 2004

[67] [67]

Super-resolution neural operator

Min Wei and Xuesong Zhang. Super-resolution neural operator. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 18247–18256, 2023

work page 2023

[68] [68]

Xiao, Y., Yuan, Q., Jiang, K., He, J., Jin, X., Zhang, L.,

Piper Wolters, Favyen Bastani, and Aniruddha Kembhavi. Zooming out on zooming in: Advancing super-resolution for remote sensing.arXiv preprint arXiv:2311.18082, 2023

[69]

Free-flowing rivers - wwf hydrosheds v1

WWF HydroSHEDS. Free-flowing rivers - wwf hydrosheds v1. https://developers.google.com/earth-engine/datasets/catalog/WWF_HydroSHEDS_v1_FreeFlowingRivers, 2000. Accessed: 2025-12-16

[70]

Ediffsr: An efficient diffusion probabilistic model for remote sensing image super-resolution

Yi Xiao, Qiangqiang Yuan, Kui Jiang, Jiang He, Xianyu Jin, and Liangpei Zhang. Ediffsr: An efficient diffusion probabilistic model for remote sensing image super-resolution. IEEE Transactions on Geoscience and Remote Sensing, 62:1–14, 2023

[71]

Segformer: Simple and efficient design for semantic segmentation with transformers

Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M Alvarez, and Ping Luo. Segformer: Simple and efficient design for semantic segmentation with transformers. Advances in neural information processing systems, 34:12077–12090, 2021

[72]

Neurop-diff: Continuous remote sensing image super-resolution via neural operator diffusion

Zihao Xu, Yuzhi Tang, Bowen Xu, and Qingquan Li. Neurop-diff: Continuous remote sensing image super-resolution via neural operator diffusion. arXiv preprint arXiv:2501.09054, 2025

[73]

Learning texture transformer network for image super-resolution

Fuzhi Yang, Huan Yang, Jianlong Fu, Hongtao Lu, and Baining Guo. Learning texture transformer network for image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5791–5800, 2020

[74]

On single image scale-up using sparse-representations

Roman Zeyde, Michael Elad, and Matan Protter. On single image scale-up using sparse-representations. In International conference on curves and surfaces, pages 711–730. Springer, 2010

[75]

Transcending the limit of local window: Advanced super-resolution transformer with adaptive token dictionary

Leheng Zhang, Yawei Li, Xingyu Zhou, Xiaorui Zhao, and Shuhang Gu. Transcending the limit of local window: Advanced super-resolution transformer with adaptive token dictionary. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2856–2865, 2024

[76]

Uncertainty-guided perturbation for image super-resolution diffusion model

Leheng Zhang, Weiyi You, Kexuan Shi, and Shuhang Gu. Uncertainty-guided perturbation for image super-resolution diffusion model. In Proceedings of the Computer Vision and Pattern Recognition Conference, pages 17980–17989, 2025

[77]

Adding conditional control to text-to-image diffusion models

Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. In Proceedings of the IEEE/CVF international conference on computer vision, pages 3836–3847, 2023

[78]

The unreasonable effectiveness of deep features as a perceptual metric

Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 586–595, 2018

[79]

Image super-resolution using very deep residual channel attention networks

Yulun Zhang, Kunpeng Li, Kai Li, Lichen Wang, Bineng Zhong, and Yun Fu. Image super-resolution using very deep residual channel attention networks. In Proceedings of the European conference on computer vision (ECCV), pages 286–301, 2018

[80]

Single-image super-resolution based on rational fractal interpolation

Yunfeng Zhang, Qinglan Fan, Fangxun Bao, Yifang Liu, and Caiming Zhang. Single-image super-resolution based on rational fractal interpolation. IEEE Transactions on Image Processing, 27(8):3782–3797, 2018
