Flood Mapping from RGB imagery using a Vision Foundation Model

Andreas Weinmann; Markus Rauhut; Ronald R\"osch; Thomas M\"arz; Tilman Bucher; Vladyslav Polushko

arxiv: 2606.24120 · v1 · pith:6A6G54CHnew · submitted 2026-06-23 · 💻 cs.CV · eess.IV

Flood Mapping from RGB imagery using a Vision Foundation Model

Vladyslav Polushko , Tilman Bucher , Ronald R\"osch , Thomas M\"arz , Markus Rauhut , Andreas Weinmann This is my paper

Pith reviewed 2026-06-26 01:47 UTC · model grok-4.3

classification 💻 cs.CV eess.IV

keywords flood mappingvision foundation modelzero-shot transferRGB aerial imagerywater segmentationtransfer learningEarth observationfine-tuning

0 comments

The pith

A satellite-pretrained vision foundation model outperforms baselines when transferred to map floods from new RGB imagery in zero-shot and few-shot settings.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether a large vision model pretrained on satellite imagery can be adapted for segmenting flood water in high-resolution airborne RGB photos. It trains and evaluates the model on two separate flood events and finds that it matches or exceeds other models when trained on each dataset alone. In direct transfer from one event to the other with no additional training, the model delivers higher accuracy than the baselines. When given small amounts of labels from the second event, it improves more quickly and approaches full-training performance sooner than the comparison models.

Core claim

Fine-tuning Prithvi-2.0-UPN on the BlessemFlood21 and NeuenahrFlood RGB datasets produces state-of-the-art binary water segmentation. When trained only on BlessemFlood21, it outperforms baselines on NeuenahrFlood in zero-shot evaluation. With additional fine-tuning on small portions of NeuenahrFlood data, it improves more rapidly than the baselines and approaches the accuracy of full training on that dataset.

What carries the argument

Prithvi-2.0-UPN, formed by attaching a UPerNet decoder to the Prithvi-EO-2.0-600M Vision Transformer for pixel-level water classification after satellite pretraining.

If this is right

A model trained on one flood event can be applied directly to another event and still produce usable flood maps without new labels.
Adding only a small fraction of labels from a new event brings the model close to its best possible accuracy faster than competing models.
Satellite-pretrained models can bridge the gap to centimeter-scale nadir RGB imagery for water segmentation tasks.
Full training on either dataset alone yields segmentation accuracy at the level of current state-of-the-art methods.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Emergency mapping teams could maintain one base model and update it with minimal new labels when a different flood occurs.
The same adaptation strategy may extend to other rapid-response tasks such as mapping landslides or burned areas from RGB imagery.
Including more aerial RGB data during pretraining could reduce the amount of fine-tuning needed for future events.

Load-bearing premise

The two chosen flood datasets capture scenery and sensor differences that are representative enough for the transfer results to apply to other real-world flood events.

What would settle it

Running the same zero-shot and few-shot experiments on a third flood dataset with different terrain or imaging conditions and finding that baseline models match or exceed Prithvi-2.0-UPN performance would disprove the claimed transfer advantage.

Figures

Figures reproduced from arXiv: 2606.24120 by Andreas Weinmann, Markus Rauhut, Ronald R\"osch, Thomas M\"arz, Tilman Bucher, Vladyslav Polushko.

**Figure 1.** Figure 1: Qualitative comparison on a BlessemFlood21 test tile (RGB input) and predictions from Prithvi-2.0-UPN, and baseline methods; threshold 0.5 (Exp. 1). NeuenahrFlood test set to study how performance scales with an increasing size. The same setup is used for the baseline models. Throughout this section, the best values in each table are shown in bold and the second-best in italics. b) Experiment 1: Baseline c… view at source ↗

**Figure 5.** Figure 5: Exp. 3: Metrics of Prithvi-2.0-UPN on the [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗

**Figure 4.** Figure 4: Zero-shot prediction vs. prediction after fine-tuning on [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗

read the original abstract

Timely, high-resolution maps of flood extent around settlements are essential for emergency response and damage assessment. We consider airborne RGB imagery for flood mapping as it can be collected rapidly at low cost. To produce flood maps, deep learning models for water segmentation are often used. CNN based and small vision transformer models are used. However, they need much data for adaptation to a change of scenery, i.e., another flooding event. Vision foundation models or large vision transformers are known to generalize across domains. Recently, foundation models for Earth observation became available. They are pretrained on satellite data, whose spatial resolution, viewing geometry, and radiometry differ from nadir RGB imagery. Thus, adaptation is required. We investigate how a satellite-pretrained Earth observation foundation model can be adapted to centimeter-scale floodwater mapping from RGB imagery. Specifically, we fine-tune a model we call Prithvi-2.0-UPN consisting of the Prithvi-EO-2.0-600M Vision Transformer combined with a UPerNet decoder for binary water segmentation on two RGB datasets (BlessemFlood21, NeuenahrFlood). In a first experiment we observe that Prithvi-2.0-UPN reaches state-of-the-art results on BlessemFlood21 and NeuenahrFlood, when trained on their datasets. In a second experiment we show that Prithvi-2.0-UPN performs better than state-of-the-art baseline models for transfer to a new flood event (trained on BlessemFlood21, tested on NeuenahrFlood) in a zero-shot setting. However, the performance indicates room for improvement. In this respect, we investigate in a third experiment how performance improves when further fine-tuning the models with small shares of NeuenahrFlood training data: Prithvi-2.0-UPN improves the fastest and reaches almost the performance level when fully trained on NeuenahrFlood, indicating transfer capabilities.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Prithvi-2.0-UPN shows better zero-shot and few-shot transfer than baselines on the Blessem to Neuenahr pair, but the datasets look too similar for strong generalization claims and the abstract still withholds all numbers.

read the letter

The core new thing is the three transfer experiments: Prithvi-EO-2.0-600M plus UPerNet beats the listed baselines in zero-shot from BlessemFlood21 to NeuenahrFlood and then improves fastest when given small amounts of the target data. That specific empirical comparison on these two airborne RGB flood sets is not in the prior work the abstract cites.

The paper does the straightforward thing well. It takes an existing satellite foundation model, swaps in a UPerNet head, and runs the three logical tests (in-domain SOTA, zero-shot, few-shot) on the same pair of datasets. The setup is clear and the motivation for using foundation models on high-resolution nadir RGB is reasonable.

The soft spots are exactly where the stress-test note points. Both events are western German floods with comparable urban-rural mixes and nadir RGB acquisition, so the domain shift may be modest. If the advantage shrinks on more dissimilar events or sensors, the practical claim weakens. The abstract still gives zero quantitative results—no IoU scores, no dataset sizes, no variance—so it is impossible to tell whether the reported edge is large enough to matter. The full text presumably contains the numbers, but their absence from the summary is a real gap.

No equations or circular definitions appear, and the work is standard fine-tuning rather than a new method. Citation pattern looks normal for the subfield.

This is for remote-sensing practitioners who need flood maps from cheap RGB flights and want to know whether a large EO foundation model helps with data scarcity. A reader already working on foundation-model transfer for segmentation will find the experiments worth seeing, even if the generalization test is limited.

It deserves peer review. The application is practical, the transfer results are new data points, and the questions about dataset similarity and missing metrics are fixable with the full paper and perhaps one more test set.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Prithvi-2.0-UPN, formed by combining the Prithvi-EO-2.0-600M vision transformer (pretrained on satellite data) with a UPerNet decoder, and fine-tunes it for binary water segmentation on airborne RGB flood imagery. Experiments on the BlessemFlood21 and NeuenahrFlood datasets show that the model reaches state-of-the-art results when trained on each dataset individually, outperforms baseline models in zero-shot transfer from BlessemFlood21 to NeuenahrFlood, and exhibits the fastest performance gains when further fine-tuned with small fractions of the target-domain training data.

Significance. If the reported transfer results hold under scrutiny, the work provides evidence that satellite-pretrained Earth-observation foundation models can be adapted to centimeter-scale RGB flood mapping with limited target data. This could support faster model deployment for new flood events, though the practical value hinges on whether the observed advantages generalize beyond the two evaluated German datasets.

major comments (2)

[Datasets and Experimental Setup] Datasets section: both BlessemFlood21 and NeuenahrFlood occurred in western Germany with comparable urban/rural mixes, vegetation, and nadir RGB acquisition geometry. The manuscript provides no quantitative comparison of domain shift (e.g., via image histograms, lighting statistics, or sensor response differences), which is load-bearing for the claim that superior zero-shot and few-shot transfer on this pair demonstrates advantage for arbitrary new flood events.
[Results] Zero-shot and few-shot results paragraphs: the abstract and results claim state-of-the-art and fastest adaptation, yet the provided summary supplies no numerical metrics, error bars, dataset sizes, or implementation details; without these the magnitude and statistical significance of the reported improvements cannot be verified.

minor comments (2)

[Abstract] Abstract: states performance improvements and faster adaptation but supplies no quantitative metrics, error bars, or implementation details.
[Method] Model naming: the term Prithvi-2.0-UPN is introduced without an explicit statement of whether it denotes a novel architectural contribution or simply the combination of the existing Prithvi-EO-2.0-600M backbone with UPerNet.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback, which highlights important aspects of our experimental design and presentation. We address each major comment below and commit to revisions that strengthen the manuscript without altering its core claims.

read point-by-point responses

Referee: [Datasets and Experimental Setup] Datasets section: both BlessemFlood21 and NeuenahrFlood occurred in western Germany with comparable urban/rural mixes, vegetation, and nadir RGB acquisition geometry. The manuscript provides no quantitative comparison of domain shift (e.g., via image histograms, lighting statistics, or sensor response differences), which is load-bearing for the claim that superior zero-shot and few-shot transfer on this pair demonstrates advantage for arbitrary new flood events.

Authors: We agree that the similarity in geographic location, land cover, and acquisition geometry between the two datasets represents a limitation for generalizing the transfer results to arbitrary new flood events. In the revised manuscript, we will add a dedicated subsection with quantitative domain-shift analysis, including color histogram comparisons, mean and standard deviation of pixel intensities per channel, and basic lighting statistics derived from the RGB images. This will better contextualize the observed zero-shot and few-shot performance. revision: yes
Referee: [Results] Zero-shot and few-shot results paragraphs: the abstract and results claim state-of-the-art and fastest adaptation, yet the provided summary supplies no numerical metrics, error bars, dataset sizes, or implementation details; without these the magnitude and statistical significance of the reported improvements cannot be verified.

Authors: The full manuscript includes tables reporting all numerical metrics (IoU, F1-score, precision, recall) for the in-domain, zero-shot, and few-shot experiments, along with dataset sizes for each split and training hyperparameters. Multiple runs with different random seeds provide standard deviations as error bars. We will revise the abstract to include the key numerical improvements and ensure the results section explicitly references these tables and implementation details for immediate verifiability. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical results on held-out data

full rationale

The paper contains no equations, derivations, or parameter-fitting steps that could reduce to self-definition or fitted-input predictions. All claims rest on direct performance measurements (IoU, etc.) of models trained and evaluated on the two named flood datasets with explicit train/test splits. No self-citation is invoked to justify uniqueness or to close a logical loop; the central transfer results are falsifiable external benchmarks rather than internal redefinitions. This is the normal case of an empirical computer-vision study whose validity hinges on data representativeness, not on any circular reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Review performed on abstract only; no explicit free parameters, invented entities, or additional axioms beyond the domain assumption of transferability are stated.

axioms (1)

domain assumption A satellite-pretrained Earth observation foundation model can be adapted via fine-tuning to centimeter-scale airborne RGB imagery for binary water segmentation.
This premise underpins all three experiments described in the abstract.

pith-pipeline@v0.9.1-grok · 5904 in / 1287 out tokens · 27335 ms · 2026-06-26T01:47:42.001734+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

43 extracted references

[1]

CRED Crunch Newsletter, Issue No. 74: Disaster Year In Review 2023,

C. for Research on the Epidemiology of Dis- asters, “CRED Crunch Newsletter, Issue No. 74: Disaster Year In Review 2023,” April

2023
[2]

Available: https://reliefweb.int/attachments/ 6e641ebe-8d01-4a26-82c4-7819fd968ed6/CredCrunch74.pdf

[Online]. Available: https://reliefweb.int/attachments/ 6e641ebe-8d01-4a26-82c4-7819fd968ed6/CredCrunch74.pdf
[3]

Report: Natural catastrophes in 2021: the floodgates are open,

L. Bevere, F. Remondi, N. Aellen, K. Dhore, G. Mussetti, and S. T. et al., “Report: Natural catastrophes in 2021: the floodgates are open,”International Insurance Society, 2022, https://www. segurostv.es/data/informes/Swiss_Re_sigma_report1_2022.pdf

2021
[4]

Natural Disasters,

H. Ritchie and M. Roser, “Natural Disasters,”Our World in Data, 2022, https://ourworldindata.org/natural-disasters Updated: 4 January 2024. [Online]. Available: https: //ourworldindata.org/natural-disasters

2022
[5]

Arias, N

P. Arias, N. Bellouin, E. Coppola, R. Jones, Krinner, Gerhard, and J. Marotzke et al., “2021: Technical Summary. In Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change,” IPPC, United Kingdom and New York, Tech. Rep., 2021, https://www.ipcc.ch/repor...

2021
[6]

Ai-based drone assisted human rescue in disaster environments: Challenges and opportunities,

N. Papyan, M. Kulhandjian, H. Kulhandjian, and L. Aslanyan, “Ai-based drone assisted human rescue in disaster environments: Challenges and opportunities,”Pattern Recognition and Image Analysis, vol. 34, no. 1, pp. 169–186, 2024

2024
[7]

Applications of drone in disaster management: A scoping review,

S. M. S. M. Daud, M. Y . P. M. Yusof, C. C. Heo, L. S. Khoo, M. K. C. Singh, M. S. Mahmood, and H. Nawawi, “Applications of drone in disaster management: A scoping review,”Science & Justice, vol. 62, no. 1, pp. 30–42, 2022

2022
[8]

Evaluation of sugarcane crop growth monitor- ing using vegetation indices derived from rgb-based uav images and machine learning models,

P. Ruwanpathirana, K. Sakai, G. Jayasinghe, T. Nakandakari, K. Yuge, W. Wijekoon, A. Priyankara, M. Samaraweera, and P. Madushanka, “Evaluation of sugarcane crop growth monitor- ing using vegetation indices derived from rgb-based uav images and machine learning models,”Agronomy, vol. 14, no. 9, p. 2059, 2024

2059
[9]

Evaluation of an unmanned aerial vehicle (uav) for measuring and monitoring natural disaster risk areas,

M. Reiss, T. Mendes, F. Pereira, M. de Andrade, R. Mendes, S. Simões, R. de Lara, and S. de Souza, “Evaluation of an unmanned aerial vehicle (uav) for measuring and monitoring natural disaster risk areas,”The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sci- ences, vol. 43, pp. 1077–1083, 2022

2022
[10]

Using uav data to improve the situational awareness for first responders in disaster management: The ex- ample of flooding in the ahr valley, germany,

M. Halbgewachs, L. Angermann, M. Wieland, U. Kippnich, and K. Lechner, “Using uav data to improve the situational awareness for first responders in disaster management: The ex- ample of flooding in the ahr valley, germany,” inIGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, 2023, pp. 934–937

2023
[11]

Openstreetmap: User-generated street maps,

M. Haklay and P. Weber, “Openstreetmap: User-generated street maps,”IEEE Pervasive computing, vol. 7, no. 4, pp. 12–18, 2008

2008
[12]

A river flooding detection system based on deep learning and computer vision,

F. E. Fernandes Jr, L. G. Nonato, and J. Ueyama, “A river flooding detection system based on deep learning and computer vision,”Multimedia Tools and Applications, vol. 81, no. 28, pp. 40 231–40 251, 2022

2022
[13]

Urbansarfloods: Sentinel- 1 slc-based benchmark dataset for urban and open-area flood mapping,

J. Zhao, Z. Xiong, and X. X. Zhu, “Urbansarfloods: Sentinel- 1 slc-based benchmark dataset for urban and open-area flood mapping,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, June 2024, pp. 419–429

2024
[14]

Domain knowledge- aware remote sensing foundation model for flood detection in multi-spectral imagery,

Y . Li, B. Dang, F. Wei, J. Tan, and Y . Lin, “Domain knowledge- aware remote sensing foundation model for flood detection in multi-spectral imagery,” inIGARSS 2024-2024 IEEE Interna- tional Geoscience and Remote Sensing Symposium. IEEE, 2024, pp. 785–789

2024
[15]

Overcoming the uncertainty challenges in flood rapid mapping with multi- source optical data,

J. Li, H. Huang, W. He, H. Zhang, and L. Zhang, “Overcoming the uncertainty challenges in flood rapid mapping with multi- source optical data,” inIGARSS 2024-2024 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2024, pp. 780–784

2024
[16]

Fully convolutional network for automatic road extraction from satel- lite imagery,

A. Buslaev, S. Seferbekov, V . Iglovikov, and A. Shvets, “Fully convolutional network for automatic road extraction from satel- lite imagery,” inProceedings of the IEEE conference on com- puter vision and pattern recognition workshops, 2018, pp. 207– 210

2018
[17]

Segment anything model for road network graph extraction,

C. Hetang, H. Xue, C. Le, T. Yue, W. Wang, and Y . He, “Segment anything model for road network graph extraction,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 2556–2566

2024
[18]

Stripunet: A method for dense road extraction from remote sensing images,

X. Ma, X. Zhang, D. Zhou, and Z. Chen, “Stripunet: A method for dense road extraction from remote sensing images,”IEEE Transactions on Intelligent Vehicles, 2024

2024
[19]

Encoder-decoder with atrous separable convolution for se- mantic image segmentation,

L.-C. Chen, Y . Zhu, G. Papandreou, F. Schroff, and H. Adam, “Encoder-decoder with atrous separable convolution for se- mantic image segmentation,” inProceedings of the European conference on computer vision (ECCV), 2018, pp. 801–818

2018
[20]

Unet++: A nested u-net architecture for medical image seg- mentation,

Z. Zhou, M. M. Rahman Siddiquee, N. Tajbakhsh, and J. Liang, “Unet++: A nested u-net architecture for medical image seg- mentation,” inDeep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th Inter- national Workshop, and 8th International Workshop, Held in Conjunction with MICCAI 2018, Proceedings 4. Springer, 20...

2018
[21]

SegFormer: Simple and efficient design for semantic segmentation with transformers,

E. Xie, W. Wang, Z. Yu, A. Anandkumar, J. M. Alvarez, and P. Luo, “SegFormer: Simple and efficient design for semantic segmentation with transformers,”Advances in Neural Informa- tion Processing Systems, vol. 34, pp. 12 077–12 090, 2021

2021
[22]

Uavs in disaster management: Application of integrated aerial imagery and convolutional neural network for flood detection,

H. S. Munawar, F. Ullah, S. Qayyum, S. I. Khan, and M. Mojta- hedi, “Uavs in disaster management: Application of integrated aerial imagery and convolutional neural network for flood detection,”Sustainability, vol. 13, no. 14, p. 7547, 2021

2021
[23]

Enhanced urban flood monitoring: Integrating advanced semantic segmentation and human facial feature and posture analysis,

B. B. Nair, P. Vallimeena, U. Gopalakrishnan, S. N. Rao, and S. Krishnamoorthy, “Enhanced urban flood monitoring: Integrating advanced semantic segmentation and human facial feature and posture analysis,”IEEE Access, 2024

2024
[24]

Spacenet 8-the de- tection of flooded roads and buildings,

R. Hänsch, J. Arndt, D. Lunga, M. Gibb, T. Pedelose, A. Boedi- hardjo, D. Petrie, and T. M. Bacastow, “Spacenet 8-the de- tection of flooded roads and buildings,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 1472–1480

2022
[25]

FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding,

M. Rahnemoonfar, T. Chowdhury, A. Sarkar, D. Varshney, M. Yari, and R. R. Murphy, “FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding,” IEEE Access, vol. 9, pp. 89 644–89 654, 2021

2021
[26]

Post flood damage assessment using aerial imagery,

T. Dixit and P. K. Singh, “Post flood damage assessment using aerial imagery,” in2023 IEEE International Conference on Computer Vision and Machine Intelligence (CVMI). IEEE, 2023, pp. 1–6

2023
[27]

Flood vulnerability assessment of urban buildings based on integrating high-resolution remote sensing and street view images,

Z. Xing, S. Yang, X. Zan, X. Dong, Y . Yao, Z. Liu, and X. Zhang, “Flood vulnerability assessment of urban buildings based on integrating high-resolution remote sensing and street view images,”Sustainable Cities and Society, vol. 92, p. 104467, 2023

2023
[28]

Road damage detection from post-disaster high-resolution remote sensing im- ages based on tld framework,

K. Zhao, J. Liu, Q. Wang, X. Wu, and J. Tu, “Road damage detection from post-disaster high-resolution remote sensing im- ages based on tld framework,”IEEE Access, vol. 10, pp. 43 552– 43 561, 2022

2022
[29]

Blessem- flood21: A high-resolution georeferenced dataset for advanced analysis of river flood scenarios,

V . Polushko, A. Jenal, J. Bongartz, I. Weber, D. Hatic, R. Roesch, T. März, M. Rauhut, and A. Weinmann, “Blessem- flood21: A high-resolution georeferenced dataset for advanced analysis of river flood scenarios,”IEEE Access, 2024

2024
[30]

Neuenahrflood dataset and an improved human-in-the-loop strategy for efficient flood water segmen- tation,

V . Polushko, T. Bucher, D. Hatic, R. Rösch, T. März, M. Rauhut, and A. Weinmann, “Neuenahrflood dataset and an improved human-in-the-loop strategy for efficient flood water segmen- tation,” inEarth Resources and Environmental Remote Sens- ing/GIS Applications XVI, vol. 13671. SPIE, 2025, pp. 405– 414

2025
[31]

Flooded road detection using deep learning and street map semantics for humanitarian aid support,

V . Polushko, D. Hatic, R. Rösch, T. März, M. Rauhut, and A. Weinmann, “Flooded road detection using deep learning and street map semantics for humanitarian aid support,” 08 2025, pp. 2923–2928

2025
[32]

Foundation models for generalist geospatial artificial intelligence,

J. Jakubik, S. Roy, C. Phillips, P. Fraccaro, D. Godwin, B. Zadrozny, D. Szwarcman, C. Gomes, G. Nyirjesy, B. Ed- wardset al., “Foundation models for generalist geospatial artificial intelligence,”arXiv preprint arXiv:2310.18660, 2023

arXiv 2023
[33]

Prithvi-eo-2.0: A versatile multi- temporal foundation model for earth observation applications,

D. Szwarcman, S. Roy, P. Fraccaro, Þ. E. Gíslason, B. Blu- menstiel, R. Ghosal, P. H. de Oliveira, J. L. d. S. Almeida, R. Sedona, Y . Kanget al., “Prithvi-eo-2.0: A versatile multi- temporal foundation model for earth observation applications,” arXiv preprint arXiv:2412.02732, 2024

arXiv 2024
[34]

Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery,

Y . Cong, S. Khanna, C. Meng, P. Liu, E. Rozi, Y . He, M. Burke, D. Lobell, and S. Ermon, “Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery,”Advances in Neural Information Processing Systems, vol. 35, pp. 197–211, 2022

2022
[35]

Geo-bench: Toward foundation models for earth monitoring,

A. Lacoste, N. Lehmann, P. Rodriguez, E. Sherwin, H. Kerner, B. Lütjens, J. Irvin, D. Dao, H. Alemohammad, A. Drouinet al., “Geo-bench: Toward foundation models for earth monitoring,” Advances in Neural Information Processing Systems, vol. 36, pp. 51 080–51 093, 2023

2023
[36]

Evaluating and benchmarking foundation models for earth observation and geospatial ai,

N. Dionelis, C. Fibaek, L. Camilleri, A. Luyts, J. Bosmans, and B. L. Saux, “Evaluating and benchmarking foundation models for earth observation and geospatial ai,”arXiv preprint arXiv:2406.18295, 2024

arXiv 2024
[37]

When geoscience meets foundation models: Toward a general geoscience artificial intelligence system,

H. Zhang, J.-J. Xu, H.-W. Cui, L. Li, Y . Yang, C.-S. Tang, and N. Boers, “When geoscience meets foundation models: Toward a general geoscience artificial intelligence system,” IEEE geoscience and remote sensing magazine, 2024

2024
[38]

As- sessment of a new geoai foundation model for flood inundation mapping,

W. Li, H. Lee, S. Wang, C.-Y . Hsu, and S. T. Arundel, “As- sessment of a new geoai foundation model for flood inundation mapping,” inProceedings of the 6th ACM SIGSPATIAL Inter- national workshop on AI for geographic knowledge discovery, 2023, pp. 102–109

2023
[39]

Earth observation foundation models for region-specific flood segmentation,

H. Tamura-Wicks, G. Dawson, A. Taylor, C. Dearden, A. Jones, and P. Fraccaro, “Earth observation foundation models for region-specific flood segmentation,” inInternational Conference on Learning Representations, 2025

2025
[40]

U- prithvi: Integrating a foundation model and u-net for enhanced flood inundation mapping,

V . Kostejn, Y . Essus, J. Abrahamson, and R. R. Vatsavai, “U- prithvi: Integrating a foundation model and u-net for enhanced flood inundation mapping,” in13th International Conference on Geographic Information Science (GIScience 2025). Schloss Dagstuhl–Leibniz-Zentrum für Informatik, 2025, pp. 18–1

2025
[41]

Rapid adapta- tion of earth observation foundation models for segmentation,

K. P. Selvam, R. Ramos-Pollan, and F. Kalaitzis, “Rapid adapta- tion of earth observation foundation models for segmentation,” arXiv preprint arXiv:2409.09907, 2024

arXiv 2024
[42]

Upernet with convnext for semantic segmentation,

R. Wang, H. Jiang, and Y . Li, “Upernet with convnext for semantic segmentation,” in2023 IEEE 3rd International Confer- ence on Electronic Technology, Communication and Information (ICETCI). IEEE, 2023, pp. 764–769

2023
[43]

Blessemflood21: Advancing flood analysis with a high-resolution georeferenced dataset for humanitarian aid support,

V . Polushko, A. Jenal, J. Bongartz, I. Weber, D. Hatic, R. Rösch, T. März, M. Rauhut, and A. Weinmann, “Blessemflood21: Advancing flood analysis with a high-resolution georeferenced dataset for humanitarian aid support,” inIEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2024

2024

[1] [1]

CRED Crunch Newsletter, Issue No. 74: Disaster Year In Review 2023,

C. for Research on the Epidemiology of Dis- asters, “CRED Crunch Newsletter, Issue No. 74: Disaster Year In Review 2023,” April

2023

[2] [2]

Available: https://reliefweb.int/attachments/ 6e641ebe-8d01-4a26-82c4-7819fd968ed6/CredCrunch74.pdf

[Online]. Available: https://reliefweb.int/attachments/ 6e641ebe-8d01-4a26-82c4-7819fd968ed6/CredCrunch74.pdf

[3] [3]

Report: Natural catastrophes in 2021: the floodgates are open,

L. Bevere, F. Remondi, N. Aellen, K. Dhore, G. Mussetti, and S. T. et al., “Report: Natural catastrophes in 2021: the floodgates are open,”International Insurance Society, 2022, https://www. segurostv.es/data/informes/Swiss_Re_sigma_report1_2022.pdf

2021

[4] [4]

Natural Disasters,

H. Ritchie and M. Roser, “Natural Disasters,”Our World in Data, 2022, https://ourworldindata.org/natural-disasters Updated: 4 January 2024. [Online]. Available: https: //ourworldindata.org/natural-disasters

2022

[5] [5]

Arias, N

P. Arias, N. Bellouin, E. Coppola, R. Jones, Krinner, Gerhard, and J. Marotzke et al., “2021: Technical Summary. In Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change,” IPPC, United Kingdom and New York, Tech. Rep., 2021, https://www.ipcc.ch/repor...

2021

[6] [6]

Ai-based drone assisted human rescue in disaster environments: Challenges and opportunities,

N. Papyan, M. Kulhandjian, H. Kulhandjian, and L. Aslanyan, “Ai-based drone assisted human rescue in disaster environments: Challenges and opportunities,”Pattern Recognition and Image Analysis, vol. 34, no. 1, pp. 169–186, 2024

2024

[7] [7]

Applications of drone in disaster management: A scoping review,

S. M. S. M. Daud, M. Y . P. M. Yusof, C. C. Heo, L. S. Khoo, M. K. C. Singh, M. S. Mahmood, and H. Nawawi, “Applications of drone in disaster management: A scoping review,”Science & Justice, vol. 62, no. 1, pp. 30–42, 2022

2022

[8] [8]

Evaluation of sugarcane crop growth monitor- ing using vegetation indices derived from rgb-based uav images and machine learning models,

P. Ruwanpathirana, K. Sakai, G. Jayasinghe, T. Nakandakari, K. Yuge, W. Wijekoon, A. Priyankara, M. Samaraweera, and P. Madushanka, “Evaluation of sugarcane crop growth monitor- ing using vegetation indices derived from rgb-based uav images and machine learning models,”Agronomy, vol. 14, no. 9, p. 2059, 2024

2059

[9] [9]

Evaluation of an unmanned aerial vehicle (uav) for measuring and monitoring natural disaster risk areas,

M. Reiss, T. Mendes, F. Pereira, M. de Andrade, R. Mendes, S. Simões, R. de Lara, and S. de Souza, “Evaluation of an unmanned aerial vehicle (uav) for measuring and monitoring natural disaster risk areas,”The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sci- ences, vol. 43, pp. 1077–1083, 2022

2022

[10] [10]

Using uav data to improve the situational awareness for first responders in disaster management: The ex- ample of flooding in the ahr valley, germany,

M. Halbgewachs, L. Angermann, M. Wieland, U. Kippnich, and K. Lechner, “Using uav data to improve the situational awareness for first responders in disaster management: The ex- ample of flooding in the ahr valley, germany,” inIGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, 2023, pp. 934–937

2023

[11] [11]

Openstreetmap: User-generated street maps,

M. Haklay and P. Weber, “Openstreetmap: User-generated street maps,”IEEE Pervasive computing, vol. 7, no. 4, pp. 12–18, 2008

2008

[12] [12]

A river flooding detection system based on deep learning and computer vision,

F. E. Fernandes Jr, L. G. Nonato, and J. Ueyama, “A river flooding detection system based on deep learning and computer vision,”Multimedia Tools and Applications, vol. 81, no. 28, pp. 40 231–40 251, 2022

2022

[13] [13]

Urbansarfloods: Sentinel- 1 slc-based benchmark dataset for urban and open-area flood mapping,

J. Zhao, Z. Xiong, and X. X. Zhu, “Urbansarfloods: Sentinel- 1 slc-based benchmark dataset for urban and open-area flood mapping,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, June 2024, pp. 419–429

2024

[14] [14]

Domain knowledge- aware remote sensing foundation model for flood detection in multi-spectral imagery,

Y . Li, B. Dang, F. Wei, J. Tan, and Y . Lin, “Domain knowledge- aware remote sensing foundation model for flood detection in multi-spectral imagery,” inIGARSS 2024-2024 IEEE Interna- tional Geoscience and Remote Sensing Symposium. IEEE, 2024, pp. 785–789

2024

[15] [15]

Overcoming the uncertainty challenges in flood rapid mapping with multi- source optical data,

J. Li, H. Huang, W. He, H. Zhang, and L. Zhang, “Overcoming the uncertainty challenges in flood rapid mapping with multi- source optical data,” inIGARSS 2024-2024 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 2024, pp. 780–784

2024

[16] [16]

Fully convolutional network for automatic road extraction from satel- lite imagery,

A. Buslaev, S. Seferbekov, V . Iglovikov, and A. Shvets, “Fully convolutional network for automatic road extraction from satel- lite imagery,” inProceedings of the IEEE conference on com- puter vision and pattern recognition workshops, 2018, pp. 207– 210

2018

[17] [17]

Segment anything model for road network graph extraction,

C. Hetang, H. Xue, C. Le, T. Yue, W. Wang, and Y . He, “Segment anything model for road network graph extraction,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, pp. 2556–2566

2024

[18] [18]

Stripunet: A method for dense road extraction from remote sensing images,

X. Ma, X. Zhang, D. Zhou, and Z. Chen, “Stripunet: A method for dense road extraction from remote sensing images,”IEEE Transactions on Intelligent Vehicles, 2024

2024

[19] [19]

Encoder-decoder with atrous separable convolution for se- mantic image segmentation,

L.-C. Chen, Y . Zhu, G. Papandreou, F. Schroff, and H. Adam, “Encoder-decoder with atrous separable convolution for se- mantic image segmentation,” inProceedings of the European conference on computer vision (ECCV), 2018, pp. 801–818

2018

[20] [20]

Unet++: A nested u-net architecture for medical image seg- mentation,

Z. Zhou, M. M. Rahman Siddiquee, N. Tajbakhsh, and J. Liang, “Unet++: A nested u-net architecture for medical image seg- mentation,” inDeep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th Inter- national Workshop, and 8th International Workshop, Held in Conjunction with MICCAI 2018, Proceedings 4. Springer, 20...

2018

[21] [21]

SegFormer: Simple and efficient design for semantic segmentation with transformers,

E. Xie, W. Wang, Z. Yu, A. Anandkumar, J. M. Alvarez, and P. Luo, “SegFormer: Simple and efficient design for semantic segmentation with transformers,”Advances in Neural Informa- tion Processing Systems, vol. 34, pp. 12 077–12 090, 2021

2021

[22] [22]

Uavs in disaster management: Application of integrated aerial imagery and convolutional neural network for flood detection,

H. S. Munawar, F. Ullah, S. Qayyum, S. I. Khan, and M. Mojta- hedi, “Uavs in disaster management: Application of integrated aerial imagery and convolutional neural network for flood detection,”Sustainability, vol. 13, no. 14, p. 7547, 2021

2021

[23] [23]

Enhanced urban flood monitoring: Integrating advanced semantic segmentation and human facial feature and posture analysis,

B. B. Nair, P. Vallimeena, U. Gopalakrishnan, S. N. Rao, and S. Krishnamoorthy, “Enhanced urban flood monitoring: Integrating advanced semantic segmentation and human facial feature and posture analysis,”IEEE Access, 2024

2024

[24] [24]

Spacenet 8-the de- tection of flooded roads and buildings,

R. Hänsch, J. Arndt, D. Lunga, M. Gibb, T. Pedelose, A. Boedi- hardjo, D. Petrie, and T. M. Bacastow, “Spacenet 8-the de- tection of flooded roads and buildings,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 1472–1480

2022

[25] [25]

FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding,

M. Rahnemoonfar, T. Chowdhury, A. Sarkar, D. Varshney, M. Yari, and R. R. Murphy, “FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding,” IEEE Access, vol. 9, pp. 89 644–89 654, 2021

2021

[26] [26]

Post flood damage assessment using aerial imagery,

T. Dixit and P. K. Singh, “Post flood damage assessment using aerial imagery,” in2023 IEEE International Conference on Computer Vision and Machine Intelligence (CVMI). IEEE, 2023, pp. 1–6

2023

[27] [27]

Flood vulnerability assessment of urban buildings based on integrating high-resolution remote sensing and street view images,

Z. Xing, S. Yang, X. Zan, X. Dong, Y . Yao, Z. Liu, and X. Zhang, “Flood vulnerability assessment of urban buildings based on integrating high-resolution remote sensing and street view images,”Sustainable Cities and Society, vol. 92, p. 104467, 2023

2023

[28] [28]

Road damage detection from post-disaster high-resolution remote sensing im- ages based on tld framework,

K. Zhao, J. Liu, Q. Wang, X. Wu, and J. Tu, “Road damage detection from post-disaster high-resolution remote sensing im- ages based on tld framework,”IEEE Access, vol. 10, pp. 43 552– 43 561, 2022

2022

[29] [29]

Blessem- flood21: A high-resolution georeferenced dataset for advanced analysis of river flood scenarios,

V . Polushko, A. Jenal, J. Bongartz, I. Weber, D. Hatic, R. Roesch, T. März, M. Rauhut, and A. Weinmann, “Blessem- flood21: A high-resolution georeferenced dataset for advanced analysis of river flood scenarios,”IEEE Access, 2024

2024

[30] [30]

Neuenahrflood dataset and an improved human-in-the-loop strategy for efficient flood water segmen- tation,

V . Polushko, T. Bucher, D. Hatic, R. Rösch, T. März, M. Rauhut, and A. Weinmann, “Neuenahrflood dataset and an improved human-in-the-loop strategy for efficient flood water segmen- tation,” inEarth Resources and Environmental Remote Sens- ing/GIS Applications XVI, vol. 13671. SPIE, 2025, pp. 405– 414

2025

[31] [31]

Flooded road detection using deep learning and street map semantics for humanitarian aid support,

V . Polushko, D. Hatic, R. Rösch, T. März, M. Rauhut, and A. Weinmann, “Flooded road detection using deep learning and street map semantics for humanitarian aid support,” 08 2025, pp. 2923–2928

2025

[32] [32]

Foundation models for generalist geospatial artificial intelligence,

J. Jakubik, S. Roy, C. Phillips, P. Fraccaro, D. Godwin, B. Zadrozny, D. Szwarcman, C. Gomes, G. Nyirjesy, B. Ed- wardset al., “Foundation models for generalist geospatial artificial intelligence,”arXiv preprint arXiv:2310.18660, 2023

arXiv 2023

[33] [33]

Prithvi-eo-2.0: A versatile multi- temporal foundation model for earth observation applications,

D. Szwarcman, S. Roy, P. Fraccaro, Þ. E. Gíslason, B. Blu- menstiel, R. Ghosal, P. H. de Oliveira, J. L. d. S. Almeida, R. Sedona, Y . Kanget al., “Prithvi-eo-2.0: A versatile multi- temporal foundation model for earth observation applications,” arXiv preprint arXiv:2412.02732, 2024

arXiv 2024

[34] [34]

Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery,

Y . Cong, S. Khanna, C. Meng, P. Liu, E. Rozi, Y . He, M. Burke, D. Lobell, and S. Ermon, “Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery,”Advances in Neural Information Processing Systems, vol. 35, pp. 197–211, 2022

2022

[35] [35]

Geo-bench: Toward foundation models for earth monitoring,

A. Lacoste, N. Lehmann, P. Rodriguez, E. Sherwin, H. Kerner, B. Lütjens, J. Irvin, D. Dao, H. Alemohammad, A. Drouinet al., “Geo-bench: Toward foundation models for earth monitoring,” Advances in Neural Information Processing Systems, vol. 36, pp. 51 080–51 093, 2023

2023

[36] [36]

Evaluating and benchmarking foundation models for earth observation and geospatial ai,

N. Dionelis, C. Fibaek, L. Camilleri, A. Luyts, J. Bosmans, and B. L. Saux, “Evaluating and benchmarking foundation models for earth observation and geospatial ai,”arXiv preprint arXiv:2406.18295, 2024

arXiv 2024

[37] [37]

When geoscience meets foundation models: Toward a general geoscience artificial intelligence system,

H. Zhang, J.-J. Xu, H.-W. Cui, L. Li, Y . Yang, C.-S. Tang, and N. Boers, “When geoscience meets foundation models: Toward a general geoscience artificial intelligence system,” IEEE geoscience and remote sensing magazine, 2024

2024

[38] [38]

As- sessment of a new geoai foundation model for flood inundation mapping,

W. Li, H. Lee, S. Wang, C.-Y . Hsu, and S. T. Arundel, “As- sessment of a new geoai foundation model for flood inundation mapping,” inProceedings of the 6th ACM SIGSPATIAL Inter- national workshop on AI for geographic knowledge discovery, 2023, pp. 102–109

2023

[39] [39]

Earth observation foundation models for region-specific flood segmentation,

H. Tamura-Wicks, G. Dawson, A. Taylor, C. Dearden, A. Jones, and P. Fraccaro, “Earth observation foundation models for region-specific flood segmentation,” inInternational Conference on Learning Representations, 2025

2025

[40] [40]

U- prithvi: Integrating a foundation model and u-net for enhanced flood inundation mapping,

V . Kostejn, Y . Essus, J. Abrahamson, and R. R. Vatsavai, “U- prithvi: Integrating a foundation model and u-net for enhanced flood inundation mapping,” in13th International Conference on Geographic Information Science (GIScience 2025). Schloss Dagstuhl–Leibniz-Zentrum für Informatik, 2025, pp. 18–1

2025

[41] [41]

Rapid adapta- tion of earth observation foundation models for segmentation,

K. P. Selvam, R. Ramos-Pollan, and F. Kalaitzis, “Rapid adapta- tion of earth observation foundation models for segmentation,” arXiv preprint arXiv:2409.09907, 2024

arXiv 2024

[42] [42]

Upernet with convnext for semantic segmentation,

R. Wang, H. Jiang, and Y . Li, “Upernet with convnext for semantic segmentation,” in2023 IEEE 3rd International Confer- ence on Electronic Technology, Communication and Information (ICETCI). IEEE, 2023, pp. 764–769

2023

[43] [43]

Blessemflood21: Advancing flood analysis with a high-resolution georeferenced dataset for humanitarian aid support,

V . Polushko, A. Jenal, J. Bongartz, I. Weber, D. Hatic, R. Rösch, T. März, M. Rauhut, and A. Weinmann, “Blessemflood21: Advancing flood analysis with a high-resolution georeferenced dataset for humanitarian aid support,” inIEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2024

2024