ChangeFlow -- Latent Rectified Flow for Change Detection in Remote Sensing
Pith reviewed 2026-05-19 15:47 UTC · model grok-4.3
The pith
Remote sensing change detection improves by generating distributions of plausible masks in latent space with a rectified flow model.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
ChangeFlow reformulates remote sensing change detection as the synthesis of a change mask in latent space via rectified flow. It is guided by a structured yet lightweight conditioning signal drawn from the input image pair. The stochastic design supports sampling multiple masks, whose aggregation improves robustness while their agreement supplies a practical estimate of confidence that highlights ambiguous regions. Across four benchmarks the method reaches an average F1 of 80.4 percent, 1.3 points above the previous best, with inference speed comparable to recent strong baselines.
What carries the argument
Latent-space rectified flow for synthesizing entire change masks, conditioned by a lightweight structured signal from the input pair
If this is right
- Aggregating several sampled masks yields more robust final predictions than any single output.
- Agreement across samples provides a built-in confidence map that flags regions where annotations tend to vary.
- Global consistency of changed regions emerges automatically from the generative formulation rather than from post-processing.
- Inference cost remains comparable to strong discriminative baselines despite the generative nature of the approach.
Where Pith is reading between the lines
- The same latent-flow setup could be tested on other ambiguous segmentation tasks where labels reflect region-level conventions rather than pure local differences.
- Sampling multiple masks might supply uncertainty estimates useful for active learning or human-in-the-loop review in operational remote-sensing pipelines.
- The approach suggests exploring whether other flow-based or diffusion models in latent space can replace per-pixel classifiers in settings that prize coherent region outputs over raw speed.
Load-bearing premise
A structured yet lightweight conditioning signal in latent space is sufficient for the rectified-flow model to capture both global consistency of changed regions and the distribution of plausible masks that reflect annotation ambiguity.
What would settle it
On the same four benchmarks, single-sample predictions from the model fail to improve when aggregated or when sample agreement fails to correlate with human-labeled ambiguous areas.
Figures
read the original abstract
Remote sensing change detection (RSCD) aims to localise changes between two images of the same geographic region. In practice, change masks often follow region-level annotation conventions rather than purely local appearance differences, making them context-dependent and occasionally ambiguous. Most state-of-the-art methods utilise per-pixel discriminative classification, which produces a single prediction per input and fails to explicitly model the changed region as a coherent whole. A natural alternative is generative formulation, which can model a distribution of plausible masks, enabling sampling to capture ambiguity and encourage global consistency. However, existing generative RSCD approaches typically lag behind strong discriminative baselines due to the high computational cost of pixel-space generation and the complexity of their conditioning mechanisms. To address the limitations of prior discriminative and generative methods, we propose ChangeFlow, a generative framework that reformulates change detection as the synthesis of a change mask in latent space via rectified flow. ChangeFlow is guided by a structured yet lightweight conditioning signal, and its stochastic design naturally supports sampling-based prediction ensembling. Namely, aggregating multiple predicted change masks improves robustness, while sample agreement provides a practical confidence estimation that highlights ambiguous regions. Across four benchmarks, ChangeFlow achieves an average F1 of 80.4\%, improving by 1.3 points on average over the previous best method, while maintaining inference speed comparable to recent strong baselines. Project page: https://blaz-r.github.io/changeflow_cd
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces ChangeFlow, a generative framework for remote sensing change detection that reformulates the task as synthesizing change masks in latent space via rectified flow. It is guided by a structured yet lightweight conditioning signal derived from the input image pair and leverages stochastic sampling for prediction ensembling and confidence estimation. Across four benchmarks, the method reports an average F1 of 80.4%, a 1.3-point improvement over the previous best method, while maintaining inference speed comparable to recent strong baselines.
Significance. If the central performance claim holds, the work demonstrates that a latent-space rectified-flow formulation can deliver modest but consistent gains over strong discriminative baselines in RSCD by modeling distributions of plausible masks rather than single per-pixel predictions. The approach mitigates the computational cost of prior generative RSCD methods and provides practical benefits through sampling-based ensembling and ambiguity highlighting. These elements, if substantiated with reproducible details, represent a useful contribution to structured prediction tasks in remote sensing.
major comments (2)
- [§3] The description of the conditioning mechanism (abstract and §3) leaves open whether it supplies explicit spatial or semantic structure sufficient to enforce region-level coherence in the flow ODE. If implemented as a simple global embedding or low-resolution broadcast, the rectified-flow machinery risks being incidental, reducing the model to a more expensive latent autoencoder that would not be expected to outperform per-pixel baselines by a meaningful margin.
- [§4] The experiments section reports a 1.3-point average F1 lift but provides no details on training procedure, exact conditioning implementation, statistical significance of the gains, or variance across runs. These omissions are load-bearing for verifying the central empirical claim, as the abstract's performance numbers cannot be assessed beyond the stated figures without this information.
minor comments (2)
- [§3.1] Notation for the latent conditioning signal and flow ODE could be introduced more explicitly with a dedicated equation to improve clarity for readers unfamiliar with rectified flows.
- The project page link in the abstract should be confirmed to contain the promised code and models for reproducibility.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed review. We address each major comment below and describe the revisions that will be incorporated into the next version of the manuscript.
read point-by-point responses
-
Referee: [§3] The description of the conditioning mechanism (abstract and §3) leaves open whether it supplies explicit spatial or semantic structure sufficient to enforce region-level coherence in the flow ODE. If implemented as a simple global embedding or low-resolution broadcast, the rectified-flow machinery risks being incidental, reducing the model to a more expensive latent autoencoder that would not be expected to outperform per-pixel baselines by a meaningful margin.
Authors: We appreciate the referee raising this point. The current description in §3 is indeed concise and can be clarified. In the revised manuscript we will expand §3 with additional equations, a detailed architecture diagram, and explicit description of how the conditioning signal injects spatially aligned features from the bi-temporal pair into the latent flow. This will demonstrate that the conditioning supplies the region-level structure required for coherent mask synthesis and that the rectified-flow formulation is not incidental to the performance gains. revision: yes
-
Referee: [§4] The experiments section reports a 1.3-point average F1 lift but provides no details on training procedure, exact conditioning implementation, statistical significance of the gains, or variance across runs. These omissions are load-bearing for verifying the central empirical claim, as the abstract's performance numbers cannot be assessed beyond the stated figures without this information.
Authors: We agree that these omissions hinder reproducibility and verification. In the revised manuscript we will augment §4 with a complete account of the training procedure (optimizer, schedule, data augmentations), the precise implementation of the conditioning signal, and new experimental results that report standard deviation across multiple random seeds together with statistical significance tests for the reported F1 improvements. These additions will be presented in an expanded experimental protocol subsection and an accompanying table. revision: yes
Circularity Check
No significant circularity; empirical claims rest on independent benchmark evaluation
full rationale
The paper presents ChangeFlow as a novel generative reformulation of remote sensing change detection using latent-space rectified flow with structured conditioning. Performance is reported via direct empirical evaluation on four standard benchmarks, yielding an average F1 of 80.4% with a 1.3-point lift over prior best methods. No equations, derivations, or self-citations are shown that reduce this result to a fitted parameter, renamed input, or load-bearing self-reference by construction. The method introduces new architectural choices (latent flow, stochastic ensembling) whose contribution is measured externally against baselines rather than being tautological with the training procedure itself. The derivation chain is therefore self-contained against external data.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
reformulates change detection as the synthesis of a change mask in latent space via rectified flow... guided by a structured yet lightweight conditioning signal
-
IndisputableMonolith/Foundation/AlphaCoordinateFixation.leancostAlphaLog_fourth_deriv_at_zero unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
rectified flow... straight-line trajectory... velocity field
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Ba, J.L., Kiros, J.R., Hinton, G.E.: Layer Normalization. arXiv preprint arXiv:1607.06450 (2016) 7, 12, 4, 13
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[2]
In: Proceedings of the IEEE/CVF International Conference on Computer Vision
Bagchi, A., Bao, Z., Wang, Y.X., Tokmakov, P., Hebert, M.: ReferEverything: Towards Segmenting Everything We Can Speak of in Videos. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 23221–23231 (2025) 14
work page 2025
-
[3]
In: IEEE/CVF Winter Conference on Applications of Computer Vision
Bandara, W.G.C., Nair, N.G., Patel, V.: DDPM-CD: Denoising Diffusion Proba- bilistic Models as Feature Extractors for Remote Sensing Change Detection. In: IEEE/CVF Winter Conference on Applications of Computer Vision. pp. 5250–5262 (2025) 2, 3, 4, 9, 11
work page 2025
-
[4]
In: IEEE International Geoscience and Remote Sensing Symposium
Bandara, W.G.C., Patel, V.M.: A Transformer-Based Siamese Network for Change Detection. In: IEEE International Geoscience and Remote Sensing Symposium. pp. 207–210. IEEE (2022) 3, 8, 9, 2, 11, 16
work page 2022
-
[5]
arXiv preprint arXiv:2512.05140 (2025) 4
Bellier,G.L.,Audebert,N.:FlowEO:GenerativeUnsupervisedDomainAdaptation for Earth Observation. arXiv preprint arXiv:2512.05140 (2025) 4
-
[6]
In: Pro- ceedings of the Computer Vision and Pattern Recognition Conference
Benidir, Y., Gonthier, N., Mallet, C.: The Change You Want To Detect: Semantic Change Detection In Earth Observation With Hybrid Data Generation. In: Pro- ceedings of the Computer Vision and Pattern Recognition Conference. pp. 2204– 2214 (2025) 3, 4, 9, 2, 11, 16
work page 2025
-
[7]
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Cai, H., Cao, S., Du, R., Gao, P., Hoi, S., Huang, S., Hou, Z., Jiang, D., Jin, X., Li, L., et al.: Z-Image: An Efficient Image Generation Foundation Model With Single-Stream Diffusion Transformer. arXiv preprint arXiv:2511.22699 (2025) 13
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[8]
IEEE Transactions on Geoscience and Remote Sensing60, 1–14 (2021) 1, 2, 8
Chen, H., Qi, Z., Shi, Z.: Remote Sensing Image Change Detection With Trans- formers. IEEE Transactions on Geoscience and Remote Sensing60, 1–14 (2021) 1, 2, 8
work page 2021
-
[9]
Remote Sensing12, 1662 (2020) 3, 8, 1 16 B
Chen, H., Shi, Z.: A Spatial-Temporal Attention-Based Method and A New Dataset for Remote Sensing Image Change Detection. Remote Sensing12, 1662 (2020) 3, 8, 1 16 B. Rolih et al
work page 2020
-
[10]
IEEE Transactions on Geoscience and Remote Sensing62, 1–20 (2024) 3, 9, 2, 11, 16
Chen, H., Song, J., Han, C., Xia, J., Yokoya, N.: ChangeMamba: Remote Sensing Change Detection With Spatiotemporal State Space Model. IEEE Transactions on Geoscience and Remote Sensing62, 1–20 (2024) 3, 9, 2, 11, 16
work page 2024
-
[11]
In: Pro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recogni- tion
Chen, J., Lu, J., Zhu, X., Zhang, L.: Generative Semantic Segmentation. In: Pro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recogni- tion. pp. 7111–7120 (2023) 4
work page 2023
-
[12]
In: Proceedings of the IEEE/CVF International Conference on Computer Vision
Chen, S., Sun, P., Song, Y., Luo, P.: DiffusionDet: Diffusion Model for Object De- tection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 19830–19843 (2023) 4
work page 2023
-
[13]
arXiv preprint arXiv:2511.16322 (2025) 2, 3, 9, 11, 17
Cheng, C.H., Hsu, C.C.: ChangeDino: DINOv3-Driven Building Change Detection in Optical Remote Sensing Imagery. arXiv preprint arXiv:2511.16322 (2025) 2, 3, 9, 11, 17
-
[14]
In: IEEE International Conference on Image Processing
Daudt, R.C., Le Saux, B., Boulch, A.: Fully Convolutional Siamese Networks for Change Detection. In: IEEE International Conference on Image Processing. pp. 4063–4067. IEEE (2018) 1, 3, 8, 9, 2, 11, 16
work page 2018
-
[15]
In: IEEE International Geoscience and Remote Sensing Symposium
Daudt, R.C., Le Saux, B., Boulch, A., Gousseau, Y.: Urban Change Detection for Multispectral Earth Observation Using Convolutional Neural Networks. In: IEEE International Geoscience and Remote Sensing Symposium. pp. 2115–2118. IEEE (2018) 1, 8
work page 2018
-
[16]
IEEE Transactions on Geoscience and Remote Sensing62, 1–14 (2024) 3
Ding, L., Zhang, J., Guo, H., Zhang, K., Liu, B., Bruzzone, L.: Joint Spatio- Temporal Modeling for Semantic Change Detection in Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing62, 1–14 (2024) 3
work page 2024
-
[17]
In: Forty-first International Conference on Ma- chine Learning (2024) 6, 13, 3
Esser, P., Kulal, S., Blattmann, A., Entezari, R., Müller, J., Saini, H., Levi, Y., Lorenz, D., Sauer, A., Boesel, F., et al.: Scaling Rectified Flow Transformers for High-Resolution Image Synthesis. In: Forty-first International Conference on Ma- chine Learning (2024) 6, 13, 3
work page 2024
-
[18]
In: European Conference on Computer Vision
Fučka, M., Zavrtanik, V., Skočaj, D.: Transfusion–a Transparency-Based Diffusion Model for Anomaly Detection. In: European Conference on Computer Vision. pp. 91–108. Springer (2024) 4
work page 2024
-
[19]
arXiv preprint arXiv:2511.20306 (2025) 3
Guo, H., Liu, C., Zhang, H., Chen, B., Zou, Z., Shi, Z.: TaCo: Capturing Spatio- Temporal Semantic Consistency in Remote Sensing Change Detection. arXiv preprint arXiv:2511.20306 (2025) 3
-
[20]
In: IEEE International Geoscience and Remote Sensing Symposium
Hänsch,R.,Chaurasia,M.A.:EarthObservationandMachineLearningforClimate Change. In: IEEE International Geoscience and Remote Sensing Symposium. pp. 1676–1682. IEEE (2024) 1
work page 2024
-
[21]
Jia, J., Lee, G., Wang, Z., Lyu, Z., He, Y.: Siamese Meets Diffusion Network: Smdnet for Enhanced Change Detection in High-Resolution Rs Imagery. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing17, 8189–8202 (2024) 2, 4, 7
work page 2024
-
[22]
Jia, Y., Marsocci, V., Gong, Z., Yang, X., Vergauwen, M., Nascetti, A.: Can Gener- ative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models? In: International Conference on Computer Vision (ICCV) (2025) 4, 9, 2, 11, 16
work page 2025
-
[23]
IEEE Transactions on Geoscience and Remote Sensing (2025) 4
Jiang, F., Huo, X., Zhang, M., Gong, M., Pu, Y., Zhou, Y., Zhao, W., Guan, Z.: D3PM: Dual-Stream Denoising Diffusion Probabilistic Model for Change Detection in Multimodal Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing (2025) 4
work page 2025
-
[24]
Jordan, K., Jin, Y., Boza, V., Jiacheng, Y., Cesista, F., Newhouse, L., Bernstein, J.: Muon: An Optimizer for Hidden Layers in Neural Networks (2024),https: //kellerjordan.github.io/posts/muon/8 ChangeFlow 17
work page 2024
-
[25]
In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
Ke, B., Obukhov, A., Huang, S., Metzger, N., Daudt, R.C., Schindler, K.: Re- purposing Diffusion-Based Image Generators for Monocular Depth Estimation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 9492–9502 (2024) 4
work page 2024
-
[26]
Kingma, D.P., Welling, M.: Auto-Encoding Variational Bayes. In: 2nd Interna- tional Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings (2014) 6
work page 2014
-
[27]
Korkmaz, Y., Paranjape, J.N., de Melo, C.M., Patel, V.M.: Referring Change De- tectioninRemoteSensingImagery.IEEE/CVFWinterConferenceonApplications of Computer Vision (WACV) (2026) 4
work page 2026
-
[28]
Labs, B.F.: Flux.https://github.com/black-forest-labs/flux(2024) 13
work page 2024
-
[29]
In: IEEE International Geoscience and Remote Sensing Sym- posium
Le Saux, B., Randrianarivo, H.: Urban Change Detection in SAR Images by In- teractive Learning. In: IEEE International Geoscience and Remote Sensing Sym- posium. pp. 3990–3993. IEEE (2013) 3
work page 2013
-
[30]
IEEE Transactions on Geoscience and Remote Sensing62, 1–12 (2024) 3
Li, K., Cao, X., Meng, D.: A New Learning Paradigm for Foundation Model-Based Remote-Sensing Change Detection. IEEE Transactions on Geoscience and Remote Sensing62, 1–12 (2024) 3
work page 2024
-
[31]
IEEE Transactions on Geoscience and Remote Sensing61, 1–12 (2023) 3
Li, Z., Tang, C., Liu, X., Zhang, W., Dou, J., Wang, L., Zomaya, A.Y.: Lightweight Remote Sensing Change Detection With Progressive Feature Aggregation and Su- pervised Attention. IEEE Transactions on Geoscience and Remote Sensing61, 1–12 (2023) 3
work page 2023
-
[32]
Liu, M., Chai, Z., Deng, H., Liu, R.: A CNN-Transformer Network With Multiscale Context Aggregation for Fine-Grained Cropland Change Detection. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing15, 4297– 4306 (2022) 8, 1
work page 2022
-
[33]
Liu, X., Gong, C., Liu, Q.: Flow Straight and Fast: Learning to Generate and Transfer Data With Rectified Flow. In: The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023 (2023) 2, 3, 4, 6, 7, 8
work page 2023
-
[34]
In: IEEE/CVF International Conference on Computer Vision
Mendieta, M., Han, B., Shi, X., Zhu, Y., Chen, C.: Towards Geospatial Founda- tion Models via Continual Pretraining. In: IEEE/CVF International Conference on Computer Vision. pp. 16806–16816 (2023) 9, 2, 11, 16
work page 2023
-
[35]
Meneses III, S., Blanco, A.: Rapid Mapping and Assessment of Damages Due to Typhoon Rai Using Sentinel-1 Synthetic Aperture Radar Data. The International ArchivesofthePhotogrammetry,RemoteSensingandSpatialInformationSciences 43, 1139–1146 (2022) 1
work page 2022
-
[36]
PFG–Journal of Photogrammetry, Re- mote Sensing and Geoinformation Science91(6), 443–452 (2023) 3
Metzger, N., Türkoglu, M.Ö., Daudt, R.C., Wegner, J.D., Schindler, K.: Urban Change Forecasting From Satellite Images. PFG–Journal of Photogrammetry, Re- mote Sensing and Geoinformation Science91(6), 443–452 (2023) 3
work page 2023
-
[37]
In: International Conference on Machine Learning
Nichol, A.Q., Dhariwal, P.: Improved Denoising Diffusion Probabilistic Models. In: International Conference on Machine Learning. pp. 8162–8171. PMLR (2021) 4, 17
work page 2021
-
[38]
Transactions on Machine Learning Research (2024) 13
Oquab, M., Darcet, T., Moutakanni, T., Vo, H.V., Szafraniec, M., Khalidov, V., Fernandez, P., HAZIZA, D., Massa, F., El-Nouby, A., et al.: DINOv2: Learning Robust Visual Features Without Supervision. Transactions on Machine Learning Research (2024) 13
work page 2024
- [39]
-
[40]
International Journal of Applied Earth Observation and Geoin- formation136, 104282 (2025) 3
Peng,D.,Liu,X.,Zhang,Y.,Guan,H.,Li,Y.,Bruzzone,L.:DeepLearningChange Detection Techniques for Optical Remote Sensing Imagery: Status, Perspectives and Challenges. International Journal of Applied Earth Observation and Geoin- formation136, 104282 (2025) 3
work page 2025
-
[41]
In: The Twelfth International Conference on Learning Represen- tations (2024) 6, 8, 13, 12
Podell, D., English, Z., Lacey, K., Blattmann, A., Dockhorn, T., Müller, J., Penna, J., Rombach, R.: SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis. In: The Twelfth International Conference on Learning Represen- tations (2024) 6, 8, 13, 12
work page 2024
-
[42]
In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Ranzinger, M., Heinrich, G., Kautz, J., Molchanov, P.: Am-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 12490–12500 (June 2024) 13
work page 2024
-
[43]
arXiv preprint arXiv:2601.17237 (2026) 13
Ranzinger, M., Heinrich, G., McCarthy, C., Kautz, J., Tao, A., Catanzaro, B., Molchanov, P.: C-RADIOv4 (tech Report). arXiv preprint arXiv:2601.17237 (2026) 13
-
[44]
IEEE Transactions on Geoscience and Remote Sensing63, 1–11 (2025) 2, 3, 8, 9, 11, 12, 14, 15, 16, 17
Rolih, B., Fučka, M., Wolf, F., Čehovin Zajc, L.: Be the Change You Want to See: Revisiting Remote Sensing Change Detection Practices. IEEE Transactions on Geoscience and Remote Sensing63, 1–11 (2025) 2, 3, 8, 9, 11, 12, 14, 15, 16, 17
work page 2025
-
[45]
IEEE Transactions on Geoscience and Remote Sensing60, 1–16 (2022) 8, 1
Shi, Q., Liu, M., Li, S., Liu, X., Wang, F., Zhang, L.: A Deeply Supervised Atten- tion Metric-Based Network and an Open Aerial Image Dataset for Remote Sensing Change Detection. IEEE Transactions on Geoscience and Remote Sensing60, 1–16 (2022) 8, 1
work page 2022
-
[46]
Siméoni, O., Vo, H.V., Seitzer, M., Baldassarre, F., Oquab, M., Jose, C., Khali- dov, V., Szafraniec, M., Yi, S., Ramamonjisoa, M., et al.: DINOv3. arXiv preprint arXiv:2508.10104 (2025) 8, 9, 13
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[47]
International journal of remote sensing10(6), 989–1003 (1989) 3
Singh, A.: Review Article Digital Change Detection Techniques Using Remotely- Sensed Data. International journal of remote sensing10(6), 989–1003 (1989) 3
work page 1989
-
[48]
In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision
Song, J., Chen, H., Yokoya, N.: Syntheworld: A Large-Scale Synthetic Dataset for Land Cover Mapping and Building Change Detection. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. pp. 8287–8296 (2024) 4
work page 2024
-
[49]
arXiv preprint arXiv:2512.20153 (2025) 4
Šuštar, G., Pelhan, J., Lukežič, A., Kristan, M.: CoDi–an Exemplar-Conditioned Diffusion Model for Low-Shot Counting. arXiv preprint arXiv:2512.20153 (2025) 4
-
[50]
In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Wan, S., Wu, T.Y., Wong, W.H., Lee, C.Y.: ConfNet: Predict With Confidence. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp. 2921–2925. IEEE (2018) 8
work page 2018
-
[51]
Advances in Neural Information Processing Systems37, 138981–139001 (2024) 3, 4
Wang, C., Li, X., Qi, L., Ding, H., Tong, Y., Yang, M.H.: SemFlow: Binding Se- mantic Segmentation and Image Synthesis via Rectified Flow. Advances in Neural Information Processing Systems37, 138981–139001 (2024) 3, 4
work page 2024
-
[52]
Wang, D., Zhang, J., Xu, M., Liu, L., Wang, D., Gao, E., Han, C., Guo, H., Du, B., Tao, D., et al.: MTP: Advancing Remote Sensing Foundation Model via Multi- Task Pretraining. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (2024) 9, 2, 11, 14, 16
work page 2024
-
[53]
IEEE Transactions on Geoscience and Remote Sensing (2024) 4
Wang, J.X., Li, T., Chen, S.B., Gu, C.J., You, Z.H., Luo, B.: Diffusion Models and Pseudo-Change: A Transfer Learning-Based Change Detection in Remote Sensing Images. IEEE Transactions on Geoscience and Remote Sensing (2024) 4
work page 2024
-
[54]
In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
Wang, S., Leroy, V., Cabon, Y., Chidlovskii, B., Revaud, J.: DUSt3R: Geometric 3D Vision Made Easy. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 20697–20709 (2024) 8 ChangeFlow 19
work page 2024
-
[55]
IEEE Transactions on Geoscience and Remote Sensing62, 1–16 (2024) 2, 3, 4, 7, 9, 10, 11, 16
Wen, Y., Ma, X., Zhang, X., Pun, M.O.: Gcd-ddpm: A Generative Change De- tection Model Based on Difference-Feature-Guided DDPM. IEEE Transactions on Geoscience and Remote Sensing62, 1–16 (2024) 2, 3, 4, 7, 9, 10, 11, 16
work page 2024
-
[56]
IEEE Transactions on Geoscience and Remote Sensing (2024) 3, 9, 10, 2, 11, 16
Yu, W., Zhang, X., Das, S., Zhu, X.X., Ghamisi, P.: MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification. IEEE Transactions on Geoscience and Remote Sensing (2024) 3, 9, 10, 2, 11, 16
work page 2024
-
[57]
IEEE Transactions on Geoscience and Remote Sensing60, 1–13 (2022) 3, 9, 2, 11, 16
Zhang, C., Wang, L., Cheng, S., Li, Y.: SwinSUNet: Pure Transformer Network for Remote Sensing Image Change Detection. IEEE Transactions on Geoscience and Remote Sensing60, 1–13 (2022) 3, 9, 2, 11, 16
work page 2022
-
[58]
IEEE Transactions on Geoscience and Remote Sensing (2024) 9, 2, 11, 16
Zhang, H., Chen, H., Zhou, C., Chen, K., Liu, C., Zou, Z., Shi, Z.: Bifa: Re- mote Sensing Image Change Detection With Bitemporal Feature Alignment. IEEE Transactions on Geoscience and Remote Sensing (2024) 9, 2, 11, 16
work page 2024
-
[59]
IEEE Transactions on Pat- tern Analysis and Machine Intelligence47(2), 725–741 (2025) 4
Zheng, Z., Ermon, S., Kim, D., Zhang, L., Zhong, Y.: Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model. IEEE Transactions on Pat- tern Analysis and Machine Intelligence47(2), 725–741 (2025) 4
work page 2025
-
[60]
Zhu, Z., Qiu, S., Ye, S.: Remote Sensing of Land Change: A Multifaceted Perspec- tive. Remote Sensing of Environment282, 113266 (2022) 1 ChangeFlow 1 ChangeFlow - Latent Rectified Flow for Change Detection in Remote Sensing Supplementary Material In this Appendix, we provide additional details that extend beyond the scope of the main manuscript. The Appen...
work page 2022
-
[61]
We do not use class embeddings or classifier-free guidance. The input channel dimension is set to the sum of the image encoder dimensioncand the VAE latent dimensiond, specifically 1024 + 4, for a total of 1028, since the model receives a concatenation of feature difference and noise in the shape of a mask VAE latent (see Section 4.1). Output channel dime...
-
[62]
was selected as it represents a good speed-performance trade-off. While we could’ve selected a higher value to achieve even better CD performance, we be- lieve our selection is a fair choice given its similar inference speed to the previous best method. As already explained in the main paper, we use rotation and flipping aug- mentations, each applied with...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.