IB-HFN introduces a dual-stream backbone with spatial information bottleneck fusion, local-global gating, and joint optimization to achieve superior structural and spectral fidelity in SAR-assisted optical cloud removal on the SEN12MS-CR dataset.
Task-Driven Prompt Learning: A Joint Framework for Multi-modal Cloud Removal and Segmentation
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Optical remote sensing imagery is indispensable for Earth observation, yet persistent cloud occlusion limits its downstream utility. Most cloud removal (CR) methods are optimized for low-level fidelity and can over-smooth textures and boundaries that are critical for analysis-ready data (ARD), leading to a mismatch between visually plausible restoration and semantic utility. To bridge this gap, we propose TDP-CR, a task-driven multimodal framework that jointly performs cloud removal and land-cover segmentation. Central to our approach is a Prompt-Guided Fusion (PGF) mechanism, which utilizes a learnable degradation prompt to encode cloud thickness and spatial uncertainty. By combining global channel context with local prompt-conditioned spatial bias, PGF adaptively integrates Synthetic Aperture Radar (SAR) information only where optical data is corrupted. We further introduce a parameter-efficient two-phase training strategy that decouples reconstruction and semantic representation learning. Experiments on the LuojiaSET-OSFCR dataset demonstrate the superiority of our framework: TDP-CR surpasses heavy state-of-the-art baselines by 0.18 dB in PSNR while using only 15\% of the parameters, and achieves a 1.4\% improvement in mIoU consistently against multi-task competitors, effectively delivering analysis-ready data.
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
IB-HFN: Information Bottleneck-Driven SAR-Optical Fusion Network for High-Fidelity Cloud Removal
IB-HFN introduces a dual-stream backbone with spatial information bottleneck fusion, local-global gating, and joint optimization to achieve superior structural and spectral fidelity in SAR-assisted optical cloud removal on the SEN12MS-CR dataset.