DiffIML applies score-based generative modeling to image manipulation localization, recovering coherent masks iteratively from noise to improve generalization on unseen manipulation types.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3roles
method 1polarities
use method 1representative citing papers
GTPBD-MM is the first multimodal benchmark for global terraced parcel extraction, integrating image, text, and DEM data with experiments showing that textual and terrain cues improve delineation accuracy over image-only approaches.
GTC improves multi-modal recommendation by using user-conditional diffusion-based feature filtering and total correlation optimization, achieving up to 28.3% gains in NDCG@5 on benchmarks.
citing papers explorer
-
Towards Generalized Image Manipulation Localization via Score-based Model
DiffIML applies score-based generative modeling to image manipulation localization, recovering coherent masks iteratively from noise to improve generalization on unseen manipulation types.
-
GTPBD-MM: A Global Terraced Parcel and Boundary Dataset with Multi-Modality
GTPBD-MM is the first multimodal benchmark for global terraced parcel extraction, integrating image, text, and DEM data with experiments showing that textual and terrain cues improve delineation accuracy over image-only approaches.
-
User-Aware Conditional Generative Total Correlation Learning for Multi-Modal Recommendation
GTC improves multi-modal recommendation by using user-conditional diffusion-based feature filtering and total correlation optimization, achieving up to 28.3% gains in NDCG@5 on benchmarks.