DGSSM formulates multimodal salient object detection as a progressive denoising process using diffusion-guided Mamba models, achieving better boundary accuracy and outperforming prior methods on 13 benchmarks.
arXiv preprint arXiv:2404.02668 (2024)
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
Controlled tests on LoveDA and ISPRS Potsdam show visual SSM encoders deliver favorable speed-accuracy trade-offs but suffer most from boundary errors under domain shift, indicating that robustness and boundary-aware decoding will matter more than intra-family encoder scaling.
The paper consolidates existing research on Mamba models, their architecture variants, adaptations to different data modalities, and applications across domains.
citing papers explorer
-
DGSSM: Diffusion guided state-space models for multimodal salient object detection
DGSSM formulates multimodal salient object detection as a progressive denoising process using diffusion-guided Mamba models, achieving better boundary accuracy and outperforming prior methods on 13 benchmarks.
-
A Controlled Benchmark of Visual State-Space Backbones with Domain-Shift and Boundary Analysis for Remote-Sensing Segmentation
Controlled tests on LoveDA and ISPRS Potsdam show visual SSM encoders deliver favorable speed-accuracy trade-offs but suffer most from boundary errors under domain shift, indicating that robustness and boundary-aware decoding will matter more than intra-family encoder scaling.
-
A Survey of Mamba
The paper consolidates existing research on Mamba models, their architecture variants, adaptations to different data modalities, and applications across domains.