GraphScan replaces geometric or coordinate-based scanning in Vision SSMs with learned local semantic graph routing, yielding SOTA results among such models on classification and segmentation tasks.
Scalable visual state space model with fractal scanning
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CV 3years
2026 3verdicts
UNVERDICTED 3representative citing papers
HAMSA achieves 85.7% ImageNet-1K top-1 accuracy as a spectral-domain SSM with 2.2x faster inference and lower memory than transformers or scanning-based SSMs.
Benchmarks Vision Mamba variants for AI-generated image detection against CNN, ViT, and VLM detectors on diverse datasets and synthetic sources, reporting promise alongside limitations.
citing papers explorer
-
Can Graphs Help Vision SSMs See Better?
GraphScan replaces geometric or coordinate-based scanning in Vision SSMs with learned local semantic graph routing, yielding SOTA results among such models on classification and segmentation tasks.
-
HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet
HAMSA achieves 85.7% ImageNet-1K top-1 accuracy as a spectral-domain SSM with 2.2x faster inference and lower memory than transformers or scanning-based SSMs.
-
Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation
Benchmarks Vision Mamba variants for AI-generated image detection against CNN, ViT, and VLM detectors on diverse datasets and synthetic sources, reporting promise alongside limitations.