Scalable visual state space model with fractal scanning

Tang, L · 2024 · arXiv 2405.14480

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

baseline: 1 · other: 1

citation-polarity summary

baseline: 1 · unclear: 1

representative citing papers

Can Graphs Help Vision SSMs See Better?

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

GraphScan replaces geometric or coordinate-based scanning in Vision SSMs with learned local semantic graph routing, yielding SOTA results among such models on classification and segmentation tasks.

HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet

cs.CV · 2026-04-16 · unverdicted · novelty 6.0

HAMSA achieves 85.7% ImageNet-1K top-1 accuracy as a spectral-domain SSM with 2.2x faster inference and lower memory than transformers or scanning-based SSMs.

Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation

cs.CV · 2026-05-14 · unverdicted · novelty 4.0

Benchmarks Vision Mamba variants for AI-generated image detection against CNN, ViT, and VLM detectors on diverse datasets and synthetic sources, reporting promise alongside limitations.

citing papers explorer

Can Graphs Help Vision SSMs See Better?
cs.CV · 2026-05-11 · unverdicted · none · ref 52

HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet
cs.CV · 2026-04-16 · unverdicted · none · ref 61

Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation
cs.CV · 2026-05-14 · unverdicted · none · ref 71
