Plainmamba: Improving non- hierarchical mamba in visual recognition

Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J Crowley · 2024 · arXiv 2403.17695

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

background 1 baseline 1

citation-polarity summary

background 1 baseline 1

representative citing papers

FractalMamba++: Scaling Vision Mamba Across Resolutions via Hilbert Fractal Geometry

cs.CV · 2025-05-20 · unverdicted · novelty 7.0

FractalMamba++ scales Vision Mamba across resolutions by using Hilbert fractal serialization, hierarchy-based skip connections, and fractal-aware 2D rotary position encoding.

Mamba-Based Graph Convolutional Networks: Tackling Over-smoothing with Selective State Space

cs.LG · 2025-01-26 · unverdicted · novelty 7.0

MbaGCN combines message aggregation, selective state space transitions, and node state prediction to create a more scalable deep graph convolutional network.

Deformba: Vision State Space Model with Adaptive State Fusion

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

Deformba introduces context-adaptive state fusion to vision SSMs for better spatial augmentation and cross-stream interactions, showing strong results on 2D classification/detection/segmentation and 3D BEV perception benchmarks.

HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet

cs.CV · 2026-04-16 · unverdicted · novelty 6.0

HAMSA achieves 85.7% ImageNet-1K top-1 accuracy as a spectral-domain SSM with 2.2x faster inference and lower memory than transformers or scanning-based SSMs.

SCRWKV: Ultra-Compact Structure-Calibrated Vision-RWKV for Topological Crack Segmentation

cs.CV · 2026-05-14 · unverdicted · novelty 5.0

SCRWKV is a 1.22M-parameter Vision-RWKV model using Structure-Field Encoder with AMCM and SCIU modules plus CSHF decoder that reports F1 0.8428 and mIoU 0.8512 on TUT crack dataset while claiming to outperform prior SOTA.

TopoMamba: Topology-Aware Scanning and Fusion for Segmenting Heterogeneous Medical Visual Media

cs.CV · 2026-04-28 · unverdicted · novelty 5.0

TopoMamba improves medical image segmentation by combining topology-aware diagonal scans with standard cross-scans and a HSIC Gate for efficient fusion, yielding gains on thin and curved targets like the pancreas.

Beyond ZOH: Advanced Discretization Strategies for Vision Mamba

cs.CV · 2026-04-22 · unverdicted · novelty 4.0

Bilinear discretization improves Vision Mamba accuracy over zero-order hold on classification, segmentation, and detection benchmarks with only modest extra training cost.

Beyond Mamba: Enhancing State-space Models with Deformable Dilated Convolutions for Multi-scale Traffic Object Detection

cs.CV · 2026-04-09 · unverdicted · novelty 4.0

MDDCNet combines Mamba blocks with deformable dilated convolutions, enhanced feed-forward networks, and an attention-aggregating feature pyramid to achieve better multi-scale traffic object detection than prior detectors.

A Survey of Mamba

cs.LG · 2024-08-02 · unverdicted · novelty 2.0

The paper consolidates existing research on Mamba models, their architecture variants, adaptations to different data modalities, and applications across domains.

citing papers explorer

Showing 9 of 9 citing papers.

FractalMamba++: Scaling Vision Mamba Across Resolutions via Hilbert Fractal Geometry cs.CV · 2025-05-20 · unverdicted · none · ref 25
FractalMamba++ scales Vision Mamba across resolutions by using Hilbert fractal serialization, hierarchy-based skip connections, and fractal-aware 2D rotary position encoding.
Mamba-Based Graph Convolutional Networks: Tackling Over-smoothing with Selective State Space cs.LG · 2025-01-26 · unverdicted · none · ref 37
MbaGCN combines message aggregation, selective state space transitions, and node state prediction to create a more scalable deep graph convolutional network.
Deformba: Vision State Space Model with Adaptive State Fusion cs.CV · 2026-05-20 · unverdicted · none · ref 9
Deformba introduces context-adaptive state fusion to vision SSMs for better spatial augmentation and cross-stream interactions, showing strong results on 2D classification/detection/segmentation and 3D BEV perception benchmarks.
HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet cs.CV · 2026-04-16 · unverdicted · none · ref 72
HAMSA achieves 85.7% ImageNet-1K top-1 accuracy as a spectral-domain SSM with 2.2x faster inference and lower memory than transformers or scanning-based SSMs.
SCRWKV: Ultra-Compact Structure-Calibrated Vision-RWKV for Topological Crack Segmentation cs.CV · 2026-05-14 · unverdicted · none · ref 10
SCRWKV is a 1.22M-parameter Vision-RWKV model using Structure-Field Encoder with AMCM and SCIU modules plus CSHF decoder that reports F1 0.8428 and mIoU 0.8512 on TUT crack dataset while claiming to outperform prior SOTA.
TopoMamba: Topology-Aware Scanning and Fusion for Segmenting Heterogeneous Medical Visual Media cs.CV · 2026-04-28 · unverdicted · none · ref 12
TopoMamba improves medical image segmentation by combining topology-aware diagonal scans with standard cross-scans and a HSIC Gate for efficient fusion, yielding gains on thin and curved targets like the pancreas.
Beyond ZOH: Advanced Discretization Strategies for Vision Mamba cs.CV · 2026-04-22 · unverdicted · none · ref 28
Bilinear discretization improves Vision Mamba accuracy over zero-order hold on classification, segmentation, and detection benchmarks with only modest extra training cost.
Beyond Mamba: Enhancing State-space Models with Deformable Dilated Convolutions for Multi-scale Traffic Object Detection cs.CV · 2026-04-09 · unverdicted · none · ref 9
MDDCNet combines Mamba blocks with deformable dilated convolutions, enhanced feed-forward networks, and an attention-aggregating feature pyramid to achieve better multi-scale traffic object detection than prior detectors.
A Survey of Mamba cs.LG · 2024-08-02 · unverdicted · none · ref 213
The paper consolidates existing research on Mamba models, their architecture variants, adaptations to different data modalities, and applications across domains.

Plainmamba: Improving non- hierarchical mamba in visual recognition

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer