pith. sign in
Pith Number

pith:OB24PFTL

pith:2024:OB24PFTLDJTXXN5SRDB6TCVQY7
not attested not anchored not stored refs resolved

VMamba: Visual State Space Model

Hongtian Yu, Jianbin Jiao, Lingxi Xie, Qixiang Ye, Yaowei Wang, Yue Liu, Yunfan Liu, Yunjie Tian, Yuzhong Zhao

VMamba adapts Mamba's state-space model to vision by scanning 2D images along four fixed routes to reach linear time complexity.

arxiv:2401.10166 v4 · 2024-01-18 · cs.CV

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{OB24PFTLDJTXXN5SRDB6TCVQY7}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

Extensive experiments demonstrate VMamba's promising performance across diverse visual perception tasks, highlighting its superior input scaling efficiency compared to existing benchmark models.

C2weakest assumption

That scanning along exactly four fixed routes in the SS2D module collects sufficient contextual information from 2D data to match or exceed the modeling power of full 2D attention or convolution without missing important spatial relationships.

C3one line summary

VMamba introduces a state-space vision backbone using 2D selective scanning across four routes to achieve linear complexity and strong performance on image tasks.

References

86 extracted · 86 resolved · 7 Pith anchors

[1] Xcit: Cross-covariance image trans- formers 2021
[2] Prefix sums and their applications 1990
[3] MMDetection: Open mmlab detection toolbox and benchmark 1906 · arXiv:1906.07155
[4] MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark 2020
[5] Deformable convolutional networks 2017

Formal links

2 machine-checked theorem links

Cited by

26 papers in Pith

Receipt and verification
First computed 2026-05-17T23:38:46.990659Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

7075c7966b1a677bb7b288c3e98ab0c7e08b89cb4ba464e45e93771df872ee2a

Aliases

arxiv: 2401.10166 · arxiv_version: 2401.10166v4 · doi: 10.48550/arxiv.2401.10166 · pith_short_12: OB24PFTLDJTX · pith_short_16: OB24PFTLDJTXXN5S · pith_short_8: OB24PFTL
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/OB24PFTLDJTXXN5SRDB6TCVQY7 \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 7075c7966b1a677bb7b288c3e98ab0c7e08b89cb4ba464e45e93771df872ee2a
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "d1976141e4bbd82760778fc2d41a1732eb1b3a55c6903af519e0f06de9e5597e",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.CV",
    "submitted_at": "2024-01-18T17:55:39Z",
    "title_canon_sha256": "03bf44286ec830754e2d9fbd1ce07ca970a3364b70139e148c255367b9575b0d"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2401.10166",
    "kind": "arxiv",
    "version": 4
  }
}