Simba: Simplified mamba-based architecture for vision and multivariate time series

Badri N Patro, Vijay S Agneeswaran · 2024 · arXiv 2403.15360

12 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1 · baseline: 1

citation-polarity summary

background: 1 · baseline: 1

representative citing papers

CHASM: Cross-frequency Harmonized Axis-Separable Mixing for Spectral Token Operators

cs.CV · 2026-05-14 · unverdicted · novelty 7.0

CHASM introduces a cross-frequency harmonized axis-separable spectral mixer using a shared channel eigenbasis plus per-frequency positive gains, yielding consistent gains over same-backbone baselines in medical and natural image tasks.

A Novel Schur-Decomposition-Based Weight Projection Method for Stable State-Space Neural-Network Architectures

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

A real Schur decomposition projection maps the state matrix of discrete-time state-space layers onto its nearest stable counterpart, delivering accuracy comparable to prior stable identification methods with fewer weights.

GeoCert: Certified Geometric AI for Reliable Forecasting

cs.LG · 2026-04-25 · unverdicted · novelty 6.0

GeoCert uses hyperbolic geometry to unify forecasting with physical reasoning and built-in formal certification, claiming major gains in accuracy and efficiency.

NAKUL-Med: Spectral-Graph State Space Models with Dynamics Kernels for Medical Signals

eess.SP · 2026-04-24 · unverdicted · novelty 6.0

NAKUL achieves 91.7% accuracy on motor imagery EEG with 28% fewer parameters than EEG-Conformer by using dynamic kernel generation, spectral context modeling, and graph-guided spatial attention.

HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet

cs.CV · 2026-04-16 · unverdicted · novelty 6.0

HAMSA achieves 85.7% ImageNet-1K top-1 accuracy as a spectral-domain SSM with 2.2x faster inference and lower memory than transformers or scanning-based SSMs.

ABMAMBA: Multimodal Large Language Model with Aligned Hierarchical Bidirectional Scan for Efficient Video Captioning

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

ABMamba uses Mamba-based linear-complexity processing plus a novel Aligned Hierarchical Bidirectional Scan to deliver competitive video captioning on VATEX and MSR-VTT at roughly 3x higher throughput than typical Transformer MLLMs.

UniMamba: A Unified Spatial-Temporal Modeling Framework with State-Space and Attention Integration

cs.LG · 2026-03-06 · unverdicted · novelty 6.0

UniMamba integrates Mamba state-space dynamics with attention layers and transforms like FFT-Laplace to outperform prior models on multivariate time series forecasting benchmarks.

Titans: Learning to Memorize at Test Time

cs.LG · 2024-12-31 · unverdicted · novelty 6.0

Titans combine attention for current context with a learnable neural memory for long-term history, achieving better performance and scaling to over 2M-token contexts on language, reasoning, genomics, and time-series tasks.

Deep Learning Surrogates for Emulating Stochastic Climate Tipping Dynamics

cs.LG · 2026-05-20 · unverdicted · novelty 5.0

A dynamics-informed Temporal Fusion Transformer surrogate emulates stochastic tipping events in global ocean transport simulations with 465x speedup and high-fidelity timing predictions.

Dual Mamba for Node-Specific Representation Learning: Tackling Over-Smoothing with Selective State Space Modeling

cs.LG · 2025-11-10 · unverdicted · novelty 5.0

DMbaGCN combines a local state-evolution Mamba for node-specific dynamics with a global context-aware Mamba to reduce over-smoothing in deep graph neural networks.

The Hyperscale Lottery: How State-Space Models Have Sacrificed Edge Efficiency

cs.AR · 2026-04-09 · unverdicted · novelty 4.0

Mamba-3 architectural changes optimized for hyperscale GPUs cause 28% higher edge latency at 880M parameters and 48% at 15M parameters compared to earlier versions.

Advancing Intelligent Sequence Modeling: Evolution, Trade-offs, and Applications of State-Space Architectures from S4 to Mamba

cs.LG · 2025-03-22 · unverdicted · novelty 0.0

A survey tracing the evolution of state-space models like S4 and Mamba, their efficiency trade-offs, and applications in NLP, vision, and other domains.

citing papers explorer

CHASM: Cross-frequency Harmonized Axis-Separable Mixing for Spectral Token Operators · cs.CV · 2026-05-14 · unverdicted · none · ref 52
A Novel Schur-Decomposition-Based Weight Projection Method for Stable State-Space Neural-Network Architectures · cs.LG · 2026-05-14 · unverdicted · none · ref 21
GeoCert: Certified Geometric AI for Reliable Forecasting · cs.LG · 2026-04-25 · unverdicted · none · ref 31
NAKUL-Med: Spectral-Graph State Space Models with Dynamics Kernels for Medical Signals · eess.SP · 2026-04-24 · unverdicted · none · ref 28
HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet · cs.CV · 2026-04-16 · unverdicted · none · ref 43
ABMAMBA: Multimodal Large Language Model with Aligned Hierarchical Bidirectional Scan for Efficient Video Captioning · cs.CV · 2026-04-09 · unverdicted · none · ref 45
UniMamba: A Unified Spatial-Temporal Modeling Framework with State-Space and Attention Integration · cs.LG · 2026-03-06 · unverdicted · none · ref 24
Titans: Learning to Memorize at Test Time · cs.LG · 2024-12-31 · unverdicted · none · ref 82
Deep Learning Surrogates for Emulating Stochastic Climate Tipping Dynamics · cs.LG · 2026-05-20 · unverdicted · none · ref 53
Dual Mamba for Node-Specific Representation Learning: Tackling Over-Smoothing with Selective State Space Modeling · cs.LG · 2025-11-10 · unverdicted · none · ref 1
The Hyperscale Lottery: How State-Space Models Have Sacrificed Edge Efficiency · cs.AR · 2026-04-09 · unverdicted · none · ref 14
Advancing Intelligent Sequence Modeling: Evolution, Trade-offs, and Applications of State-Space Architectures from S4 to Mamba · cs.LG · 2025-03-22 · unverdicted · none · ref 64
