Spikingformer: Spike-driven residual learning for transformer- based spiking neural network

· 2023 · arXiv 2304.11954

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

representative citing papers

Otters++: A Time-to-first-spike Based Energy Efficient Optical Spiking Transformer

cs.AI · 2026-06-11 · unverdicted · novelty 7.0

Otters++ realizes TTFS via measured device decay in optical synapses, uses hybrid QNN-equivalent training with noise awareness, and reports 84.17% average GLUE score with energy gains over prior spiking transformers.

Elastic Spiking Transformers for Efficient Gesture Understanding

cs.NE · 2026-05-04 · unverdicted · novelty 7.0

A single Elastic Spiking Transformer model dynamically slices network width and attention heads at runtime via granularity-aware weight sharing, matching or exceeding fixed baselines on CIFAR and gesture datasets while reducing spike operations.

Winner-Take-All Spiking Transformer for Language Modeling

cs.NE · 2026-04-13 · unverdicted · novelty 7.0

Winner-take-all spiking self-attention replaces softmax in spiking transformers to support language modeling on 16 datasets with spike-driven, energy-efficient architectures.

Temporal-Aware Spiking Transformer Hashing Based on 3D-DWT

cs.CV · 2025-01-12 · unverdicted · novelty 7.0

Spikinghash combines 3D-DWT Spiking WaveMixer, Spiking Self-Attention, and a dynamic soft similarity loss to produce energy-efficient hash codes for DVS data retrieval.

Vision SmolMamba: Spike-Guided Token Pruning for Energy-Efficient Spiking State-Space Vision Models

cs.CV · 2026-04-28 · unverdicted · novelty 6.0

Vision SmolMamba adds spike-guided spatio-temporal token pruning to a bidirectional spiking state-space backbone, cutting estimated energy by at least 1.5x versus prior spiking Transformers and Spiking Mamba variants on ImageNet-1K and event-based datasets while keeping competitive accuracy.

BiSpikCLM: A Spiking Language Model integrating Softmax-Free Spiking Attention and Spike-Aware Alignment Distillation

cs.NE · 2026-04-14 · unverdicted · novelty 6.0

BiSpikCLM is the first fully binary spiking MatMul-free causal language model that matches ANN performance on generation tasks using only 4-6 percent of the compute via softmax-free spiking attention and spike-aware distillation.

Reconsidering the energy efficiency of spiking neural networks

cs.NE · 2024-08-29 · unverdicted · novelty 6.0

Rate-encoded SNNs with T timesteps outperform bit-equivalent QNNs in energy only when average spike rate falls below 6.4% for T in [5,10] under typical neuromorphic hardware, per an analytical model covering computation and data movement.

Breaking Global Self-Attention Bottlenecks in Transformer-based Spiking Neural Networks with Local Structure-Aware Self-Attention

cs.NE · 2026-05-12 · unverdicted · novelty 5.0

LSFormer uses local structure-aware spiking self-attention and spiking response pooling to cut global attention bottlenecks, delivering 4.3% and 8.6% accuracy gains on Tiny-ImageNet and N-CALTECH101 over prior transformer-based SNNs.

citing papers explorer

Showing 8 of 8 citing papers.

Otters++: A Time-to-first-spike Based Energy Efficient Optical Spiking Transformer cs.AI · 2026-06-11 · unverdicted · none · ref 34
Otters++ realizes TTFS via measured device decay in optical synapses, uses hybrid QNN-equivalent training with noise awareness, and reports 84.17% average GLUE score with energy gains over prior spiking transformers.
Elastic Spiking Transformers for Efficient Gesture Understanding cs.NE · 2026-05-04 · unverdicted · none · ref 16
A single Elastic Spiking Transformer model dynamically slices network width and attention heads at runtime via granularity-aware weight sharing, matching or exceeding fixed baselines on CIFAR and gesture datasets while reducing spike operations.
Winner-Take-All Spiking Transformer for Language Modeling cs.NE · 2026-04-13 · unverdicted · none · ref 19
Winner-take-all spiking self-attention replaces softmax in spiking transformers to support language modeling on 16 datasets with spike-driven, energy-efficient architectures.
Temporal-Aware Spiking Transformer Hashing Based on 3D-DWT cs.CV · 2025-01-12 · unverdicted · none · ref 8
Spikinghash combines 3D-DWT Spiking WaveMixer, Spiking Self-Attention, and a dynamic soft similarity loss to produce energy-efficient hash codes for DVS data retrieval.
Vision SmolMamba: Spike-Guided Token Pruning for Energy-Efficient Spiking State-Space Vision Models cs.CV · 2026-04-28 · unverdicted · none · ref 21
Vision SmolMamba adds spike-guided spatio-temporal token pruning to a bidirectional spiking state-space backbone, cutting estimated energy by at least 1.5x versus prior spiking Transformers and Spiking Mamba variants on ImageNet-1K and event-based datasets while keeping competitive accuracy.
BiSpikCLM: A Spiking Language Model integrating Softmax-Free Spiking Attention and Spike-Aware Alignment Distillation cs.NE · 2026-04-14 · unverdicted · none · ref 20
BiSpikCLM is the first fully binary spiking MatMul-free causal language model that matches ANN performance on generation tasks using only 4-6 percent of the compute via softmax-free spiking attention and spike-aware distillation.
Reconsidering the energy efficiency of spiking neural networks cs.NE · 2024-08-29 · unverdicted · none · ref 7
Rate-encoded SNNs with T timesteps outperform bit-equivalent QNNs in energy only when average spike rate falls below 6.4% for T in [5,10] under typical neuromorphic hardware, per an analytical model covering computation and data movement.
Breaking Global Self-Attention Bottlenecks in Transformer-based Spiking Neural Networks with Local Structure-Aware Self-Attention cs.NE · 2026-05-12 · unverdicted · none · ref 31
LSFormer uses local structure-aware spiking self-attention and spiking response pooling to cut global attention bottlenecks, delivering 4.3% and 8.6% accuracy gains on Tiny-ImageNet and N-CALTECH101 over prior transformer-based SNNs.

Spikingformer: Spike-driven residual learning for transformer- based spiking neural network

fields

years

verdicts

representative citing papers

citing papers explorer