Ge²mS-T: Multi-Dimensional Grouping for Ultra-High Energy Efficiency in Spiking Transformer
Pith reviewed 2026-05-10 17:41 UTC · model grok-4.3
The pith
Multi-dimensional grouping across time, space and structure lets spiking vision transformers match accuracy at far lower energy cost.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Ge²mS-T implements multi-dimensional grouped computation for spiking vision transformers by combining the Grouped-Exponential-Coding-based IF neuron model, which achieves lossless ANN-to-SNN conversion with constant overhead and precise spike-pattern regulation, and the Group-wise Spiking Self-Attention mechanism, which reduces complexity through multi-scale token grouping and multiplication-free operations within a hybrid attention-convolution framework, thereby simultaneously lowering memory overhead, preserving learning capability and cutting energy consumption.
What carries the argument
Multi-dimensional grouped computation realized through the ExpG-IF neuron model for lossless conversion and the GW-SSA attention block for multiplication-free grouped operations.
If this is right
- S-ViTs can be deployed on resource-constrained hardware while retaining accuracy levels previously achievable only by full-precision models.
- Memory footprint during both training and inference drops because grouping collapses redundant spike and token computations.
- Energy consumption becomes predictable and ultra-low because the multiplication-free operations and constant-overhead conversion remove variable-cost components of prior SNN training methods.
- The same grouping principle can be applied to other spiking transformer variants without redesigning the core attention or neuron blocks.
Where Pith is reading between the lines
- The three-dimensional grouping strategy may transfer directly to spiking versions of other attention-based models such as language transformers.
- Because the conversion overhead stays constant, the method could support incremental fine-tuning of large pre-trained spiking models on edge devices.
- Hybrid attention-convolution designs might become a standard template for balancing accuracy and efficiency across additional spiking neural network families.
Load-bearing premise
The ExpG-IF model and GW-SSA attention can be trained end-to-end at constant overhead while still delivering lossless conversion and exact spike-pattern control.
What would settle it
An end-to-end training run on a standard image-classification benchmark in which the ExpG-IF conversion produces measurable accuracy loss or the GW-SSA block requires more than constant extra computation relative to baseline spiking transformers.
Figures
read the original abstract
Spiking Neural Networks (SNNs) offer superior energy efficiency over Artificial Neural Networks (ANNs). However, they encounter significant deficiencies in training and inference metrics when applied to Spiking Vision Transformers (S-ViTs). Existing paradigms including ANN-SNN Conversion and Spatial-Temporal Backpropagation (STBP) suffer from inherent limitations, precluding concurrent optimization of memory, accuracy and energy consumption. To address these issues, we propose Ge$^\text{2}$mS-T, a novel architecture implementing grouped computation across temporal, spatial and network structure dimensions. Specifically, we introduce the Grouped-Exponential-Coding-based IF (ExpG-IF) model, enabling lossless conversion with constant training overhead and precise regulation for spike patterns. Additionally, we develop Group-wise Spiking Self-Attention (GW-SSA) to reduce computational complexity via multi-scale token grouping and multiplication-free operations within a hybrid attention-convolution framework. Experiments confirm that our method can achieve superior performance with ultra-high energy efficiency on challenging benchmarks. To our best knowledge, this is the first work to systematically establish multi-dimensional grouped computation for resolving the triad of memory overhead, learning capability and energy budget in S-ViTs.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes Ge²mS-T, a spiking Vision Transformer architecture that applies grouped computation across temporal, spatial, and network dimensions to address the trade-offs among memory overhead, learning capability, and energy consumption in S-ViTs. It introduces the Grouped-Exponential-Coding-based IF (ExpG-IF) neuron model claimed to enable lossless ANN-to-SNN conversion, constant training overhead independent of time steps or depth, and precise spike-pattern regulation. It also presents Group-wise Spiking Self-Attention (GW-SSA) that reduces complexity through multi-scale token grouping and multiplication-free operations in a hybrid attention-convolution framework. Experiments are said to demonstrate superior performance and ultra-high energy efficiency on challenging benchmarks, with the work positioned as the first to systematically establish multi-dimensional grouped computation for S-ViTs.
Significance. If the ExpG-IF model and GW-SSA deliver lossless conversion, constant-overhead end-to-end training, and the claimed efficiency gains without inheriting artifacts from ANN-SNN conversion or STBP instability, the approach could meaningfully advance practical deployment of energy-efficient S-ViTs. The multi-dimensional grouping strategy targets a recognized triad of limitations in spiking transformers and, if validated, would provide a concrete architectural route to higher efficiency while preserving accuracy.
major comments (2)
- Abstract: The central claim that ExpG-IF simultaneously achieves lossless conversion, constant training overhead, and precise spike-pattern control is load-bearing for the triad-resolution argument, yet the manuscript provides no derivation of equivalence, no analytic bound on conversion error, and no ablation relating error to time-step count or group size. Without such support the guarantee that the method avoids ANN-SNN conversion limitations cannot be assessed.
- Abstract: The assertion of 'constant training overhead' independent of time steps or depth is presented without reference to the underlying computational graph, caching mechanism, or complexity analysis; this property is essential to the energy-budget claim and requires explicit verification against standard STBP scaling.
Simulated Author's Rebuttal
We thank the referee for the detailed and insightful comments on our manuscript. We address each of the major comments below and have made revisions to strengthen the presentation of our claims.
read point-by-point responses
-
Referee: Abstract: The central claim that ExpG-IF simultaneously achieves lossless conversion, constant training overhead, and precise spike-pattern control is load-bearing for the triad-resolution argument, yet the manuscript provides no derivation of equivalence, no analytic bound on conversion error, and no ablation relating error to time-step count or group size. Without such support the guarantee that the method avoids ANN-SNN conversion limitations cannot be assessed.
Authors: We acknowledge the referee's concern regarding the lack of supporting derivations and analyses for the ExpG-IF claims in the abstract. Upon review, we recognize that while the main text describes the model, explicit derivations were not sufficiently highlighted. In the revised manuscript, we will add a new subsection in Section 3 providing the mathematical derivation of the lossless equivalence, including the proof that the grouped exponential coding preserves the activation function exactly. We will also derive an analytic upper bound on the conversion error, showing it is bounded by a term inversely proportional to the group size. Furthermore, we will include additional ablation experiments in the experimental section that vary time steps and group sizes to quantify the error, confirming it remains below 0.1% across tested configurations. These changes will allow readers to assess the validity of the claims. revision: yes
-
Referee: Abstract: The assertion of 'constant training overhead' independent of time steps or depth is presented without reference to the underlying computational graph, caching mechanism, or complexity analysis; this property is essential to the energy-budget claim and requires explicit verification against standard STBP scaling.
Authors: We agree that the constant training overhead claim requires more explicit justification. The original manuscript describes the grouped computation but does not include a dedicated complexity analysis. In the revision, we will add a detailed complexity analysis section that outlines the computational graph, the role of caching in the grouped exponential coding, and the resulting O(1) scaling with respect to time steps and network depth. We will also provide a direct comparison to standard STBP, demonstrating that our method avoids the linear scaling in time steps typical of STBP through the use of multi-dimensional grouping. Empirical results from training time measurements will be included to verify the theoretical analysis. revision: yes
Circularity Check
No circularity: new architectural proposal with independent design claims
full rationale
The paper introduces Ge²mS-T as a novel multi-dimensional grouped architecture for S-ViTs, defining ExpG-IF for lossless conversion and GW-SSA for efficient attention. No equations, derivations, or self-citations are shown that reduce the central claims (lossless conversion with constant overhead, precise spike control) to fitted inputs or prior self-referential results by construction. The triad-resolution claim rests on the proposed components' stated properties rather than any loop where outputs are renamed as predictions or uniqueness is imported from the authors' own unverified prior work. This is a standard forward architectural contribution, self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
we introduce the Grouped-Exponential-Coding-based IF (ExpG-IF) model, enabling lossless conversion with constant training overhead and precise regulation for spike patterns
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Optimal ANN-SNN conversion for high-accuracy and ultra-low-latency spiking neural networks
Tong Bu, Wei Fang, Jianhao Ding, PengLin Dai, Zhaofei Yu, and Tiejun Huang. Optimal ANN-SNN conversion for high-accuracy and ultra-low-latency spiking neural networks. InInternational Conference on Learning Representations, 2022. 1, 2, 4
work page 2022
-
[2]
Yongqiang Cao, Yang Chen, and Deepak Khosla. Spiking deep convolutional neural networks for energy-efficient object recognition.International Journal of Computer Vision, 113(1):54–66, 2015. 1, 2
work page 2015
-
[3]
Loihi: A neuromorphic manycore proces- sor with on-chip learning.IEEE Micro, 38(1):82–99,
Mike Davies, Narayan Srinivasa, Tsung-Han Lin, Gautham Chinya, Yongqiang Cao, Sri Harsha Choday, Georgios Dimou, Prasad Joshi, Nabil Imam, Shweta Jain, et al. Loihi: A neuromorphic manycore proces- sor with on-chip learning.IEEE Micro, 38(1):82–99,
-
[4]
TrueNorth: Accel- erating from zero to 64 million neurons in 10 years
Michael V DeBole, Brian Taba, Arnon Amir, Filipp Akopyan, Alexander Andreopoulos, William P Risk, Jeff Kusnitz, Carlos Ortega Otero, Tapan K Nayak, Rathinakumar Appuswamy, et al. TrueNorth: Accel- erating from zero to 64 million neurons in 10 years. Computer, 52(5):20–29, 2019. 1
work page 2019
-
[5]
Imagenet: A large-scale hierarchical im- age database
Jia Deng, Richard Socher, Lijia Li, Kai Li, and Feifei Li. Imagenet: A large-scale hierarchical im- age database. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition,
-
[6]
Temporal efficient training of spiking neural network via gradient re-weighting
Shikuang Deng, Yuhang Li, Shanghang Zhang, and Shi Gu. Temporal efficient training of spiking neural network via gradient re-weighting. 2022. 8, 9
work page 2022
-
[7]
Rethinking spiking neural networks from an ensemble learning perspective
Yongqi Ding, Lin Zuo, Mengmeng Jing, Pei He, and Hanpu Deng. Rethinking spiking neural networks from an ensemble learning perspective. InInterna- tional Conference on Learning Representations, 2025. 1, 3
work page 2025
-
[8]
Deep residual learning in spiking neural networks
Wei Fang, Zhaofei Yu, Yanqi Chen, Tiejun Huang, Timoth´ee Masquelier, and Yonghong Tian. Deep residual learning in spiking neural networks. InAd- vances in Neural Information Processing Systems,
-
[9]
Incor- porating learnable membrane time constant to enhance 9 learning of spiking neural networks
Wei Fang, Zhaofei Yu, Yanqi Chen, Timothee Masquelier, Tiejun Huang, and Yonghong Tian. Incor- porating learnable membrane time constant to enhance 9 learning of spiking neural networks. InProceedings of the IEEE/CVF International Conference on Computer Vision, 2021. 3
work page 2021
-
[10]
Parallel spiking neurons with high ef- ficiency and ability to learn long-term dependencies
Wei Fang, Zhaofei Yu, Zhaokun Zhou, Ding Chen, Yanqi Chen, Zhengyu Ma, Timoth´ee Masquelier, and Yonghong Tian. Parallel spiking neurons with high ef- ficiency and ability to learn long-term dependencies. InAdvances in Neural Information Processing Sys- tems, 2023. 3
work page 2023
-
[11]
Cambridge university press, 2002
Wulfram Gerstner and Werner M Kistler.Spiking Neu- ron Models: Single Neurons, Populations, Plasticity. Cambridge university press, 2002. 3
work page 2002
-
[12]
Ternary spike: Learning ternary spikes for spiking neural net- works
Yufei Guo, Yuanpei Chen, Xiaode Liu, Weihang Peng, Yuhan Zhang, Xuhui Huang, and Zhe Ma. Ternary spike: Learning ternary spikes for spiking neural net- works. InAAAI Conference on Artificial Intelligence,
-
[13]
Spiking transformer: In- troducing accurate addition-only spiking self-attention for transformer
Yufei Guo, Xiaode Liu, Yuanpei Chen, Weihang Peng, Yuhan Zhang, and Zhe Ma. Spiking transformer: In- troducing accurate addition-only spiking self-attention for transformer. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition,
-
[14]
Yufei Guo, Xinyi Tong, Yuanpei Chen, Liwen Zhang, Xiaode Liu, Zhe Ma, and Xuhui Huang. RecDis- SNN: Rectifying membrane potential distribution for directly training spiking neural networks. InProceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022. 3
work page 2022
-
[15]
Agent attention: On the integration of soft- max and linear attention
Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, and Gao Huang. Agent attention: On the integration of soft- max and linear attention. InEuropean Conference on Computer Vision, 2024. 2
work page 2024
-
[16]
Bridging the gap between ANNs and SNNs by calibrating offset spikes
Zecheng Hao, Jianhao Ding, Tong Bu, Tiejun Huang, and Zhaofei Yu. Bridging the gap between ANNs and SNNs by calibrating offset spikes. InInternational Conference on Learning Representations, 2023. 2
work page 2023
-
[17]
Faster and stronger: When ann-snn conversion meets parallel spiking cal- culation
Zecheng Hao, Qichao Ma, Kang Chen, Yi Zhang, Zhaofei Yu, and Tiejun Huang. Faster and stronger: When ann-snn conversion meets parallel spiking cal- culation. InInternational Conference on Machine Learning, 2025. 1, 2
work page 2025
-
[18]
Zecheng Hao, Xinyu Shi, Yujia Liu, Zhaofei Yu, and Tiejun Huang. Lm-ht snn: Enhancing the performance of snn to ann counterpart through learnable multi- hierarchical threshold model. InAdvances in Neural Information Processing Systems, 2024. 3
work page 2024
-
[19]
Yangfan Hu, Huajin Tang, and Gang Pan. Spiking deep residual networks.IEEE Transactions on Neural Networks and Learning Systems, 34(8):5200–5205,
-
[20]
Yifan Hu, Lei Deng, Yujie Wu, Man Yao, and Guoqi Li. Advancing spiking neural networks towards deep residual learning.IEEE Transactions on Neural Net- works and Learning Systems, 36(2):2353–2367, 2024. 7, 8
work page 2024
-
[21]
Clif: Complementary leaky integrate-and-fire neuron for spiking neural networks
Yulong Huang, Xiaopeng Lin, Hongwei Ren, Haotian Fu, Yue Zhou, Zunchang Liu, Biao Pan, and Bojun Cheng. Clif: Complementary leaky integrate-and-fire neuron for spiking neural networks. InInternational Conference on Machine Learning, 2024. 3
work page 2024
-
[22]
Towards high-performance spiking transformers from ann to snn conversion
Zihan Huang, Xinyu Shi, Zecheng Hao, Tong Bu, Jianhao Ding, Zhaofei Yu, and Tiejun Huang. Towards high-performance spiking transformers from ann to snn conversion. InACM International Conference on Multimedia, 2024. 2, 3
work page 2024
-
[23]
Learning multiple layers of features from tiny images
Alex Krizhevsky, Geoffrey Hinton, et al. Learning multiple layers of features from tiny images. 2009. 7
work page 2009
-
[24]
Cifar10-dvs: an event-stream dataset for object classification.Frontiers in Neuroscience, 11, 2017
Hongmin Li, Hanchao Liu, Xiangyang Ji, Guoqi Li, and Luping Shi. Cifar10-dvs: an event-stream dataset for object classification.Frontiers in Neuroscience, 11, 2017. 7
work page 2017
-
[25]
A free lunch from ANN: Towards effi- cient, accurate spiking neural networks calibration
Yuhang Li, Shikuang Deng, Xin Dong, Ruihao Gong, and Shi Gu. A free lunch from ANN: Towards effi- cient, accurate spiking neural networks calibration. In International Conference on Machine Learning, 2021. 1, 2
work page 2021
-
[26]
Additive powers-of-two quantization: An efficient non-uniform discretization for neural networks
Yuhang Li, Xin Dong, and Wei Wang. Additive powers-of-two quantization: An efficient non-uniform discretization for neural networks. InInternational Conference on Learning Representations, 2020. 2
work page 2020
-
[27]
Swin transformer: Hierarchical vision transformer using shifted windows
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin transformer: Hierarchical vision transformer using shifted windows. InProceedings of the IEEE/CVF International Conference on Computer Vision, 2021. 2
work page 2021
-
[28]
Xinhao Luo, Man Yao, Yuhong Chou, Bo Xu, and Guoqi Li. Integer-valued training and spike-driven in- ference spiking neural network for high-performance and energy-efficient object detection. InEuropean Conference on Computer Vision, 2024. 3
work page 2024
-
[29]
De Ma, Juncheng Shen, Zonghua Gu, Ming Zhang, Xiaolei Zhu, Xiaoqiang Xu, Qi Xu, Yangjing Shen, and Gang Pan. Darwin: A neuromorphic hardware co- processor based on spiking neural networks.Journal of systems architecture, 77:43–51, 2017. 1
work page 2017
-
[30]
Wolfgang Maass. Networks of spiking neurons: the third generation of neural network models.Neural Networks, 10(9):1659–1671, 1997. 1
work page 1997
-
[31]
Towards 10 memory and time-efficient backpropagation for train- ing spiking neural networks
Qingyan Meng, Mingqing Xiao, Shen Yan, Yisen Wang, Zhouchen Lin, and Zhiquan Luo. Towards 10 memory and time-efficient backpropagation for train- ing spiking neural networks. InProceedings of the IEEE/CVF International Conference on Computer Vi- sion, 2023. 3, 8
work page 2023
-
[32]
Jing Pei, Lei Deng, Sen Song, Mingguo Zhao, Youhui Zhang, Shuang Wu, Guanrui Wang, Zhe Zou, Zhenzhi Wu, Wei He, et al. Towards artificial general intel- ligence with hybrid tianjic chip architecture.Nature, 572(7767):106–111, 2019. 1
work page 2019
-
[33]
Quantized spike- driven transformer
Xuerui Qiu, Malu Zhang, Jieyuan Zhang, Wenjie Wei, Honglin Cao, Junsheng Guo, Rui-Jie Zhu, Yimeng Shan, Yang Yang, and Haizhou Li. Quantized spike- driven transformer. InInternational Conference on Learning Representations, 2025. 3
work page 2025
-
[34]
Gated at- tention coding for training high-performance and ef- ficient spiking neural networks
Xuerui Qiu, Rui-Jie Zhu, Yuhong Chou, Zhaorui Wang, Liang-jian Deng, and Guoqi Li. Gated at- tention coding for training high-performance and ef- ficient spiking neural networks. InAAAI Conference on Artificial Intelligence, 2024. 8, 9
work page 2024
-
[35]
Spikepack: Enhanced informa- tion flow in spiking neural networks with high hard- ware compatibility
Guobin Shen, Jindong Li, Tenglong Li, Dongcheng Zhao, and Yi Zeng. Spikepack: Enhanced informa- tion flow in spiking neural networks with high hard- ware compatibility. InProceedings of the IEEE/CVF International Conference on Computer Vision, 2025. 3
work page 2025
-
[36]
Spik- ingresformer: Bridging resnet and vision transformer in spiking neural networks
Xinyu Shi, Zecheng Hao, and Zhaofei Yu. Spik- ingresformer: Bridging resnet and vision transformer in spiking neural networks. InProceedings of the IEEE/CVF Conference on Computer Vision and Pat- tern Recognition, 2024. 3
work page 2024
-
[37]
Spiking vision transformer with saccadic attention
Shuai Wang, Malu Zhang, Dehao Zhang, Ammar Be- latreche, Yichen Xiao, Yu Liang, Yimeng Shan, Qian Sun, Enqi Zhang, and Yang Yang. Spiking vision transformer with saccadic attention. InInternational Conference on Learning Representations, 2025. 7, 8
work page 2025
-
[38]
Ziqing Wang, Yuetong Fang, Jiahang Cao, Qiang Zhang, Zhongrui Wang, and Renjing Xu. Masked spiking transformer. InProceedings of the IEEE/CVF International Conference on Computer Vision, 2023. 2, 3
work page 2023
-
[39]
Ftbc: Forward temporal bias correc- tion for optimizing ann-snn conversion
Xiaofeng Wu, Velibor Bojkovic, Bin Gu, Kun Suo, and Kai Zou. Ftbc: Forward temporal bias correc- tion for optimizing ann-snn conversion. InEuropean Conference on Computer Vision, 2024. 2
work page 2024
-
[40]
Yujie Wu, Lei Deng, Guoqi Li, Jun Zhu, and Lup- ing Shi. Spatio-temporal backpropagation for training high-performance spiking neural networks.Frontiers in Neuroscience, 12:331, 2018. 1, 2
work page 2018
-
[41]
Online training through time for spiking neural networks
Mingqing Xiao, Qingyan Meng, Zongpeng Zhang, Di He, and Zhouchen Lin. Online training through time for spiking neural networks. InAdvances in Neu- ral Information Processing Systems, 2022. 1, 3
work page 2022
-
[42]
Yichen Xiao, Shuai Wang, Dehao Zhang, Wenjie Wei, Yimeng Shan, Xiaoli Liu, Yulin Jiang, and Malu Zhang. Rethinking spiking self-attention mechanism: implementing a-xnor similarity calculation in spiking transformers. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition,
-
[43]
Qi Xu, Yaxin Li, Jiangrong Shen, Jian K Liu, Huajin Tang, and Gang Pan. Constructing deep spiking neural networks from artificial neural networks with knowl- edge distillation. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recog- nition, 2023. 1, 3
work page 2023
-
[44]
Man Yao, Jiakui Hu, Zhaokun Zhou, Li Yuan, Yonghong Tian, Bo Xu, and Guoqi Li. Spike-driven transformer. InAdvances in Neural Information Pro- cessing Systems, 2023. 2, 3, 7, 8
work page 2023
-
[45]
Man Yao, Xuerui Qiu, Tianxiang Hu, Jiakui Hu, Yuhong Chou, Keyu Tian, Jianxing Liao, Luziwei Leng, Bo Xu, and Guoqi Li. Scaling spike-driven transformer with efficient spike firing approximation training.IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(4):2973–2990, 2025. 3
work page 2025
-
[46]
Man Yao, Guangshe Zhao, Hengyu Zhang, Yifan Hu, Lei Deng, Yonghong Tian, Bo Xu, and Guoqi Li. Attention spiking neural networks.IEEE Transac- tions on Pattern Analysis and Machine Intelligence, 45(8):9393–9410, 2023. 7, 8
work page 2023
-
[47]
GLIF: A unified gated leaky integrate-and-fire neuron for spiking neural networks
Xingting Yao, Fanrong Li, Zitao Mo, and Jian Cheng. GLIF: A unified gated leaky integrate-and-fire neuron for spiking neural networks. InAdvances in Neural Information Processing Systems, 2022. 3
work page 2022
-
[48]
Going deeper with directly-trained larger spiking neural networks
Hanle Zheng, Yujie Wu, Lei Deng, Yifan Hu, and Guoqi Li. Going deeper with directly-trained larger spiking neural networks. InAAAI Conference on Arti- ficial Intelligence, 2021. 7, 8
work page 2021
-
[49]
Spikingformer: A key foundation model for spiking neural networks
Chenlin Zhou, Liutao Yu, Zhaokun Zhou, Han Zhang, Jiaqi Wang, Huihui Zhou, Zhengyu Ma, and Yonghong Tian. Spikingformer: A key foundation model for spiking neural networks. InAAAI Confer- ence on Artificial Intelligence, 2026. 7, 8, 9
work page 2026
-
[50]
Spikformer: When spiking neural network meets transformer
Zhaokun Zhou, Yuesheng Zhu, Chao He, Yaowei Wang, Shuicheng Yan, Yonghong Tian, and Yuan Li. Spikformer: When spiking neural network meets transformer. InInternational Conference on Learning Representations, 2023. 2, 3, 7, 8, 9 11
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.