Memory Wall is not gone: A Critical Outlook on Memory Architecture in Digital Neuromorphic Computing
Pith reviewed 2026-05-10 16:59 UTC · model grok-4.3
The pith
Digital neuromorphic processors create a new memory wall as on-chip SRAM and STT-MRAM dominate area and energy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Analysis of energy and area efficiency across selected digital neuromorphic processors shows that on-chip memory, including SRAM and STT-MRAM, has become the dominant consumer, forming a new memory wall that may prevent these processors from competing in edge applications unless memory organization is re-evaluated.
What carries the argument
Critical comparison of area and energy costs of on-chip memory technologies within distributed digital neuromorphic architectures.
If this is right
- Digital neuromorphic processors will struggle to compete in edge and embedded applications.
- Re-evaluation of memory organization is required before these systems can scale effectively.
- Emerging memories such as STT-MRAM still contribute to the dominance of on-chip costs rather than solving them.
- Future designs must explore new memory organizations to remove the fresh bottleneck.
Where Pith is reading between the lines
- The same memory overhead pattern may appear in other non-von Neumann hardware beyond the digital neuromorphic class examined here.
- Adoption rates for neuromorphic chips in IoT and mobile devices could remain low until memory integration improves.
- Designers could test whether hybrid on-chip and off-chip memory splits change overall system efficiency in practice.
Load-bearing premise
The energy and area patterns seen in the chosen processors and memory technologies will hold for digital neuromorphic designs in general and will block competitive edge performance without major redesign.
What would settle it
Publication of a digital neuromorphic chip for edge tasks where on-chip memory uses less than 30 percent of total area and less than 40 percent of energy while maintaining claimed performance would falsify the new memory wall as a general barrier.
Figures
read the original abstract
The rapid advancement of neuromorphic technology aims to address the memory wall challenge inherent in conventional von Neumann architectures. This paper critically examines current digital neuromorphic processors and their strategies to mitigate this bottleneck. While designed to bring computation closer to memory through distributed architectures, our findings indicate that on-chip memory systems, including SRAM and emerging technologies like STT-MRAM, have become significant consumers of area and energy, leading to a new memory wall. Through an analysis of energy and area efficiency in various memory technologies, we argue that without a re-evaluation of memory organization, digital neuromorphic processors may struggle to compete effectively in edge and embedded applications. We conclude with potential pathways for future research to overcome the limitations of on-chip memory in neuromorphic systems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper critically examines digital neuromorphic processors and their distributed architectures intended to mitigate the von Neumann memory wall. It argues that on-chip memory systems (SRAM and emerging technologies such as STT-MRAM) have become dominant consumers of area and energy, thereby creating a new memory wall. Through an analysis of energy and area efficiency across selected memory technologies, the manuscript concludes that without re-evaluation of memory organization, these processors will struggle to compete in edge and embedded applications, and it outlines potential future research pathways.
Significance. If the quantitative analysis proves representative and the efficiency comparisons hold under broader sampling, the result would be significant for neuromorphic hardware design by identifying a persistent on-chip memory bottleneck that could limit scalability and energy efficiency at the edge. The work usefully redirects attention from computation-centric optimizations to memory organization as the load-bearing constraint.
major comments (2)
- [Abstract] Abstract: the central claim that on-chip memory systems have become significant consumers of area and energy (creating a new memory wall) rests on an analysis of energy and area efficiency, yet the abstract supplies no quantitative results, data tables, methods, or specific processor examples to substantiate the magnitude or generality of this dominance.
- [Abstract] Abstract: the generalization from the examined processors and memory technologies (SRAM, STT-MRAM) to the broader class of digital neuromorphic designs is asserted without evidence of representativeness, such as coverage statistics, systematic sampling across core counts/sparsity/technology nodes, or sensitivity checks. This step is load-bearing for the conclusion that the new memory wall will prevent effective competition in edge applications.
minor comments (1)
- [Abstract] The abstract would benefit from a single sentence naming the specific processors or technology nodes analyzed, to allow readers to gauge scope immediately.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address each major comment below with point-by-point responses and indicate planned revisions to strengthen the presentation.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim that on-chip memory systems have become significant consumers of area and energy (creating a new memory wall) rests on an analysis of energy and area efficiency, yet the abstract supplies no quantitative results, data tables, methods, or specific processor examples to substantiate the magnitude or generality of this dominance.
Authors: We agree that the abstract, as a concise summary, omits specific quantitative results and examples. The full manuscript contains detailed energy and area breakdowns, including comparisons across SRAM and STT-MRAM in representative processors, supported by figures and tables. To better ground the central claim, we will revise the abstract to incorporate key quantitative highlights (e.g., observed memory dominance percentages) and reference specific processor examples while preserving its brevity. revision: yes
-
Referee: [Abstract] Abstract: the generalization from the examined processors and memory technologies (SRAM, STT-MRAM) to the broader class of digital neuromorphic designs is asserted without evidence of representativeness, such as coverage statistics, systematic sampling across core counts/sparsity/technology nodes, or sensitivity checks. This step is load-bearing for the conclusion that the new memory wall will prevent effective competition in edge applications.
Authors: The analysis draws on prominent, well-documented digital neuromorphic processors from the recent literature that span varying core counts, memory technologies, and technology nodes to illustrate the emerging trend. A exhaustive statistical sampling is not feasible given the limited number of fabricated designs in this emerging field. We will add an explicit discussion of selection criteria and scope limitations (e.g., in the introduction or a dedicated subsection) to justify the generalization and acknowledge boundaries, thereby addressing the concern without overclaiming universality. revision: partial
Circularity Check
No significant circularity; purely observational critique with no derivations or self-referential modeling
full rationale
The paper contains no equations, fitted parameters, or derivation chains. Its central claim rests on an observational analysis of selected existing processors and memory technologies (SRAM, STT-MRAM) to argue that on-chip memory now dominates area/energy. This is a direct critique of published hardware data rather than any self-definitional loop, fitted-input prediction, or load-bearing self-citation that reduces the conclusion to its own inputs by construction. The argument is therefore self-contained against external benchmarks and receives the default non-circularity finding.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Open the box of digital neuromorphic processor: Towards effective algorithm-hardware co-design,
G. Tang, A. Safa, K. Shidqi, P. Detterer, S. Traferro, M. Konijnenburg, M. Sifalakis, G.-J. van Schaik, and A. Yousefzadeh, “Open the box of digital neuromorphic processor: Towards effective algorithm-hardware co-design,” in2023 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 2023, pp. 1–5
work page 2023
-
[2]
Hitting the memory wall: Implications of the obvious,
W. A. Wulf and S. A. McKee, “Hitting the memory wall: Implications of the obvious,”ACM SIGARCH computer architecture news, vol. 23, no. 1, pp. 20–24, 1995
work page 1995
-
[3]
Seneca: building a fully digital neuromorphic processor, design trade-offs and challenges,
G. Tang, K. Vadivel, Y . Xu, R. Bilgic, K. Shidqi, P. Detterer, S. Traferro, M. Konijnenburg, M. Sifalakis, G.-J. van Schaiket al., “Seneca: building a fully digital neuromorphic processor, design trade-offs and challenges,” Frontiers in Neuroscience, vol. 17, p. 1187252, 2023
work page 2023
-
[4]
Convolutional networks for fast, energy-efficient neuromorphic computing,
S. K. Esser, P. A. Merolla, J. V . Arthur, A. S. Cassidy, R. Appuswamy, A. Andreopoulos, D. J. Berg, J. L. McKinstry, T. Melano, D. R. Barch, C. di Nolfo, P. Datta, A. Amir, B. Taba, M. D. Flickner, and D. S. Modha, “Convolutional networks for fast, energy-efficient neuromorphic computing,”Proceedings of the National Academy of Sciences, vol. 113, no. 41,...
-
[5]
Nxtf: An api and compiler for deep spiking neural networks on intel loihi,
B. Rueckauer, C. Bybee, R. Goettsche, Y . Singh, J. Mishra, and A. Wild, “Nxtf: An api and compiler for deep spiking neural networks on intel loihi,”ACM Journal on Emerging Technologies in Computing Systems (JETC), vol. 18, no. 3, pp. 1–22, 2022
work page 2022
-
[6]
Arts: An adaptive regularization training schedule for activation sparsity exploration,
Z. Zhu, A. Pourtaherian, L. Waeijen, L. Bamberg, E. Bondarev, and O. Moreira, “Arts: An adaptive regularization training schedule for activation sparsity exploration,” in2022 25th Euromicro Conference on Digital System Design (DSD). IEEE, 2022, pp. 415–422
work page 2022
-
[7]
O. Richter, Y . Xing, M. De Marchi, C. Nielsen, M. Katsimpris, R. Catta- neo, Y . Ren, Y . Hu, Q. Liu, S. Sheiket al., “Speck: A smart event-based vision sensor with a low latency 327k neuron convolutional neuronal network processing pipeline,”arXiv preprint arXiv:2304.06793, 2023
-
[8]
C. Arjmand, Y . Xu, K. Shidqi, A. F. Dobrita, K. Vadivel, P. Detterer, M. Sifalakis, A. Yousefzadeh, and G. Tang, “Trip: Trainable region-of- interest prediction for hardware-efficient neuromorphic processing on event-based vision,” in2024 International Conference on Neuromorphic Systems (ICONS). IEEE, 2024, pp. 94–101
work page 2024
-
[9]
Sparse convolutional recurrent learning for efficient event-base neuromorphic object detection,
S. Wang andet al., “Sparse convolutional recurrent learning for efficient event-base neuromorphic object detection,” inIJCNN. IEEE, 2025
work page 2025
-
[10]
Y . Xu, K. Shidqi, G.-J. van Schaik, R. Bilgic, A. Dobrita, S. Wang, R. Meijer, P. Nembhani, C. Arjmand, P. Martinelloet al., “Optimizing event-based neural networks on digital neuromorphic architecture: a comprehensive design space exploration,”Frontiers in Neuroscience, vol. 18, p. 1335422, 2024
work page 2024
-
[11]
Non-uniform memory partitioning for low-power spiking neural networks,
S. Richter andet al., “Non-uniform memory partitioning for low-power spiking neural networks,” inAICAS. IEEE, 2025
work page 2025
-
[12]
S. H ¨oppner, Y . Yan, A. Dixius, S. Scholze, J. Partzsch, M. Stolba, F. Kelber, B. V ogginger, F. Neum ¨arker, G. Ellguthet al., “The spin- naker 2 processing element architecture for hybrid digital neuromorphic computing,”arXiv preprint arXiv:2103.08392, 2021
-
[13]
S. B. Furber, F. Galluppi, S. Temple, and L. A. Plana, “The spinnaker project,”Proceedings of the IEEE, vol. 102, no. 5, pp. 652–665, 2014
work page 2014
-
[14]
A. Yousefzadeh, M. Soto, T. Serrano-Gotarredona, F. Galluppi, L. Plana, S. Furber, and B. Linares-Barranco, “Performance comparison of time- step-driven versus event-driven neural state update approaches in spin- naker,” in2018 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 2018, pp. 1–4
work page 2018
-
[15]
Sensim: An event- driven parallel simulator for multi-core neuromorphic systems,
P. Nembhani, K. Vadivel, G. Tang, M. Tahghighi, G.-J. van Schaik, M. Sifalakis, Z. Al-Ars, and A. Yousefzadeh, “Sensim: An event- driven parallel simulator for multi-core neuromorphic systems,” in2024 International Joint Conference on Neural Networks (IJCNN). IEEE, 2024, pp. 1–6
work page 2024
-
[16]
Tutorial on memristor-based computing for smart edge applications,
A. Gebregiorgis, A. Singh, A. Yousefzadeh, D. Wouters, R. Bishnoi, F. Catthoor, and S. Hamdioui, “Tutorial on memristor-based computing for smart edge applications,”Memories-Materials, Devices, Circuits and Systems, vol. 4, p. 100025, 2023
work page 2023
-
[17]
Crossbar-constrained technology mapping for reram based in-memory computing,
D. Bhattacharjee, Y . Tavva, A. Easwaran, and A. Chattopadhyay, “Crossbar-constrained technology mapping for reram based in-memory computing,”IEEE Transactions on Computers, vol. 69, no. 5, pp. 734– 748, 2020
work page 2020
-
[18]
R. Khaddam-Aljameh, M. Stanisavljevic, J. F. Mas, G. Karunaratne, M. Br ¨andli, F. Liu, A. Singh, S. M. M ¨uller, U. Egger, A. Petropoulos et al., “Hermes-core—a 1.59-tops/mm 2 pcm on 14-nm cmos in-memory compute core using 300-ps/lsb linearized cco-based adcs,”IEEE Journal of Solid-State Circuits, vol. 57, no. 4, pp. 1027–1038, 2022
work page 2022
-
[19]
A critical assessment of dram-pim architectures-trends, challenges and solutions,
C. Sudarshan, M. H. Sadi, L. Steiner, C. Weis, and N. Wehn, “A critical assessment of dram-pim architectures-trends, challenges and solutions,” inInternational Conference on Embedded Computer Systems. Springer, 2022, pp. 362–379
work page 2022
-
[20]
Multidie 3-d stacking of memory dominated neuromorphic architec- tures,
L. M. G. Rocha, R. Bilgic, M. Naeim, S. Das, H. Oprins, A. Youse- fzadeh, M. Konijnenburg, D. Milojevic, J. Myers, J. Ryckaertet al., “Multidie 3-d stacking of memory dominated neuromorphic architec- tures,”IEEE Transactions on V ery Large Scale Integration (VLSI) Systems, 2024
work page 2024
-
[21]
A hybrid memristor/cmos snn for implementing one-shot winner-takes-all training,
J. Ahmadi-Farsani, S. Ricci, S. Hashemkhani, D. Ielmini, B. Linares- Barranco, and T. Serrano-Gotarredona, “A hybrid memristor/cmos snn for implementing one-shot winner-takes-all training,” in2022 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 2022, pp. 210–214
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.