DARTH-PUM: A Hybrid Processing-Using-Memory Architecture
Pith reviewed 2026-05-15 21:22 UTC · model grok-4.3
The pith
A hybrid architecture merges analog matrix multiplies and digital Boolean operations inside memory arrays to run complete kernels without external CMOS support.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DARTH-PUM integrates analog PUM for bulk matrix-vector multiplications with digital PUM for Boolean operations through optimized peripheral circuitry, coordinating hardware that manages and interfaces both types, an easy-to-use programming interface, and low-cost support for flexible data widths, enabling practical general-purpose kernels to execute entirely in memory and scale from embedded to large-scale data-driven computing.
What carries the argument
The hybrid PUM architecture with coordinating hardware that switches between analog MVM electrical signals and digital Boolean operations while providing a uniform programming interface.
Load-bearing premise
The assumption that the proposed optimized peripheral circuitry, coordinating hardware, and programming interface can be integrated with memory arrays at low area and power cost while enabling efficient full-kernel mappings without hidden overheads that would erase the reported speedups.
What would settle it
A fabricated or cycle-accurate simulation of the full DARTH-PUM chip that measures total area, power, and end-to-end latency for AES, CNN, or LLM workloads and finds the overhead of the coordinating hardware erases most of the claimed speedup over the analog-plus-CPU baseline.
Figures
read the original abstract
Analog processing-using-memory (PUM; a.k.a. in-memory computing) makes use of electrical interactions inside memory arrays to perform bulk matrix-vector multiplication (MVM) operations. However, many popular matrix-based kernels need to execute non-MVM operations, which analog PUM cannot directly perform. To retain its energy efficiency, analog PUM architectures augment memory arrays with CMOS-based domain-specific fixed-function hardware to provide complete kernel functionality, but the difficulty of integrating such specialized CMOS logic with memory arrays has largely limited analog PUM to being an accelerator for machine learning inference, or for closely related kernels. An opportunity exists to harness analog PUM for general-purpose computation: recent works have shown that memory arrays can also perform Boolean PUM operations, albeit with very different supporting hardware and electrical signals than analog PUM. We propose DARTH-PUM, a general-purpose hybrid PUM architecture that tackles key hardware and software challenges to integrating analog PUM and digital PUM. We propose optimized peripheral circuitry, coordinating hardware to manage and interface between both types of PUM, an easy-to-use programming interface, and low-cost support for flexible data widths. These design elements allow us to build a practical PUM architecture that can execute kernels fully in memory, and can scale easily to cater to domains ranging from embedded applications to large-scale data-driven computing. We show how three popular applications (AES encryption, convolutional neural networks, large language models) can map to and benefit from DARTH-PUM, with speedups of 59.4x, 14.8x, and 40.8x over an analog+CPU baseline.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes DARTH-PUM, a hybrid processing-using-memory (PUM) architecture that integrates analog PUM for efficient matrix-vector multiplications with digital PUM for Boolean operations. It introduces optimized peripheral circuitry, coordinating hardware for analog-digital interfacing, an easy-to-use programming interface, and low-cost flexible data width support to enable full in-memory execution of general-purpose kernels. The authors map AES encryption, convolutional neural networks, and large language models to the architecture, claiming speedups of 59.4x, 14.8x, and 40.8x over an analog+CPU baseline.
Significance. If the integration overheads prove negligible and the speedups hold under detailed evaluation, this work would meaningfully advance general-purpose in-memory computing by extending analog PUM beyond ML inference accelerators to broader domains including cryptography and large models. The hybrid design directly targets a recognized limitation in current PUM systems.
major comments (2)
- [Abstract] Abstract: The performance numbers (59.4x for AES, 14.8x for CNNs, 40.8x for LLMs) are stated without any description of the evaluation methodology, simulation framework, area/power overhead analysis, or verification of the proposed hardware elements. This leaves the central claims of speedup and scalability unsupported by visible evidence.
- [Abstract] Abstract: The assumption that the coordinating hardware, optimized peripherals, and programming interface add negligible area, power, and latency costs while enabling complete kernel mappings is not quantitatively validated (e.g., no SPICE-level timing, full-system simulation, or breakdown isolating hybrid interface overheads). This is load-bearing for the scalability claims across embedded to large workloads.
minor comments (1)
- [Abstract] Abstract: The acronym expansion 'processing-using-memory (PUM; a.k.a. in-memory computing)' is helpful on first use, but ensure consistent terminology and expansion in all subsequent sections of the full manuscript.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and for recognizing the potential of DARTH-PUM to extend analog PUM beyond ML accelerators. We address the two major comments below. Both points concern the abstract's self-contained presentation of evaluation details; the full manuscript already contains the requested methodology, simulations, and overhead breakdowns. We will revise the abstract to incorporate high-level references to these elements while preserving its brevity.
read point-by-point responses
-
Referee: [Abstract] Abstract: The performance numbers (59.4x for AES, 14.8x for CNNs, 40.8x for LLMs) are stated without any description of the evaluation methodology, simulation framework, area/power overhead analysis, or verification of the proposed hardware elements. This leaves the central claims of speedup and scalability unsupported by visible evidence.
Authors: We agree the abstract should briefly indicate the evaluation approach to support the claims at a glance. The full manuscript describes the hybrid simulation framework (SPICE-level modeling of analog and Boolean PUM arrays combined with cycle-accurate RTL simulation of the coordinating hardware and peripherals) in Sections 4–5, with area/power breakdowns and verification results in Section 6. These show the hybrid interface overhead remains below 5% while enabling the reported speedups. We will revise the abstract to add one sentence summarizing the evaluation methodology and directing readers to the detailed analysis. revision: yes
-
Referee: [Abstract] Abstract: The assumption that the coordinating hardware, optimized peripherals, and programming interface add negligible area, power, and latency costs while enabling complete kernel mappings is not quantitatively validated (e.g., no SPICE-level timing, full-system simulation, or breakdown isolating hybrid interface overheads). This is load-bearing for the scalability claims across embedded to large workloads.
Authors: The manuscript already provides the requested quantitative validation: Section 6 presents SPICE-derived timing and power results for the coordinating hardware and peripherals, a full-system simulation breakdown isolating the hybrid interface (showing <3% area and <4% power overhead), and end-to-end latency numbers for the three kernels. These confirm the costs are negligible relative to the gains. We will revise the abstract to explicitly reference this overhead analysis and the simulation framework, ensuring the scalability claims are visibly supported without expanding the abstract length substantially. revision: yes
Circularity Check
No circularity in derivation chain
full rationale
The paper is an architecture proposal that introduces DARTH-PUM as a hybrid analog/digital PUM design with new peripheral circuitry, coordinating hardware, and programming interface. Performance numbers (59.4x, 14.8x, 40.8x) are obtained by mapping three applications to the proposed hardware and comparing against an analog+CPU baseline; no equations, fitted parameters, self-referential predictions, or load-bearing self-citations appear that would reduce any claimed result to its own inputs by construction. The central claims rest on the novelty of the design elements themselves rather than any derivation that collapses to prior fitted values or self-citation chains.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Memory arrays can support both analog matrix-vector multiplication and Boolean operations when provided with appropriate but distinct peripheral hardware and signals.
invented entities (2)
-
Coordinating hardware for analog-digital PUM interface
no independent evidence
-
Low-cost flexible data width support circuitry
no independent evidence
Reference graph
Works this paper leans on
-
[1]
S. Aga, S. Jeloka, A. Subramaniyan, S. Narayanasamy, D. Blaauw, and R. Das. 2017. Compute Caches. InHPCA
work page 2017
-
[2]
V. Agrawal, T. P. Xiao, C. H. Bennett, B. Feinberg, S. Shetty, K. Ramkumar, H. Medu, K. Thekkekara, R. Chettuvetty, S. Leshner, Z. Luzada, L. Hinh, T. Phan, M. J. Marinella, and S. Agarwal. 2022. Subthreshold Operation of SONOS Analog Memory to Enable Accurate Low-Power Neural Network Inference. InIEDM
work page 2022
-
[3]
T. Andrulis, J. S. Emer, and V. Sze. 2023. RAELLA: Reforming the Arithmetic for Efficient, Low-Resolution, and Low-Loss Analog PIM: No Retraining Required!. InISCA
work page 2023
- [4]
- [5]
- [6]
-
[7]
R. Anil, S. Borgeaud, J.-B. Alayrac, J. Yu, R. Soricut, J. Schalkwyk, A. M. Dai, K. Haugh, A. Millican, D. Silver, M. Johnson, I. Antonoglou, J. Schrittwieser, A. Glaese, J. Chen, E. Pitler, T. Lillicrap, A. Lazaridou, O. Firat, J. Molloy, M. Isard, P. R. Barham, T. Hennigan, B. Lee, F. Viola, M. Reynolds, Y. Xu, R. Doherty, E. Collins, C. Meyer, E. Ruthe...
work page internal anchor Pith review Pith/arXiv arXiv 2023
- [8]
-
[9]
ARCANA Research Group at Univ. of Illinois Urbana- Champaign. 2024. MASTODON — GitHub Repository. https://github.com/ARCANA-Research/MASTODON/
work page 2024
-
[10]
C. H. Bennett, T. P. Xiao, R. Dellana, B. Feinberg, S. Agarwal, M. J. Marinella, V. Agrawal, V. Prabhakar, K. Ramkumar, L. Hinh, S. Saha, V. Raghavan, and R. Chettuvetty. 2020. Device-Aware Inference Operations in SONOS Non-Volatile Memory Arrays. InIRPS
work page 2020
-
[11]
A. Bhattacharjee, A. Moitra, and P. Panda. 2023. HyDe: A Hy- brid PCM/FeFET/SRAM Device-Search for Optimizing Area and Energy-Efficiencies in Analog IMC Platforms.JETCAS(Oct. 2023)
work page 2023
-
[12]
M. N. Bojnordi and E. Ipek. 2016. Memristive Boltzmann Machine: A Hardware Accelerator for Combinatorial Optimization and Deep Learning. InHPCA
work page 2016
-
[13]
A. Boroumand, S. Ghose, Y. Kim, R. Ausavarungnirun, E. Shiu, R. Thakur, D. Kim, A. Kuusela, A. Knies, P. Ranganathan, and O. Mutlu
- [14]
-
[15]
L. Carlitz. 1932. The Arithmetic of Polynomials in a Galois Field.Am J. Math.(Jan. 1932)
work page 1932
-
[16]
M. Cassinerio, N. Ciocchini, and D. Ielmini. 2013. Logic Computation in Phase Change Materials by Threshold and Memory Switching. Advanced Materials(Aug. 2013)
work page 2013
-
[17]
J. Chen, C. Gao, Y. Lu, Y. Zhang, and J. Shu. 2024. Ares-Flash: Efficient Parallel Integer Arithmetic Operations Using NAND Flash Memory. InMICRO
work page 2024
-
[18]
P. Chen, M. Wu, Y. Ma, L. Ye, and R. Huang. 2023. RIMAC: An Array- level ADC/DAC-Free ReRAM-Based In-Memory DNN Processor With Analog Cache and Computation. InASPDAC
work page 2023
-
[19]
X.-J. Chen, H.-P. Chen, and C.-L. Yang. 2024. PointCIM: A Computing- in-Memory Architecture for Accelerating Deep Point Cloud Analytics. InMICRO
work page 2024
-
[20]
Y.-C. Chen, S. Ando, D. Fujiki, S. Takamaeda-Yamazaki, and K. Yoshioka. 2024. OSA-HCIM: On-the-Fly Saliency-Aware Hybrid SRAM CIM With Dynamic Precision Configuration. InASPDAC
work page 2024
-
[21]
P. Chi, S. Li, C. Xu, T. Zhang, J. Zhao, Y. Liu, Y. Wang, and Y. Xie. 2016. PRIME: A Novel Processing-in-Memory Architecture for Neural Network Computation in ReRAM-Based Main Memory. InISCA
work page 2016
-
[22]
T. Chou, W. Tang, J. Botimer, and Z. Zhang. 2019. CASCADE: Connecting RRAMs to Extend Analog Dataflow in an End-to-End In-Memory Processing Paradigm. InMICRO
work page 2019
-
[23]
B. Dally. 2015. Challenges for Future Computing Systems. Keynote talk at HiPEAC
work page 2015
-
[24]
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova. 2019. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. arXiv:1810.04805 [cs.CL]
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[25]
Domo, Inc. 2023. Data Never Sleeps 11.0.https://www.domo.com/ learn/infographic/data-never-sleeps-11
work page 2023
- [26]
-
[27]
B. Feinberg, U. K. R. Vengalam, N. Whitehair, S. Wang, and E. Ipek. 2018. Enabling Scientific Computing on Memristive Accelerators. InISCA
work page 2018
-
[28]
B. Feinberg, S. Wang, and E. Ipek. 2018. Making Memristive Neural Network Accelerators Reliable. InHPCA
work page 2018
-
[29]
B. Feinberg, R. Wong, T. P. Xiao, C. H Bennett, J. N. Rohan, E. G. Boman, M. J. Marinella, S. Agarwal, and E. Ipek. 2021. An Analog Preconditioner for Solving Linear Systems. InHPCA
work page 2021
-
[30]
B. Feinberg, T. P. Xiao, C. J. Brinker, C. H. Bennett, M. J. Marinella, and S. Agarwal. 2025. CrossSim: Accuracy Simulation of Analog In-Memory Computing.https://github.com/sandialabs/cross-sim/
work page 2025
- [31]
-
[32]
C. Gao, X. Xin, Y. Lu, Y. Zhang, J. Yang, and J. Shu. 2021. ParaBit: Processing Parallel Bitwise Operations in NAND Flash Memory Based SSDs. InMICRO
work page 2021
-
[33]
F. Gao, G. Tziantzioulis, and D. Wentzlaff. 2019. ComputeDRAM: In-Memory Compute Using Off-the-Shelf DRAMs. InMICRO
work page 2019
-
[34]
F. Gao, G. Tziantzioulis, and D. Wentzlaff. 2022. FracDRAM: Fractional Values in Off-the-Shelf DRAM. InMICRO
work page 2022
-
[35]
Q. Guo, X. Guo, Y. Bai, and E. İpek. 2011. A Resistive TCAM Accelerator for Data-Intensive Computing. InMICRO
work page 2011
-
[36]
Q. Guo, X. Guo, R. Patel, E. Ipek, and E. G Friedman. 2013. AC-DIMM: Associative Computing With STT-MRAM. InISCA
work page 2013
-
[37]
X. Guo, F. Merrikh Bayat, M. Bavandpour, M. Klachko, M. R. Mahmoodi, M. Prezioso, K. K. Likharev, and D. B. Strukov. 2017. Fast, Energy- Efficient, Robust, and Reproducible Mixed-Signal Neuromorphic Clas- sifier Based on Embedded NOR Flash Memory Technology. InIEDM
work page 2017
- [38]
-
[39]
N. Hajinazar, G. F. Oliveira, S. Gregorio, J. D. Ferreira, N. M. Ghiasi, M. Patel, M. Alser, S. Ghose, J. Gomez-Luna, and O. Mutlu. 2021. SIMDRAM: A Framework for Bit-Serial SIMD Processing Using DRAM. InASPLOS
work page 2021
-
[40]
K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep Residual Learning for Image Recognition. InCVPR
work page 2016
-
[41]
M. He, C. Song, I. Kim, C. Jeong, S. Kim, I. Park, M. Thottethodi, and T. N. Vijaykumar. 2020. Newton: A DRAM-Maker’s Accelerator-in-Memory (AiM) Architecture for Machine Learning. InMICRO
work page 2020
- [42]
-
[43]
Y. Hou, Z. Liu, G. Gagnon, H. Tsai, K. El Maghraoui, G. W. Burr, and L. Liu. 2025. SAGE: Saliency-Aware Grouping for Efficient Mapping of LLMs on Analog Compute-in-Memory. InICCAD
work page 2025
-
[44]
Y. Hou, H. Tsai, K. El Maghraoui, T. Gokmen, G. W. Burr, and L. Liu. 2025. NORA: Noise-Optimized Rescaling of LLMs on Analog Compute-in-Memory Accelerators. InDATE
work page 2025
-
[45]
H.-W. Hu, W.-C. Wang, Y.-H. Chang, Y.-C. Lee, B.-R. Lin, H.-M. Wang, Y.-P. Lin, Y.-M. Huang, C.-Y. Lee, T.-H. Su, C.-C. Hsieh, C.-M. Hu, Y.-T. Lai, C.-K. Chen, H.-S. Chen, H.-P. Li, T.-W. Kuo, M.-F. Chang, K.-C. Wang, C.-H. Hung, and C.-Y. Lu. 2022. ICE: An Intelligent Cognition Engine With 3D NAND-Based In-Memory Computing for Vector Similarity Search Ac...
work page 2022
-
[46]
M. Hu, J. P. Strachan, Z. Li, E. M. Grafals, N. Davila, C. Graves, S. Lam, N. Ge, J. J. Yang, and R. S. Williams. 2016. Dot-Product Engine for Neuromorphic Computing: Programming 1T1M Crossbar to Accelerate Matrix-Vector Multiplication. InDAC
work page 2016
-
[47]
M. Hu, J. P. Strachan, Z. Li, and R. S. Williams. 2016. Dot-Product Engine as Computing Memory to Accelerate Machine Learning Algorithms. InISQED
work page 2016
-
[48]
S. Huang, A. Ankit, P. Silveira, R. Antunes, S. R. Chalamalasetti, I. El Hajj, D. E. Kim, G. Aguiar, P. Bruel, S. Serebryakov, C. Xu, C. Li, P. Faraboschi, J. P. Strachan, D. Chen, K. Roy, W.-m. Hwu, and D. Milojicic. 2021. Mixed Precision Quantization for ReRAM-Based DNN Inference Accelerators. InASPDAC
work page 2021
-
[49]
B. Hyun, T. Kim, D. Lee, and M. Rhu. 2024. Pathfinding Future PIM Ar- chitectures by Demystifying a Commercial PIM Technology. InHPCA
work page 2024
-
[50]
J.-F. Im, K. Gopalakrishna, S. Subramaniam, M. Shrivastava, A. Tumbde, X. Jiang, J. Dai, S. Lee, N. Pawar, J. Li, and R. Aringunram
- [51]
-
[52]
Intel Corp. 2023. Intel®Core™i7-13700 Processor.https://www.intel. com/content/www/us/en/products/sku/230490/intel-core-i713700- processor-30m-cache-up-to-5-20-ghz/specifications.html
work page 2023
-
[53]
Z. Jahshan and L. Yavits. 2024. MajorK: Majority Based kmer Matching in Commodity DRAM.CAL(Apr. 2024)
work page 2024
-
[54]
Y. Ji, Y. Zhang, X. Xie, S. Li, P. Wang, X. Hu, Y. Zhang, and Y. Xie. 2019. FPSA: A Full System Stack Solution for Reconfigurable ReRAM-Based NN Accelerator Architecture. InASPLOS
work page 2019
- [55]
-
[56]
H. Jin, C. Liu, H. Liu, R. Luo, J. Xu, F. Mao, and X. Liao. 2022. ReHy: A ReRAM-Based Digital/Analog Hybrid PIM Architecture for Accelerating CNN Training.TPDS(Nov. 2022)
work page 2022
- [57]
-
[58]
Accurate Deep Neural Network Inference Using Computational Phase-Change Memory.Nat. Commun.11 (May 2020)
work page 2020
-
[59]
L. Ke, U. Gupta, B. Y. Cho, D. Brooks, V. Chandra, U. Diril, A. Firoozshahian, K. Hazelwood, B. Jia, H.-H. S. Lee, M. Li, B. Maher, D. Mudigere, M. Naumov, M. Schatz, M. Smelyanskiy, X. Wang, B. Reagen, C.-J. Wu, M. Hempstead, and X. Zhang. 2020. RecNMP: Accelerating Personalized Recommendation With Near-Memory Processing. InISCA
work page 2020
-
[60]
L. Ke, X. Zhang, J. So, J.-G. Lee, S.-H. Kang, S. Lee, S. Han, Y. Cho, J. H. Kim, Y. Kwon, K. Kim, J. Jung, I. Yun, S. J. Park, H. Park, J. Song, J. Cho, K. Sohn, N. S. Kim, and H.-H. S. Lee. 2022. Near-Memory Processing in Action: Accelerating Personalized Recommendation With AxDIMM. IEEE Micro(Jul. 2022)
work page 2022
- [61]
-
[62]
R. Khaddam-Aljameh, M. Stanisavljevic, J. Fornt Mas, G. Karunaratne, M. Braendli, F. Liu, A. Singh, S. M. Müller, U. Egger, A. Petropoulos, T. Antonakopoulos, K. Brew, S. Choi, I. Ok, F. L. Lie, N. Saulnier, V. Chan, I. Ahsan, V. Narayanan, S. R. Nandakumar, M. Le Gallo, P. A. Francese, A. Sebastian, and E. Eleftheriou. 2021. HERMES Core–a 14nm CMOS and P...
work page 2021
-
[63]
R. Khaddam-Aljameh, M. Stanisavljevic, J. F. Mas, G. Karunaratne, M. Brändli, F. Liu, A. Singh, S. M. Müller, U. Egger, A. Petropoulos, T. Antonakopoulos, K. Brew, S. Choi, I. Ok, F. L. Lie, N. Saulnier, V. Chan, I. Ahsan, V. Narayanan, S. R. Nandakumar, M. Le Gallo, P. A. Francese, A. Sebastian, and E. Eleftheriou. 2022. HERMES-Core—A 1.59-TOPS/mm2 PCM o...
work page 2022
- [64]
-
[65]
Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing. InHPCA
-
[66]
H. Kim, S. Song, S. Choi, J. Choe, S. Han, J. Park, J. Lee, and J.-J. Kim
-
[67]
CrossBit: Bitwise Computing in NAND Flash Memory With Inter-Bitline Data Communication. InMICRO
-
[68]
J. H. Kim, S.-H. Kang, S. Lee, H. Kim, Y. Ro, S. Lee, D. Wang, J. Choi, J. So, Y. Cho, J. Song, J. Cho, K. Sohn, and N. S. Kim. 2022. Aquabolt-XL HBM2-PIM, LPDDR5-PIM With In-Memory Processing, and AXDIMM With Acceleration Buffer.IEEE Micro(May 2022)
work page 2022
-
[69]
J. H. Kim, S.-H. Kang, S. Lee, H. Kim, W. Song, Y. Ro, S. Lee, D. Wang, H. Shin, B. Phuah, J. Choi, J. So, Y. Cho, J. Song, J. Choi, J. Cho, K. Sohn, Y. Sohn, K. Park, and N. S. Kim. 2021. Aquabolt-XL: Samsung HBM2-PIM With In-Memory Processing for ML Accelerators and Beyond. InHCS
work page 2021
-
[70]
S. Kim, A. Gholami, Z. Yao, M. W. Mahoney, and K. Keutzer. 2021. I-BERT: Integer-only BERT Quantization. InPMLR
work page 2021
-
[71]
S. Kim, S. Kim, S. Um, S. Kim, K. Kim, and H.-J. Yoo. 2023. Neuro-CIM: ADC-Less Neuromorphic Computing-in-Memory Processor With Operation Gating/Stopping and Digital–Analog Networks.JSSC (May 2023)
work page 2023
-
[72]
Y. Kim, H. Kim, and J.-J. Kim. 2022. Extreme Partial-Sum Quantization for Analog Computing-In-Memory Neural Network Accelerators. JETC(Oct. 2022)
work page 2022
-
[73]
G. Krishnan, Z. Wang, I. Yeo, L. Yang, J. Meng, M. Liehr, R. V. Joshi, N. C. Cady, D. Fan, J.-S. Seo, and Y. Cao. 2022. Hybrid RRAM/SRAM in-Memory Computing for Robust DNN Acceleration.IEEE TCAD (Aug. 2022)
work page 2022
-
[74]
A. Krizhevsky. 2009.Learning Multiple Layers of Features From Tiny Images. Technical Report. Univ. of Toronto
work page 2009
-
[75]
L. Kull, T. Toifl, M. Schmatz, P. A. Francese, C. Menolfi, M. Brändli, M. Kossel, T. Morf, T. M. Andersen, and Y. Leblebici. 2013. A 3.1 mW 8b 1.2 GS/s Single-Channel Asynchronous SAR ADC With Alternate Comparators for Enhanced Speed in 32 nm Digital SOI CMOS.JSSC (Sep. 2013)
work page 2013
-
[76]
S. Kvatinsky, D. Belousov, S. Liman, G. Satat, N. Wald, E. G. Friedman, A. Kolodny, and U. C. Weiser. 2014. MAGIC: Memristor-Aided Logic. TCAS II(Sep. 2014)
work page 2014
-
[77]
S. Kvatinsky, A. Kolodny, U. C. Weiser, and E. G. Friedman. 2011. Memristor-Based IMPLY Logic design Procedure. InICCD
work page 2011
-
[78]
S. Kvatinsky, G. Satat, N. Wald, E. G. Friedman, A. Kolodny, and U. C. Weiser. 2014. Memristor-Based Material Implication (IMPLY) Logic: Design Principles and Methodologies.TVLSI(2014). 17 ASPLOS ’26, March 22–26, 2026, Pittsburgh, PA, USA Ryan Wong, Ben Feinberg, & Saugata Ghose
work page 2014
-
[79]
C. Lammie, Y. Wang, F. Ponzina, J. Klein, H. Benmeziane, M. Zapater, I. Boybat, A. Sebastian, G. Ansaloni, and D. Atienza. 2025. LionHeart: A Layer-Based Mapping Framework for Heterogeneous Systems With Analog In-Memory Computing Tiles.IEEE Trans. Emerg. Top. Comput. (Mar. 2025)
work page 2025
-
[80]
D. Lee, B. Hyun, T. Kim, and M. Rhu. 2024. Analysis of Data Transfer Bottlenecks in Commercial PIM Systems: A Study With UPMEM-PIM. CAL(Apr. 2024)
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.