23.8 An 88.36TOPS/W Bit-Level-Weight-Compressed Large-Language-Model Accelerator with Cluster-Aligned INT-FP-GEMM and Bi-Dimensional Workflow Reformulation

· 2025

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

CIMple: Standard-cell SRAM-based CIM with LUT-based split softmax for attention acceleration

cs.AR · 2026-04-17 · unverdicted · novelty 5.0

CIMple delivers a 32 kb digital SRAM-based compute-in-memory accelerator for transformer self-attention that reaches 26.1 TOPS/W at 0.85 V in 28 nm with INT8 precision using dual-banked architecture and LUT-based split softmax.

citing papers explorer

Showing 1 of 1 citing paper.

CIMple: Standard-cell SRAM-based CIM with LUT-based split softmax for attention acceleration cs.AR · 2026-04-17 · unverdicted · none · ref 26
