A 28-nm 64-kb 31.6-TFLOPS/W Digital-Domain Floating-Point-Computing-Unit and Double-Bit 6T-SRAM Computing-in-Memory Macro for Floating-Point CNNs

2024

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

Balancing FP8 Computation Accuracy and Efficiency on Digital CIM via Shift-Aware On-the-fly Aligned-Mantissa Bitwidth Prediction

cs.AR · 2026-02-05 · unverdicted · novelty 5.0

A 28nm digital CIM accelerator for FP8 uses on-the-fly shift-aware bitwidth prediction, FIFO alignment, and scalable MACs to reach 20.4 TFLOPS/W and 2.8x better efficiency than prior work while supporting variable mantissa widths.

citing papers explorer

Balancing FP8 Computation Accuracy and Efficiency on Digital CIM via Shift-Aware On-the-fly Aligned-Mantissa Bitwidth Prediction

cs.AR · 2026-02-05 · unverdicted · none · ref 8
