A dual-precision hybrid FP MAC PE using bit-partitioning to run FP8 or two FP4 operations on shared 4-bit hardware, claiming 60% area and 87% power savings in 28nm.
A 4.27 TFLOPS/W FP4/FP8 Hybrid- Precision Neural Network Training Processor Using Shift-Add MAC and Reconfigurable PE Array,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AR 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
DHFP-PE: Dual-Precision Hybrid Floating Point Processing Element for AI Acceleration
A dual-precision hybrid FP MAC PE using bit-partitioning to run FP8 or two FP4 operations on shared 4-bit hardware, claiming 60% area and 87% power savings in 28nm.