An adaptation of the Ozaki-II scheme allows DGEMM emulation on FP8 MMA units with significantly reduced computational cost compared to FP8-based Ozaki-I.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
verdicts
UNVERDICTED 2representative citing papers
Error analysis and cost estimator for recasting floating-point matrix multiplication as accumulated integer products on mixed-precision hardware.
citing papers explorer
-
Double-Precision Matrix Multiplication Emulation via Ozaki-II Scheme with FP8 Quantization
An adaptation of the Ozaki-II scheme allows DGEMM emulation on FP8 MMA units with significantly reduced computational cost compared to FP8-based Ozaki-I.
-
Analysis of Floating-Point Matrix Multiplication Computed via Integer Arithmetic
Error analysis and cost estimator for recasting floating-point matrix multiplication as accumulated integer products on mixed-precision hardware.