Wanted: Floating-Point Add Round-off Error instruction

Jason Riedy; Marat Dukhan; Richard Vuduc

arxiv: 1603.00491 · v1 · pith:LFI6MYW7new · submitted 2016-03-01 · 💻 cs.NA · cs.PF

keywords: instruction, double-double, addition, floating-point, intel, arithmetic, benefits, compensated

We propose a new instruction (FPADDRE) that computes the round-off error in floating-point addition. We explain how this instruction benefits high-precision arithmetic operations in applications where double precision is not sufficient. Performance estimates on Intel Haswell, Intel Skylake, and AMD Steamroller processors, as well as the Intel Knights Corner co-processor, demonstrate that such an instruction would improve the latency of double-double addition by up to 55% and increase double-double addition throughput by up to 103%, with smaller but non-negligible benefits for double-double multiplication. The new instruction delivers up to 2x speedups on three benchmarks that use high-precision floating-point arithmetic: double-double matrix-matrix multiplication, compensated dot product, and polynomial evaluation via the compensated Horner scheme.
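To illustrate what "the round-off error in floating-point addition" means, here is a minimal sketch in Python of the classic software technique (Knuth's TwoSum) that recovers this error with several ordinary floating-point operations. It is not taken from the paper. The function names `two_sum`, `add_roundoff_error`, and `compensated_sum` are hypothetical, and `add_roundoff_error` is only an assumed software stand-in for the value an FPADDRE-style instruction would return in a single operation.

```python
from fractions import Fraction
import math


def two_sum(a: float, b: float) -> tuple[float, float]:
    """Knuth's TwoSum: return (s, e) with s = fl(a + b) and s + e == a + b exactly.

    Holds for IEEE 754 binary64 with round-to-nearest, barring overflow.
    """
    s = a + b                       # rounded sum
    bb = s - a                      # the part of b that made it into s
    e = (a - (s - bb)) + (b - bb)   # what was lost to rounding
    return s, e


def add_roundoff_error(a: float, b: float) -> float:
    """Hypothetical stand-in for FPADDRE: only the round-off error of a + b."""
    return two_sum(a, b)[1]


def compensated_sum(values) -> float:
    """Sum values while accumulating the round-off errors separately."""
    s = 0.0
    c = 0.0                         # running total of the round-off errors
    for x in values:
        s, e = two_sum(s, x)
        c += e
    return s + c                    # fold the accumulated error back in


if __name__ == "__main__":
    s, e = two_sum(0.1, 0.2)
    # Exact rational arithmetic confirms that nothing was lost.
    print(Fraction(s) + Fraction(e) == Fraction(0.1) + Fraction(0.2))
    print(sum([1.0, 1e-16, 1e-16]), compensated_sum([1.0, 1e-16, 1e-16]))
```

In this sketch the error term costs five extra floating-point operations on top of the addition itself. That overhead is what a dedicated instruction would be expected to reduce in double-double addition and in compensated algorithms such as the dot product and Horner scheme mentioned in the abstract.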
