Ascend HiFloat8 format for deep learning,

· 2024 · arXiv 2409.16626

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Boundary-Protection W8A8 HiFloat8 Quantization for Large-Scale Text-to-Video Diffusion Transformers

cs.CV · 2026-05-31 · unverdicted · novelty 5.0

A boundary-protection PTQ strategy for Wan2.1-T2V-14B matches BF16 VBench performance by retaining boundary blocks in higher precision and quantizing the rest to W8A8 HiF8.

Analysis of Floating-Point Matrix Multiplication Computed via Integer Arithmetic

math.NA · 2025-06-12 · unverdicted · novelty 5.0

Error analysis and cost estimator for recasting floating-point matrix multiplication as accumulated integer products on mixed-precision hardware.

GoldenFloat: A Phi-Derived Static-Split Floating-Point Family from GF4 to GF1024 with a Lucas-Exact Integer Identity

cs.AR · 2026-06-03 · unverdicted · novelty 4.0

GoldenFloat introduces a phi-derived rule for setting exponent and fraction widths across floating-point formats from 4 to 1024 bits, backed by open RTL generator, Lucas-exact accumulator, and FPGA implementation.

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

cs.CV · 2026-05-27 · unverdicted · novelty 4.0

OSP-Next reports 83.73% VBench score and up to 2.27x speedup via hybrid sparse attention, SSP parallelism, HiF8 quantization, and Mix-GRPO on diffusion transformers.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Boundary-Protection W8A8 HiFloat8 Quantization for Large-Scale Text-to-Video Diffusion Transformers cs.CV · 2026-05-31 · unverdicted · none · ref 8
A boundary-protection PTQ strategy for Wan2.1-T2V-14B matches BF16 VBench performance by retaining boundary blocks in higher precision and quantizing the rest to W8A8 HiF8.
Analysis of Floating-Point Matrix Multiplication Computed via Integer Arithmetic math.NA · 2025-06-12 · unverdicted · none · ref 28
Error analysis and cost estimator for recasting floating-point matrix multiplication as accumulated integer products on mixed-precision hardware.
GoldenFloat: A Phi-Derived Static-Split Floating-Point Family from GF4 to GF1024 with a Lucas-Exact Integer Identity cs.AR · 2026-06-03 · unverdicted · none · ref 20
GoldenFloat introduces a phi-derived rule for setting exponent and fraction widths across floating-point formats from 4 to 1024 bits, backed by open RTL generator, Lucas-exact accumulator, and FPGA implementation.
OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning cs.CV · 2026-05-27 · unverdicted · none · ref 18
OSP-Next reports 83.73% VBench score and up to 2.27x speedup via hybrid sparse attention, SSP parallelism, HiF8 quantization, and Mix-GRPO on diffusion transformers.

Ascend HiFloat8 format for deep learning,

fields

years

verdicts

representative citing papers

citing papers explorer