Xla : Compiling machine learning for peak performance,

· 2020

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

HGQ-LUT: Fast LUT-Aware Training and Efficient Architectures for DNN Inference

cs.AR · 2026-04-24 · unverdicted · novelty 6.0

HGQ-LUT delivers a practical LUT-aware training framework with new tensor-based layers, heterogeneous quantization, and a resource surrogate that automates accuracy-efficiency trade-offs for FPGA DNN inference.

Evaluating Cross-Architecture Performance Modeling of Distributed ML Workloads Using StableHLO

cs.DC · 2026-04-13 · unverdicted · novelty 4.0

StableHLO serves as a viable unified representation for cross-architecture performance modeling of distributed ML workloads, preserving relative trends while exposing fidelity trade-offs.

citing papers explorer

Showing 2 of 2 citing papers.

HGQ-LUT: Fast LUT-Aware Training and Efficient Architectures for DNN Inference cs.AR · 2026-04-24 · unverdicted · none · ref 16
HGQ-LUT delivers a practical LUT-aware training framework with new tensor-based layers, heterogeneous quantization, and a resource surrogate that automates accuracy-efficiency trade-offs for FPGA DNN inference.
Evaluating Cross-Architecture Performance Modeling of Distributed ML Workloads Using StableHLO cs.DC · 2026-04-13 · unverdicted · none · ref 11
StableHLO serves as a viable unified representation for cross-architecture performance modeling of distributed ML workloads, preserving relative trends while exposing fidelity trade-offs.

Xla : Compiling machine learning for peak performance,

fields

years

verdicts

representative citing papers

citing papers explorer