hub

Mlir: Scaling compiler infrastructure for domain specific computation

Ajay Brahmakshatriya, Yunming Zhang, Changwan Hong, Shoaib Kamil, Julian Shun, Saman Amarasinghe · 2021 · arXiv 1591.2021

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

read on arXiv browse 13 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 4

citation-polarity summary

background 4

representative citing papers

LLM Translation of Compiler Intermediate Representation

cs.PL · 2026-05-07 · unverdicted · novelty 8.0

IRIS-14B is the first LLM trained explicitly for GIMPLE-to-LLVM IR translation and outperforms much larger models by up to 44 percentage points on real-world C code.

Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids

cs.PL · 2026-05-14 · unverdicted · novelty 7.0

Mat2Boundary treats boundary conditions as sparse matrix-vector products and uses multi-stage compilation with polyhedral analysis to generate efficient matrix-free kernels and communication schedules for distributed block-structured PDE solvers.

Demonstrating a Future for MLIR-native DSL Compilers on a NumPy-like Example

cs.PL · 2026-04-21 · unverdicted · novelty 7.0

An MLIR-native NumPy-like DSL with a new dialect-agnostic type checker and parallel-first lowering to a dataflow dialect, shown on weather modeling and CFD workloads in Fortran.

SSA without Dominance for Higher-Order Programs

cs.PL · 2026-04-10 · unverdicted · novelty 7.0

Free-variable sets and a nesting tree can replace dominance relations in SSA for higher-order programs, improving precision without requiring explicit control-flow graphs.

Optimism in Equality Saturation

cs.PL · 2025-11-25 · unverdicted · novelty 7.0

A new abstract interpretation algorithm enables sound optimistic analysis of e-graphs during equality saturation, unifying it with non-destructive rewriting and improving precision on cyclic SSA programs.

LEO: Tracing GPU Stall Root Causes via Cross-Vendor Backward Slicing

cs.DC · 2026-04-21 · unverdicted · novelty 6.0

LEO performs cross-vendor backward slicing from stalled GPU instructions to attribute root causes to source code, enabling optimizations that produce geometric-mean speedups of 1.73-1.82x on 21 workloads.

EquivFusion: Unifying Hardware Equivalence Checking from Algorithms to Netlists via MLIR

cs.AR · 2026-04-17 · unverdicted · novelty 6.0

EquivFusion unifies equivalence checking across hardware design levels by lowering PyTorch, C/C++, Chisel, Verilog, and netlists via MLIR into SMT-LIB, BTOR2, and AIGER formats.

KEET: Explaining Performance of GPU Kernels Using LLM Agents

cs.PF · 2026-05-06 · unverdicted · novelty 5.0

KEET uses LLM agents to generate data-grounded natural language explanations of performance issues in GPU kernels from Nsight Compute profiles and shows these improve downstream LLM-based optimization tasks.

Aquas: Enhancing Domain Specialization through Holistic Hardware-Software Co-Optimization based on MLIR

cs.AR · 2025-11-27 · unverdicted · novelty 5.0

Aquas delivers a holistic hardware-software co-optimization framework on MLIR that models memory interfaces with cache effects and uses an e-graph retargetable compiler, achieving up to 15.61x speedup with 14.5% area overhead across four domains.

Analysis of Floating-Point Matrix Multiplication Computed via Integer Arithmetic

math.NA · 2025-06-12 · unverdicted · novelty 5.0

Error analysis and cost estimator for recasting floating-point matrix multiplication as accumulated integer products on mixed-precision hardware.

AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels

cs.PL · 2026-04-06 · unverdicted · novelty 4.0

AutoLALA automatically generates symbolic formulas for reuse distance and data movement complexity in affine loop programs using polyhedral lowering and Barvinok counting.

SkCC: Portable and Secure Skill Compilation for Cross-Framework LLM Agents

cs.CR · 2026-05-05

The EDGE Language: Extended General Einsums for Graph Algorithms

cs.DS · 2024-04-17

citing papers explorer

Showing 13 of 13 citing papers.

LLM Translation of Compiler Intermediate Representation cs.PL · 2026-05-07 · unverdicted · none · ref 19
IRIS-14B is the first LLM trained explicitly for GIMPLE-to-LLVM IR translation and outperforms much larger models by up to 44 percentage points on real-world C code.
Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids cs.PL · 2026-05-14 · unverdicted · none · ref 33
Mat2Boundary treats boundary conditions as sparse matrix-vector products and uses multi-stage compilation with polyhedral analysis to generate efficient matrix-free kernels and communication schedules for distributed block-structured PDE solvers.
Demonstrating a Future for MLIR-native DSL Compilers on a NumPy-like Example cs.PL · 2026-04-21 · unverdicted · none · ref 14
An MLIR-native NumPy-like DSL with a new dialect-agnostic type checker and parallel-first lowering to a dataflow dialect, shown on weather modeling and CFD workloads in Fortran.
SSA without Dominance for Higher-Order Programs cs.PL · 2026-04-10 · unverdicted · none · ref 21
Free-variable sets and a nesting tree can replace dominance relations in SSA for higher-order programs, improving precision without requiring explicit control-flow graphs.
Optimism in Equality Saturation cs.PL · 2025-11-25 · unverdicted · none · ref 23
A new abstract interpretation algorithm enables sound optimistic analysis of e-graphs during equality saturation, unifying it with non-destructive rewriting and improving precision on cyclic SSA programs.
LEO: Tracing GPU Stall Root Causes via Cross-Vendor Backward Slicing cs.DC · 2026-04-21 · unverdicted · none · ref 6
LEO performs cross-vendor backward slicing from stalled GPU instructions to attribute root causes to source code, enabling optimizations that produce geometric-mean speedups of 1.73-1.82x on 21 workloads.
EquivFusion: Unifying Hardware Equivalence Checking from Algorithms to Netlists via MLIR cs.AR · 2026-04-17 · unverdicted · none · ref 23
EquivFusion unifies equivalence checking across hardware design levels by lowering PyTorch, C/C++, Chisel, Verilog, and netlists via MLIR into SMT-LIB, BTOR2, and AIGER formats.
KEET: Explaining Performance of GPU Kernels Using LLM Agents cs.PF · 2026-05-06 · unverdicted · none · ref 3
KEET uses LLM agents to generate data-grounded natural language explanations of performance issues in GPU kernels from Nsight Compute profiles and shows these improve downstream LLM-based optimization tasks.
Aquas: Enhancing Domain Specialization through Holistic Hardware-Software Co-Optimization based on MLIR cs.AR · 2025-11-27 · unverdicted · none · ref 12
Aquas delivers a holistic hardware-software co-optimization framework on MLIR that models memory interfaces with cache effects and uses an e-graph retargetable compiler, achieving up to 15.61x speedup with 14.5% area overhead across four domains.
Analysis of Floating-Point Matrix Multiplication Computed via Integer Arithmetic math.NA · 2025-06-12 · unverdicted · none · ref 26
Error analysis and cost estimator for recasting floating-point matrix multiplication as accumulated integer products on mixed-precision hardware.
AutoLALA: Automatic Loop Algebraic Locality Analysis for AI and HPC Kernels cs.PL · 2026-04-06 · unverdicted · none · ref 12
AutoLALA automatically generates symbolic formulas for reuse distance and data movement complexity in affine loop programs using polyhedral lowering and Barvinok counting.
SkCC: Portable and Secure Skill Compilation for Cross-Framework LLM Agents cs.CR · 2026-05-05 · unreviewed · ref 25
The EDGE Language: Extended General Einsums for Graph Algorithms cs.DS · 2024-04-17 · unreviewed · ref 15

Mlir: Scaling compiler infrastructure for domain specific computation

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer