A benchmark suite for improving performance portability of the SYCL programming model

Zheming Jin, Jeffrey S · 2023

4 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1 · baseline 1 · dataset 1

citation-polarity summary

background 1 · baseline 1 · use dataset 1

representative citing papers

Incisor: Ex Ante Cloud Instance Selection for HPC Jobs

cs.DC · 2026-04-27 · unverdicted · novelty 7.0

Incisor uses program analysis and frontier LLMs to select working AWS EC2 instances ex ante for 100% of first-time HPC runs of C/C++/Fortran and Python codes, cutting runtime 54% and costs 44% versus an expert-constrained SkyPilot baseline.

CuLifter: Lifting GPU Binaries to Typed IR

cs.AR · 2026-04-30 · unverdicted · novelty 6.0

CuLifter recovers types from untyped GPU register files via constraint propagation to lift 99.98% of 24,437 functions across 919 cubins to valid LLVM IR.

Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization

cs.PF · 2026-04-26 · unverdicted · novelty 6.0

Optimas deploys a multi-agent LLM workflow to convert performance diagnostics into correct code transformations, delivering 100% valid code and performance gains in 98.82% of 3,410 experiments across benchmarks and HPC applications.

Maximizing Memory-Level Parallelism via Integrated Stochastic Logic-in-Memory Architectures

cs.ET · 2026-04-25 · unverdicted · novelty 6.0

An MTJ-based logic-in-memory design performs fully parallel stochastic bit-stream generation and arithmetic without external random number generators by exploiting device stochasticity.

citing papers explorer

Showing 4 of 4 citing papers.

Incisor: Ex Ante Cloud Instance Selection for HPC Jobs cs.DC · 2026-04-27 · unverdicted · none · ref 54
CuLifter: Lifting GPU Binaries to Typed IR cs.AR · 2026-04-30 · unverdicted · none · ref 33
Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization cs.PF · 2026-04-26 · unverdicted · none · ref 23
Maximizing Memory-Level Parallelism via Integrated Stochastic Logic-in-Memory Architectures cs.ET · 2026-04-25 · unverdicted · none · ref 18
