GPUscout: Locating data movement-related bottlenecks on GPUs

2023 · arXiv 4062.362420

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

LEO: Tracing GPU Stall Root Causes via Cross-Vendor Backward Slicing

cs.DC · 2026-04-21 · unverdicted · novelty 6.0

LEO performs cross-vendor backward slicing from stalled GPU instructions to attribute root causes to source code, enabling optimizations that produce geometric-mean speedups of 1.73-1.82x on 21 workloads.

Toward an Energy-Optimized Operation of Data Centers Located in Wind Farms Using Reinforcement Learning

cs.LG · 2026-06-29 · unverdicted · novelty 5.0

Reinforcement learning with imitation learning and reward shaping improves online workload shifting in a one-turbine one-data-center simulation but remains below an offline optimizer that sees the full day.

citing papers explorer

LEO: Tracing GPU Stall Root Causes via Cross-Vendor Backward Slicing · cs.DC · 2026-04-21 · unverdicted · none · ref 15
Toward an Energy-Optimized Operation of Data Centers Located in Wind Farms Using Reinforcement Learning · cs.LG · 2026-06-29 · unverdicted · none · ref 20
