SET is a new CUDA runtime framework that combines event-chaining, work-stealing, and per-stream buffers in graph-based pipelines to deliver 1.15-1.44X speedups and 18-54% lower scheduling overhead versus prior CUDA graph methods.
In: 2010 IEEE/ACM Int’l Conference on Green Computing and Communications & Int’l Conference on Cyber, Physical and Social Computing, pp
2 Pith papers cite this work. Polarity classification is still indexing.
Fields: cs.DC (2)
Verdicts: UNVERDICTED (2)
Representative citing papers
A new open source framework for multiscale modeling of fibrous materials on heterogeneous supercomputers
MuMFiM is a new open-source two-scale modeling framework achieving 1000x GPU microscale speedup and near-optimal strong/weak scaling to 128 nodes on heterogeneous hardware, demonstrated on a human spine ligament.