arxiv: 1608.00571 · v1 · pith:SOSHMMSBnew · submitted 2016-08-01 · 💻 cs.DC · cs.OS · cs.PL

TREES: A CPU/GPU Task-Parallel Runtime with Explicit Epoch Synchronization

classification 💻 cs.DC cs.OS cs.PL
keywords trees, work-first, principle, runtime, task-parallel, applications, gpus, paid

We have developed a task-parallel runtime system, called TREES, that is designed for high performance on CPU/GPU platforms. On platforms with multiple CPUs, Cilk's "work-first" principle underlies how task-parallel applications can achieve performance, but work-first is a poor fit for GPUs. We build upon work-first to create the "work-together" principle that addresses the specific strengths and weaknesses of GPUs. The work-together principle extends work-first by stating that (a) the overhead on the critical path should be paid by the entire system at once and (b) work overheads should be paid co-operatively. We have implemented the TREES runtime in OpenCL, and we experimentally evaluate TREES applications on a CPU/GPU platform.
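One way to picture the epoch-style execution named in the title is the following CPU-only sketch. It is not the TREES implementation, which the abstract says is written in OpenCL; the names `run_epochs` and `spawn_tree` are hypothetical, and the mapping to the work-together principle is an interpretation. Every task of the current epoch runs before any task it spawns, so the loop has a single synchronization point per epoch (overhead on the critical path paid once for the whole system), and spawned tasks are collected into one shared batch rather than scheduled individually (work overheads paid together).

```python
from typing import Callable, List, Tuple

# A task is a function plus one integer argument; running it returns the
# tasks it spawns. This representation is an assumption for illustration.
Task = Tuple[Callable, int]


def run_epochs(initial: List[Task]) -> List[int]:
    """Run tasks in bulk-synchronous epochs; return the batch size of each epoch."""
    current = list(initial)
    batch_sizes: List[int] = []
    while current:
        batch_sizes.append(len(current))
        next_batch: List[Task] = []
        # All tasks of this epoch run before any spawned task does.
        for fn, arg in current:
            next_batch.extend(fn(arg))
        # Single synchronization point: the whole batch advances together.
        current = next_batch
    return batch_sizes


def spawn_tree(depth: int) -> List[Task]:
    """Hypothetical task: spawn two children until depth reaches zero."""
    if depth == 0:
        return []
    return [(spawn_tree, depth - 1), (spawn_tree, depth - 1)]
```

Calling `run_epochs([(spawn_tree, 3)])` returns `[1, 2, 4, 8]`: four epochs, each handling the full set of tasks spawned by the previous one.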
