Title resolution pending

Guanhua Wang, Shivaram Venkataraman, Amar Phanishayee, Jorgen Thelin, Nikhil Devanur, Ion Stoica · 2019 · arXiv 1910.04940

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

PyTorch Distributed: Experiences on Accelerating Data Parallel Training

cs.DC · 2020-06-28 · accept · novelty 5.0

PyTorch distributed data parallel attains near-linear scalability on 256 GPUs through gradient bucketing, computation-communication overlap, and selective synchronization skipping.

The Landscape of GPU-Centric Communication

cs.DC · 2024-09-15 · unverdicted · novelty 2.0

A survey categorizing vendor mechanisms and user-level libraries for GPU-centric communication within and across nodes, with discussion of benefits, challenges, and open questions.

citing papers explorer

Showing 2 of 2 citing papers.

PyTorch Distributed: Experiences on Accelerating Data Parallel Training cs.DC · 2020-06-28 · accept · none · ref 47
PyTorch distributed data parallel attains near-linear scalability on 256 GPUs through gradient bucketing, computation-communication overlap, and selective synchronization skipping.
The Landscape of GPU-Centric Communication cs.DC · 2024-09-15 · unverdicted · none · ref 123
A survey categorizing vendor mechanisms and user-level libraries for GPU-centric communication within and across nodes, with discussion of benefits, challenges, and open questions.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer