Acco: Accumulate while you communicate for communication-overlapped sharded llm training.arXiv preprint arXiv:2406.02613, 2024

Adel Nabli, Louis Fournier, Pierre Erbacher, Louis Serrano, Eugene Belilovsky, Edouard Oyallon · 2024 · arXiv 2406.02613

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Learned Subspace Compression for Communication-Efficient Pipeline Parallelism

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

MAPL learns task-specific orthogonal compression subspaces per pipeline stage via manifold-constrained optimization and recovers signals with low-overhead anchors, yielding better compression-performance tradeoffs than fixed projections on LLaMA models up to 1B parameters.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Learned Subspace Compression for Communication-Efficient Pipeline Parallelism cs.LG · 2026-06-03 · unverdicted · none · ref 37
MAPL learns task-specific orthogonal compression subspaces per pipeline stage via manifold-constrained optimization and recovers signals with low-overhead anchors, yielding better compression-performance tradeoffs than fixed projections on LLaMA models up to 1B parameters.

Acco: Accumulate while you communicate for communication-overlapped sharded llm training.arXiv preprint arXiv:2406.02613, 2024

fields

years

verdicts

representative citing papers

citing papers explorer