Lossless compression for LLM tensor incremental snapshots

Daniel Waddington, Cornel Constantinescu · 2025 · arXiv 2505.09810

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

FluxMoE: Decoupling Expert Residency for High-Performance MoE Serving

cs.LG · 2026-04-03 · unverdicted · novelty 6.0

FluxMoE decouples MoE expert weights from persistent GPU residency via on-demand paging, achieving up to 3x throughput gains over vLLM in memory-constrained inference without accuracy loss.

ZipCCL: Efficient Lossless Data Compression of Communication Collectives for Accelerating LLM Training

cs.DC · 2026-04-30 · unverdicted · novelty 5.0

ZipCCL delivers up to 1.35x faster communication and 1.18x end-to-end speedup in LLM training through lossless compression of near-Gaussian collectives on 64-GPU clusters.

citing papers explorer

Showing 2 of 2 citing papers.

FluxMoE: Decoupling Expert Residency for High-Performance MoE Serving cs.LG · 2026-04-03 · unverdicted · none · ref 49
FluxMoE decouples MoE expert weights from persistent GPU residency via on-demand paging, achieving up to 3x throughput gains over vLLM in memory-constrained inference without accuracy loss.
ZipCCL: Efficient Lossless Data Compression of Communication Collectives for Accelerating LLM Training cs.DC · 2026-04-30 · unverdicted · none · ref 47
ZipCCL delivers up to 1.35x faster communication and 1.18x end-to-end speedup in LLM training through lossless compression of near-Gaussian collectives on 64-GPU clusters.

Lossless compression for LLM tensor incremental snapshots

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer