To compress or not? pushing the frontier of lossless genai model weights compression with exponent concentration.arXiv preprint arXiv:2510.02676

Zeyu Yang, Tianyi Zhang, Jianwen Xie, Chuan Li, Zhaozhuo Xu, Anshumali Shrivastava · arXiv 2510.02676

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Statistically-Lossless Quantization of Large Language Models

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

SLQ achieves task-lossless LLM quantization below 4 bits per parameter and distribution-lossless at 5-6 bits on average, with 1.7-3.6x speedups over FP16.

citing papers explorer

Showing 1 of 1 citing paper.

Statistically-Lossless Quantization of Large Language Models cs.LG · 2026-05-04 · unverdicted · none · ref 11
SLQ achieves task-lossless LLM quantization below 4 bits per parameter and distribution-lossless at 5-6 bits on average, with 1.7-3.6x speedups over FP16.

To compress or not? pushing the frontier of lossless genai model weights compression with exponent concentration.arXiv preprint arXiv:2510.02676

fields

years

verdicts

representative citing papers

citing papers explorer