arXiv preprint arXiv:2506.03781

Unifying Uniform, Binary-coding Quantization for Accurate Compression of Large Language Models · arXiv 2506.03781

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

LiftQuant: Continuous Bit-Width LLM via Dimensional Lifting and Projection

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

LiftQuant enables continuous bit-width LLM quantization via dimensional lifting and projection from a 1-bit lattice, allowing 2.4-bit compression of 70B models that outperforms fixed 2-bit baselines on identical hardware.

LBLLM: Lightweight Binarization of Large Language Models via Three-Stage Distillation

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

LBLLM achieves better accuracy than prior binarization methods for LLMs by decoupling weight and activation quantization through initialization, layer-wise distillation, and learnable activation scaling.

citing papers explorer

LiftQuant: Continuous Bit-Width LLM via Dimensional Lifting and Projection

cs.LG · 2026-06-02 · unverdicted · none · ref 58

LBLLM: Lightweight Binarization of Large Language Models via Three-Stage Distillation

cs.LG · 2026-04-21 · unverdicted · none · ref 99
