Crafting heavy-tails in weight matrix spectrum without gradient noise.arXiv preprint arXiv:2406.04657,

Vignesh Kothapalli, Tianyu Pang, Shenyang Deng, Zongmin Liu, Yaoqing Yang · arXiv 2406.04657

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

AlphaQ: Calibration-Free Bit Allocation for Mixture-of-Experts Quantization

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

AlphaQ performs calibration-free mixed-precision quantization of MoE models by allocating higher bits to experts whose weight spectra exhibit stronger heavy-tailed structure according to HT-SR theory, outperforming calibration-based methods and reaching near full-precision accuracy at 3.5 average bi

citing papers explorer

Showing 1 of 1 citing paper after filters.

AlphaQ: Calibration-Free Bit Allocation for Mixture-of-Experts Quantization cs.LG · 2026-06-03 · unverdicted · none · ref 14
AlphaQ performs calibration-free mixed-precision quantization of MoE models by allocating higher bits to experts whose weight spectra exhibit stronger heavy-tailed structure according to HT-SR theory, outperforming calibration-based methods and reaching near full-precision accuracy at 3.5 average bi

Crafting heavy-tails in weight matrix spectrum without gradient noise.arXiv preprint arXiv:2406.04657,

fields

years

verdicts

representative citing papers

citing papers explorer