4 Jerry Chee, Yaohui Cai, Volodymyr Kuleshov, and Christopher M De Sa

3 Pith papers cite this work. Polarity classification is still indexing.

fields: cs.LG (2), cs.IT (1)

years: 2026 (2), 2025 (1)

verdicts: UNVERDICTED (3)

representative citing papers

High-Rate Quantized Matrix Multiplication II

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

Waterfilling rate allocation makes quantized matrix multiplication for LLMs near information-theoretically optimal, with WaterSIC being basis-free and within 0.25 bits per entry of the limit.

High-Rate Quantized Matrix Multiplication I

cs.IT · 2026-01-23 · unverdicted · novelty 5.0

High-rate quantization theory yields accurate approximations for the distortion of absmax INT and FP schemes in generic weight-plus-activation matrix multiplication.

citing papers explorer

  • The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

    cs.LG · 2025-07-24 · unverdicted · none · ref 2

    GPTQ is equivalent to Babai's nearest plane algorithm for CVP on the Hessian lattice of layer inputs, yielding geometric interpretation, inherited error bounds, and improved clipping-free quantization with GPU kernels.

  • High-Rate Quantized Matrix Multiplication II

    cs.LG · 2026-05-13 · unverdicted · none · ref 19

    Waterfilling rate allocation makes quantized matrix multiplication for LLMs near information-theoretically optimal, with WaterSIC being basis-free and within 0.25 bits per entry of the limit.

  • High-Rate Quantized Matrix Multiplication I

    cs.IT · 2026-01-23 · unverdicted · none · ref 34

    High-rate quantization theory yields accurate approximations for the distortion of absmax INT and FP schemes in generic weight-plus-activation matrix multiplication.