A.3 Cayley OPTIMIZATION CHOICE In Table 11, we evaluate the impact of varying the number of samples and iterations used in Cay- ley optimization

Our method, SpinQuant, successfully reduces the gap to the full-precision network from the previous 9 · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

SpinQuant: LLM quantization with learned rotations

cs.LG · 2024-05-26 · conditional · novelty 7.0

SpinQuant learns optimal rotations to enable accurate 4-bit quantization of LLM weights, activations, and KV cache, reducing the zero-shot gap to full precision to 2.9 points on LLaMA-2 7B.

citing papers explorer

Showing 1 of 1 citing paper.

SpinQuant: LLM quantization with learned rotations cs.LG · 2024-05-26 · conditional · none · ref 28
SpinQuant learns optimal rotations to enable accurate 4-bit quantization of LLM weights, activations, and KV cache, reducing the zero-shot gap to full precision to 2.9 points on LLaMA-2 7B.

A.3 Cayley OPTIMIZATION CHOICE In Table 11, we evaluate the impact of varying the number of samples and iterations used in Cay- ley optimization

fields

years

verdicts

representative citing papers

citing papers explorer