OWQ: Outlier-aware weight quantization for efficient fine-tuning and inference of large language models
1 Pith paper cites this work. Polarity classification is still indexing.
1 Pith paper citing it

Fields: cs.AI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)

Representative citing papers:
Saliency-Aware Regularized Quantization Calibration for Large Language Models
SARQC augments standard PTQ calibration with a saliency-aware regularizer to keep quantized weights closer to original floating-point values, yielding improved perplexity and zero-shot accuracy on dense and MoE LLMs.
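The summary above gives only the high-level mechanism, so here is a minimal PyTorch sketch of that idea, not SARQC's published algorithm: per-layer PTQ calibration that learns quantization scale and zero-point against a reconstruction loss, plus a saliency-weighted penalty keeping quantized weights near the floating-point originals. The names (`fake_quant`, `activation_saliency`, `calibrate_layer`), the mean-squared-activation saliency proxy, and the regularizer weight `lam` are all illustrative assumptions.

```python
import torch

def ste_round(v):
    # Straight-through estimator: round in the forward pass,
    # identity gradient in the backward pass.
    return (torch.round(v) - v).detach() + v

def fake_quant(w, scale, zero, nbits=4):
    # Uniform affine quantize-dequantize (per output channel).
    qmax = 2 ** nbits - 1
    q = torch.clamp(ste_round(w / scale + zero), 0, qmax)
    return (q - zero) * scale

def activation_saliency(x):
    # Per-input-channel saliency proxy: mean squared activation.
    # (A diagonal-Hessian-style estimate; SARQC's actual criterion
    # is not specified in the summary above.)
    return x.pow(2).mean(dim=0)  # shape: [in_features]

def calibrate_layer(w_fp, x_calib, nbits=4, lam=0.1, steps=200, lr=1e-3):
    # Tune scale/zero-point so quantized outputs match FP outputs,
    # with a saliency-weighted pull of W_q toward the FP weights.
    sal = activation_saliency(x_calib)      # [in_features]
    y_fp = x_calib @ w_fp.t()               # FP reference outputs

    # Min/max initialization of per-output-channel scale and zero-point.
    w_min = w_fp.min(dim=1, keepdim=True).values
    w_max = w_fp.max(dim=1, keepdim=True).values
    scale = ((w_max - w_min) / (2 ** nbits - 1)).clamp_min(1e-8)
    zero = (-w_min / scale).round()
    scale = scale.clone().requires_grad_(True)
    zero = zero.clone().requires_grad_(True)

    opt = torch.optim.Adam([scale, zero], lr=lr)
    for _ in range(steps):
        w_q = fake_quant(w_fp, scale, zero, nbits)
        recon = (x_calib @ w_q.t() - y_fp).pow(2).mean()  # standard PTQ calibration loss
        reg = (sal * (w_q - w_fp).pow(2)).mean()          # saliency-aware regularizer
        loss = recon + lam * reg
        opt.zero_grad()
        loss.backward()
        opt.step()
    return fake_quant(w_fp, scale.detach(), zero.detach(), nbits)

# Toy usage: one 64x128 linear layer with 256 calibration tokens.
torch.manual_seed(0)
w = torch.randn(64, 128)
x = torch.randn(256, 128)
w_q = calibrate_layer(w, x)
print((x @ w_q.t() - x @ w.t()).pow(2).mean())  # reconstruction error
```

The straight-through rounding lets gradients reach the quantization parameters; swapping the activation-based saliency proxy for a Hessian-diagonal estimate would be a natural variant in the OBQ/GPTQ tradition, though again that choice is an assumption here.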