pith. sign in

AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

method 1

citation-polarity summary

years

2026 3

roles

method 1

polarities

use method 1

representative citing papers

K-Quantization and its Impact on Output Performance

cs.CL · 2026-05-19 · unverdicted · novelty 3.0

Empirical evaluation of quantization effects on eight LLMs across bit widths, showing performance generally declines at lower precision but with model-size-dependent resilience and acceptable accuracy at 2 bits for many cases.

citing papers explorer

Showing 3 of 3 citing papers.