KL divergence correlates with benchmark scores over wide quantization ranges but loses all predictive power in the near-baseline silent zone because it tracks disagreement volume rather than direction.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3representative citing papers
Empirical evaluation of quantization effects on eight LLMs across bit widths, showing performance generally declines at lower precision but with model-size-dependent resilience and acceptable accuracy at 2 bits for many cases.
citing papers explorer
-
Displacement Is Not Direction: Evaluating Fidelity Metrics for Quantized LLM Deployment
KL divergence correlates with benchmark scores over wide quantization ranges but loses all predictive power in the near-baseline silent zone because it tracks disagreement volume rather than direction.
-
K-Quantization and its Impact on Output Performance
Empirical evaluation of quantization effects on eight LLMs across bit widths, showing performance generally declines at lower precision but with model-size-dependent resilience and acceptable accuracy at 2 bits for many cases.
- PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding