Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence

Exploring the Trade-Offs: Quantization Methods, Task Difficulty · 2025 · DOI 10.24963/ijcai.2025/902

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Displacement Is Not Direction: Evaluating Fidelity Metrics for Quantized LLM Deployment

cs.LG · 2026-06-17 · unverdicted · novelty 6.0

KL divergence correlates with benchmark scores over wide quantization ranges but loses all predictive power in the near-baseline silent zone because it tracks disagreement volume rather than direction.

K-Quantization and its Impact on Output Performance

cs.CL · 2026-05-19 · unverdicted · novelty 3.0

Empirical evaluation of quantization effects on eight LLMs across bit widths, showing performance generally declines at lower precision but with model-size-dependent resilience and acceptable accuracy at 2 bits for many cases.

PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

cs.DC · 2026-05-13 · 2 refs

citing papers explorer

Displacement Is Not Direction: Evaluating Fidelity Metrics for Quantized LLM Deployment · cs.LG · 2026-06-17 · unverdicted · none · ref 15
K-Quantization and its Impact on Output Performance · cs.CL · 2026-05-19 · unverdicted · none · ref 4
PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding · cs.DC · 2026-05-13 · unreviewed · ref 20 · 2 links
