Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence

Exploring the Trade-Offs: Quantization Methods, Task Difficulty · 2025 · DOI 10.24963/ijcai.2025/902

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Displacement Is Not Direction: Evaluating Fidelity Metrics for Quantized LLM Deployment

cs.LG · 2026-06-17 · unverdicted · novelty 6.0

KL divergence correlates with benchmark scores over wide quantization ranges but loses all predictive power in the near-baseline silent zone because it tracks disagreement volume rather than direction.

PipeSD: An Efficient Cloud-Edge Collaborative Pipeline Inference Framework with Speculative Decoding

cs.DC · 2026-05-13 · unverdicted · novelty 4.0 · 2 refs

PipeSD is a cloud-edge collaborative inference framework that overlaps token generation and communication via dynamic programming pipeline scheduling and uses Bayesian-optimized dual-threshold NAV triggering, delivering 1.16x-2.16x speedup and 14.3%-25.3% energy reduction over baselines.

K-Quantization and its Impact on Output Performance

cs.CL · 2026-05-19 · unverdicted · novelty 3.0

Empirical evaluation of quantization effects on eight LLMs across bit widths, showing performance generally declines at lower precision but with model-size-dependent resilience and acceptable accuracy at 2 bits for many cases.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence

fields

years

verdicts

representative citing papers

citing papers explorer