Todor Mihaylov, Peter Clark, Tushar Khot, and Ashish Sabharwal

(Accessed on 09/26/ · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation

cs.LG · 2025-03-25 · unverdicted · novelty 6.0

LogQuant applies log-based filtering for 2-bit KV cache quantization in LLMs, claiming 25% higher throughput, 60% larger batches, and 40-200% accuracy gains on math/code tasks versus existing compression approaches.

citing papers explorer

Showing 1 of 1 citing paper.

LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation cs.LG · 2025-03-25 · unverdicted · none · ref 14
LogQuant applies log-based filtering for 2-bit KV cache quantization in LLMs, claiming 25% higher throughput, 60% larger batches, and 40-200% accuracy gains on math/code tasks versus existing compression approaches.

Todor Mihaylov, Peter Clark, Tushar Khot, and Ashish Sabharwal

fields

years

verdicts

representative citing papers

citing papers explorer