arXiv preprint arXiv:2305.02633

Shauli Ravfogel, Yoav Goldberg, Jacob Goldberger · 2023 · arXiv 2305.02633

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Not All Errors Are Equal: A Systematic Study of Error Propagation in Large Language Model Inference

cs.DC · 2026-06-01 · unverdicted · novelty 7.0

A new fault-injection framework enables a systematic empirical study that produces 17 takeaways on error propagation in LLM inference and four software-only mitigation directions.

RaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM Inference

cs.LG · 2026-06-30 · unverdicted · novelty 5.0

RaBitQCache proposes rotated binary quantization with binary-INT4 arithmetic for unbiased attention weight estimation in long-context LLMs, enabling adaptive Top-p retrieval and hardware optimizations.

citing papers explorer

Showing 2 of 2 citing papers.

Not All Errors Are Equal: A Systematic Study of Error Propagation in Large Language Model Inference

cs.DC · 2026-06-01 · unverdicted · none · ref 71

RaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM Inference

cs.LG · 2026-06-30 · unverdicted · none · ref 42
