pith. sign in

arXiv preprint arXiv:2501.13987

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

fields

cs.LG 7 cs.CV 1

years

2026 6 2025 2

representative citing papers

LoopQ: Quantization for Recursive Transformers

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

LoopQ provides a loop-aware PTQ framework for recursive Transformers that mitigates distribution shift, state reuse, and recursive error accumulation, yielding 68.8% higher average accuracy and 87.7% lower perplexity under W4A4 versus static baselines.

Theory-optimal Quantization Based on Flatness

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

The paper introduces the Flatness metric, derives a theory-optimal quantization solution, and presents BDQ that uses bidirectional diagonal transformations to reduce outlier impact, achieving under 1% drop at W4A4 on LLaMA-3-8B.

citing papers explorer

Showing 8 of 8 citing papers.