Mixdq: Memory-efficient few-step text-to-image diffusion models with metric-decoupled mixed precision quantization

Tianchen Zhao, Xuefei Ning, Tongcheng Fang, Enshu Liu, Guyue Huang, Zinan Lin, Shengen Yan, Guohao Dai, Yu Wang · 2024 · arXiv 2405.17873

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

RUQuant: Towards Refining Uniform Quantization for Large Language Models

cs.CL · 2026-04-05 · unverdicted · novelty 6.0

RUQuant uses block-wise composite orthogonal matrices from Householder reflections and Givens rotations plus a fine-tuned global reflection to achieve 99.8% full-precision accuracy at W6A6 and 97% at W4A4 for 13B LLMs in about one minute.

LoRaQ: Optimized Low Rank Approximation for 4-bit Quantization

cs.LG · 2026-04-20 · unverdicted · novelty 5.0

LoRaQ enables fully sub-16-bit quantized diffusion models by optimizing low-rank error compensation in a data-free way, outperforming prior methods at equal memory cost on Pixart-Σ and SANA while supporting mixed low-precision branches.

citing papers explorer

Showing 2 of 2 citing papers.

RUQuant: Towards Refining Uniform Quantization for Large Language Models cs.CL · 2026-04-05 · unverdicted · none · ref 20
RUQuant uses block-wise composite orthogonal matrices from Householder reflections and Givens rotations plus a fine-tuned global reflection to achieve 99.8% full-precision accuracy at W6A6 and 97% at W4A4 for 13B LLMs in about one minute.
LoRaQ: Optimized Low Rank Approximation for 4-bit Quantization cs.LG · 2026-04-20 · unverdicted · none · ref 12
LoRaQ enables fully sub-16-bit quantized diffusion models by optimizing low-rank error compensation in a data-free way, outperforming prior methods at equal memory cost on Pixart-Σ and SANA while supporting mixed low-precision branches.

Mixdq: Memory-efficient few-step text-to-image diffusion models with metric-decoupled mixed precision quantization

fields

years

verdicts

representative citing papers

citing papers explorer