Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models

Jinyang Du; Jinyang Guo; Ruihao Gong; Shenghao Jin; Shiqiao Gu; Xianglong Liu; Yang Yong; Ziqian Xu

arxiv: 2606.00658 · v1 · pith:3XFXU6OInew · submitted 2026-05-30 · 💻 cs.CV · cs.AI

Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models

Jinyang Du , Shenghao Jin , Ziqian Xu , Ruihao Gong , Shiqiao Gu , Yang Yong , Jinyang Guo , Xianglong Liu This is my paper

classification 💻 cs.CV cs.AI

keywords few-steplow-bitmodelquantizationdenoisingdiffusiondistillationdual-expert

0 comments

read the original abstract

Large video diffusion models achieve strong visual quality but remain expensive to deploy because each sample requires many denoising steps and a large resident parameter footprint. This paper studies a deployment-oriented compression pipeline for Wan2.2-T2V-A14B by combining few-step distribution-matching distillation with low-bit quantization. The pipeline follows the model's dual-expert denoising route, calibrates the high-noise and low-noise branches separately, protects sensitive entrance layers, and uses HiF4-style low-bit representation to improve dynamic-range coverage. Quantization is calibrated on the distilled few-step student rather than on the original long-step trajectory, reducing activation-distribution mismatch during inference. The proposed co-design keeps the quantized model close to the same-step full-precision model and surpasses the original full-precision baseline at 8 and 20 steps on average. The 20-step setting gives the best quality-efficiency trade-off in the tested configurations.

This paper has not been read by Pith yet.

Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models

discussion (0)