Flash-DMD: Towards high-fidelity few-step image generation with efficient distillation and joint reinforcement learning

5 Pith papers cite this work. Polarity classification is still indexing.

fields

cs.CV 4 · cs.LG 1

years

2026 5

verdicts

UNVERDICTED 5

roles

background 2

polarities

background 1 · unclear 1

representative citing papers

Reinforcing Few-step Generators via Reward-Tilted Distribution Matching

cs.CV · 2026-05-25 · unverdicted · novelty 6.0

RTDMD casts KL minimization toward a reward-tilted teacher as distribution matching plus reward terms. Stage one uses AC-DMD; stage two uses hybrid GRPO-style gradients plus SubGRPO, reaching new SOTA on preference, aesthetic, and compositional metrics with 4-step generation on SD3, SD3.5, and F
