Aligning Few-Step Generative Models by Amortizing Sample-based Variational Inference

Dohyun Kim; Hyeongyu Kang; Jaewoo Lee; Jeongjae Lee; Jinkyoo Park; Jongchul Ye; Kyuil Sim; Minsu Kim; Sanghyeok Choi; Tabitha Edith Lee

arxiv: 2605.26552 · v1 · pith:GSYTBFMCnew · submitted 2026-05-26 · 💻 cs.LG · cs.AI

Aligning Few-Step Generative Models by Amortizing Sample-based Variational Inference

Jaewoo Lee , Hyeongyu Kang , Dohyun Kim , Kyuil Sim , Woocheol Shin , Minsu Kim , Taeyoung Yun , Jeongjae Lee

show 4 more authors

Sanghyeok Choi Tabitha Edith Lee Jongchul Ye Jinkyoo Park

This is my paper

classification 💻 cs.LG cs.AI

keywords alignmentfew-stepgenerativegeneratorvariationaldistributioninferencemodel

0 comments

read the original abstract

Aligning a few-step generative model is challenging, since existing alignment frameworks typically rely on restrictive assumptions: a tractable likelihood, a specific ODE/SDE solver, or a particular model family. We introduce FAV, Few-step Generative Models Alignment via Sample-based Variational Inference, a general alignment framework that requires only sample access to the generator and the reference distribution. We cast alignment as sampling from a reward-tilted distribution anchored to a reference distribution. We leverage Stein Variational Gradient Descent as a sample-based variational inference scheme and amortize its particle updates into the generator parameters via fixed-point regression. We evaluate FAV on two domains: robotics manipulation and image generator alignment. On generative policy alignment for robotic manipulation, FAV outperforms prevailing policy extraction baselines across 56 offline and 30 offline-to-online RL tasks. For image generator alignment, FAV fine-tunes diverse few-step backbones, including GAN, drifting model, consistency models, and flow maps, scaling from ImageNet-$256$ to 1024$^2$ text-to-image synthesis. Code is available at https://github.com/Jaewoopudding/FAV.

This paper has not been read by Pith yet.

Aligning Few-Step Generative Models by Amortizing Sample-based Variational Inference

discussion (0)