Recognition: 2 theorem links · Lean Theorem
Designing Instance-Level Sampling Schedules via REINFORCE with James-Stein Shrinkage
Pith reviewed 2026-05-17 04:19 UTC · model grok-4.3
The pith
Learning prompt-specific sampling schedules with a James-Stein REINFORCE baseline improves alignment and lets few-step diffusion match distilled quality.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We show that instance-level sampling schedules can be learned for a frozen text-to-image diffusion sampler by optimizing a Dirichlet policy in a single pass with REINFORCE gradients whose variance is reduced by a principled James-Stein estimator serving as reward baseline. The resulting schedules, conditioned on both prompt and noise, produce measurable gains in text-image alignment metrics, including text rendering and compositional control, across current Stable Diffusion and Flux families. In addition, the same schedules allow a 5-step Flux-Dev sampler to reach generation quality comparable to that of deliberately distilled few-step models such as Flux-Schnell.
What carries the argument
A Dirichlet policy that outputs instance-specific sampling schedules, trained with REINFORCE using a James-Stein shrinkage estimator as the reward baseline to reduce gradient-estimation error in high-dimensional action spaces.
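To make the machinery concrete, here is a minimal single-pass sketch in PyTorch, under stated assumptions: the policy maps (initial noise, prompt embedding) to Dirichlet concentrations over per-step time fractions, and the frozen sampler plus reward model sit behind a black-box call. The names `policy_net` and `sample_and_reward`, and the Efron-Morris-style shrinkage toward the batch mean, are illustrative assumptions, not the paper's implementation.

```python
import torch

def js_shrunk_baseline(rewards: torch.Tensor) -> torch.Tensor:
    """Positive-part James-Stein (Efron-Morris) shrinkage of per-instance rewards
    toward their batch mean; the shrunk values act as per-instance baselines.
    Generic construction, not necessarily the paper's exact estimator; assumes
    a batch of at least four instances."""
    d = rewards.numel()
    grand_mean = rewards.mean()
    centered = rewards - grand_mean
    noise_var = rewards.var(unbiased=True)                      # crude plug-in estimate of reward noise
    sq_norm = centered.pow(2).sum().clamp_min(1e-8)
    shrink = (1.0 - (d - 3) * noise_var / sq_norm).clamp(min=0.0)
    return grand_mean + shrink * centered

def reinforce_step(policy_net, noises, prompt_embs, sample_and_reward, optimizer):
    """One single-pass REINFORCE update over a batch of (noise, prompt) instances."""
    alphas = policy_net(noises, prompt_embs).clamp_min(1e-3)    # (B, n_steps) Dirichlet concentrations
    dist = torch.distributions.Dirichlet(alphas)
    taus = dist.sample()                                        # (B, n_steps) step fractions; each row sums to 1
    rewards = torch.tensor([float(sample_and_reward(z, c, t))   # frozen sampler + reward model; no gradient through it
                            for z, c, t in zip(noises, prompt_embs, taus)])
    baselines = js_shrunk_baseline(rewards)                     # baseline borrows strength across instances
    advantages = (rewards - baselines).detach()
    loss = -(advantages * dist.log_prob(taus)).mean()           # score-function (REINFORCE) surrogate
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

A leave-one-out variant of the shrinkage, excluding each instance's own reward from its baseline, would keep the score-function estimator strictly unbiased; the sketch trades that for brevity.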
If this is right
- Text rendering and compositional control improve without any change to model weights.
- A 5-step Flux-Dev sampler attains quality comparable to that of distilled models such as Flux-Schnell.
- The same rescheduling procedure applies across Stable Diffusion and Flux model families.
- Instance-level scheduling emerges as a post-training lever distinct from weight fine-tuning or distillation.
Where Pith is reading between the lines
- The same policy-learning approach could be tested on iterative samplers used for audio or video generation.
- Instance-level schedules might be combined with existing distillation pipelines to push step counts even lower while preserving alignment.
- The framework invites experiments that measure how well the learned schedules transfer across random seeds or entirely new prompt distributions.
Load-bearing premise
A single-pass Dirichlet policy trained with the James-Stein baseline produces stable, generalizable instance-level schedules whose measured improvements are not artifacts of the chosen reward or evaluation protocol.
What would settle it
Running the learned schedules on a fresh set of prompts or on a different diffusion backbone and finding no consistent lift in text-alignment or human-preference scores relative to the original fixed schedule would falsify the central claim.
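A minimal version of that test is sketched below, assuming only black-box access to the frozen sampler and an alignment scorer; `generate`, `alignment_score`, and `learned_schedule_for` are placeholder callables rather than APIs from the paper, and the prompts are assumed to be disjoint from the policy's training set.

```python
from statistics import mean

def paired_schedule_eval(prompts, learned_schedule_for, default_schedule,
                         generate, alignment_score):
    """Paired comparison of learned instance-level schedules against the sampler's
    default schedule on held-out prompts; returns the mean per-prompt lift."""
    deltas = []
    for p in prompts:                                  # prompts unseen during policy training
        img_learned = generate(p, learned_schedule_for(p))
        img_default = generate(p, default_schedule)
        deltas.append(alignment_score(img_learned, p) - alignment_score(img_default, p))
    return mean(deltas)                                # no consistent positive lift would undercut the claim
```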
Original abstract
Most post-training methods for text-to-image samplers focus on model weights: either fine-tuning the backbone for alignment or distilling it for few-step efficiency. We take a different route: rescheduling the sampling timeline of a frozen sampler. Instead of a fixed, global schedule, we learn instance-level (prompt- and noise-conditioned) schedules through a single-pass Dirichlet policy. To ensure accurate gradient estimates in high-dimensional policy learning, we introduce a novel reward baseline based on a principled James-Stein estimator; it provably achieves lower estimation errors than commonly used variants and leads to superior performance. Our rescheduled samplers consistently improve text-image alignment including text rendering and compositional control across modern Stable Diffusion and Flux model families. Additionally, a 5-step Flux-Dev sampler with our schedules can attain generation quality comparable to deliberately distilled samplers like Flux-Schnell. We thus position our scheduling framework as an emerging model-agnostic post-training lever that unlocks additional generative potential in pretrained samplers.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces a model-agnostic post-training method that learns prompt- and noise-conditioned instance-level sampling schedules for frozen text-to-image diffusion models via a single-pass Dirichlet policy optimized with REINFORCE. A novel James-Stein shrinkage estimator is proposed as the reward baseline, which the authors claim provably reduces gradient estimation error relative to standard baselines and yields consistent gains in text-image alignment, text rendering, and compositional control on Stable Diffusion and Flux families; a 5-step Flux-Dev schedule is reported to match the quality of distilled models such as Flux-Schnell.
Significance. If the empirical gains and the claimed error reduction hold under broader testing, the work supplies a lightweight, weight-free lever for improving pretrained samplers that complements existing fine-tuning and distillation pipelines. The principled use of James-Stein shrinkage for variance reduction in high-dimensional policy gradients is a concrete technical contribution that could transfer to other RL-for-sampling settings.
major comments (2)
- §3 (or the corresponding appendix): the abstract asserts that the James-Stein baseline 'provably achieves lower estimation errors' than common variants, yet the manuscript provides neither the derivation steps nor the key inequality that establishes this reduction; without it, the central justification for adopting the baseline, which is load-bearing for the REINFORCE training claim, remains unverified.
- §4.2–4.3 (experimental protocol): the reported gains rely on a single reward (presumably CLIP-based) and a fixed set of training prompts; no ablation or transfer test on held-out prompt distributions or alternative rewards (e.g., aesthetic or human-preference scores) is shown, leaving open the possibility that the learned Dirichlet policy overfits the training protocol rather than discovering generally useful schedules.
minor comments (2)
- Figure 2 and Table 1: axis labels and legend entries for the schedule-parameter distributions are too small to read at standard print size; enlarging or splitting the panels would improve clarity.
- §2.2: the notation for the Dirichlet concentration parameters conditioned on prompt and noise timestep is introduced without an explicit equation reference, making it difficult to trace how the policy output is mapped to the sampling schedule.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments. We address each major comment below with clarifications and indicate where revisions will be made to strengthen the manuscript.
Point-by-point responses
- Referee: §3 (or the corresponding appendix): the abstract asserts that the James-Stein baseline 'provably achieves lower estimation errors' than common variants, yet the manuscript provides neither the derivation steps nor the key inequality that establishes this reduction; without it, the central justification for adopting the baseline, which is load-bearing for the REINFORCE training claim, remains unverified.
Authors: We agree that the central claim requires explicit support. The James-Stein shrinkage estimator is applied to the per-instance reward baseline within the REINFORCE gradient, and the variance reduction follows from the standard bias-variance tradeoff of the James-Stein estimator for estimating a multivariate normal mean when the dimension exceeds two. We will insert the full derivation, including the key inequality that bounds the mean-squared error of the shrunk estimator below that of the sample-mean baseline, into Section 3 and the corresponding appendix of the revised manuscript; the classical form of that inequality is restated after these responses. revision: yes
- Referee: §4.2–4.3 (experimental protocol): the reported gains rely on a single reward (presumably CLIP-based) and a fixed set of training prompts; no ablation or transfer test on held-out prompt distributions or alternative rewards (e.g., aesthetic or human-preference scores) is shown, leaving open the possibility that the learned Dirichlet policy overfits the training protocol rather than discovering generally useful schedules.
Authors: The concern about potential overfitting to the training distribution and reward is legitimate. The current experiments demonstrate gains on two distinct model families using the CLIP-based reward, yet we did not report explicit transfer results on held-out prompt sets or alternative reward functions. We will add these ablations—evaluating the learned schedules on held-out prompts and with aesthetic and human-preference rewards—in the revised experimental section to better substantiate generalizability. revision: yes
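For context on the first exchange, the classical dominance inequality that the response appears to invoke is the James-Stein result for a multivariate normal mean; whether the paper's per-instance reward baseline reduces exactly to this setting is an assumption, since the derivation is not in the provided text.

```latex
% Classical James-Stein dominance (James & Stein, 1961).
% For X ~ N_d(theta, sigma^2 I_d) with d >= 3, the shrinkage estimator
%   \hat{\theta}_{JS} = (1 - (d-2)\sigma^2 / \|X\|^2) X
% has strictly smaller risk than the maximum-likelihood estimator X:
\[
  \mathbb{E}\,\bigl\|\hat{\theta}_{\mathrm{JS}} - \theta\bigr\|^{2}
  \;=\; d\,\sigma^{2} \;-\; (d-2)^{2}\,\sigma^{4}\,
        \mathbb{E}\!\left[\frac{1}{\lVert X\rVert^{2}}\right]
  \;<\; d\,\sigma^{2}
  \;=\; \mathbb{E}\,\lVert X - \theta \rVert^{2}.
\]
```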
Circularity Check
No significant circularity in derivation or claims
full rationale
The paper presents an empirical RL method that applies the standard James-Stein shrinkage estimator as a reward baseline within REINFORCE for learning a Dirichlet policy over sampling schedules. The estimator itself is grounded in classical statistics and is not derived from the target result. Policy optimization follows standard REINFORCE with baseline subtraction, and the reported gains are measured on alignment metrics after training; nothing in the provided text shows a derivation that reduces by construction to the fitted parameters or rests on self-citations. No load-bearing self-citations, uniqueness theorems, or ansatzes imported from the authors' prior work appear in the provided text. The framework is grounded in external statistical results rather than in its own conclusions and does not rename known results as novel derivations.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
- IndisputableMonolith/Cost/FunctionalEquation.lean · washburn_uniqueness_aczel (unclear)
Relation between the paper passage and the cited Recognition theorem is unclear.
Paper passage: "We introduce a novel reward baseline based on a principled James–Stein estimator; it provably achieves lower estimation errors than commonly used variants"
- IndisputableMonolith/Foundation/ArithmeticFromLogic.lean · LogicNat_equivNat (unclear)
Relation between the paper passage and the cited Recognition theorem is unclear.
Paper passage: "Dirichlet parameterization... τ ∼ Dirichlet(α_θ(x_T, c))"
What do these tags mean?
- matches: The paper's claim is directly supported by a theorem in the formal canon.
- supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses: The paper appears to rely on the theorem as machinery.
- contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
- unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.