Simple Approximation and Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures

Chenyang Wang; Jose Blanchet; Weizhong Wang; Yinuo Ren; Yiping Lu

arxiv: 2605.17850 · v1 · pith:YQZURXGUnew · submitted 2026-05-18 · 📊 stat.ML · cs.CV· cs.LG· cs.NA· math.NA· math.PR

Simple Approximation and Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures

Chenyang Wang , Weizhong Wang , Yinuo Ren , Jose Blanchet , Yiping Lu This is my paper

Pith reviewed 2026-05-20 01:08 UTC · model grok-4.3

classification 📊 stat.ML cs.CVcs.LGcs.NAmath.NAmath.PR

keywords diffusion modelssequential Monte CarloGirsanov theoreminference-time scalingderivative-free methodspath measuresimportance resamplingunbiased estimation

0 comments

The pith

URGE uses Girsanov path weights for unbiased, derivative-free resampling in diffusion model inference.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents URGE, a new algorithm for improving diffusion model samples at inference time without derivatives or biases. It applies sequential Monte Carlo by assigning multiplicative weights to entire trajectories using a Girsanov change of measure and resampling periodically. This avoids the repeated score evaluations required by prior guidance methods. The authors prove an equivalence between their path-wise weighting and traditional particle-wise approaches, showing both yield the same unbiased final distribution. Experiments demonstrate superior performance on benchmarks with simpler implementation.

Core claim

By leveraging the Girsanov theorem on path measures, one can attach a simple multiplicative weight to each simulated trajectory and resample them, achieving inference-time scaling that is derivative-free and unbiased. The key equivalence shows that the Girsanov path weight, through a backward conditional expectation, recovers the particle-level weights from previous work, ensuring identical terminal laws.

What carries the argument

The Girsanov path weight as a multiplicative factor on trajectories that enables unbiased importance resampling via sequential Monte Carlo on path measures.

Load-bearing premise

The underlying diffusion process must allow the Girsanov theorem to define a valid change of measure between the reference and guided dynamics.

What would settle it

Running a controlled simulation where the terminal distribution from URGE is compared to the exact target distribution or to a particle-wise SMC implementation and observing significant bias or mismatch would disprove the unbiasedness claim.

Figures

Figures reproduced from arXiv: 2605.17850 by Chenyang Wang, Jose Blanchet, Weizhong Wang, Yinuo Ren, Yiping Lu.

**Figure 1.** Figure 1: Step-wise resampling corrects suboptimal guided generation toward the reward-tilted distribution. (a) In Naive Guidance, blue trajectories follow the guided diffusion dX G t = [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2 [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: URGE achieves better performance compared with baseline, while exhibiting monotonic and larger reward increase with N (unlike FK-Steering). With URGE, a smaller model (SD v1.5) can attain higher reward than an XL model [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Qualitative comparison of inference-time scaling strategy on the Gaussian Mixture example. In our main experiments, we use K = 40 components and Dim = 30 dimensions as the default setting. To further validate the robustness of URGE, we additionally conduct experiments with K = 80, Dim = 30 and K = 40, Dim = 60, respectively. We run experiments under three random seeds and report the averaged results, as ta… view at source ↗

**Figure 5.** Figure 5: Performance metrics versus discretization steps on Gaussian Mixture Model. for the prompts used. The selected prompts contain exactly two colored entities. On colored two-object prompts, URGE also performs strongly. The generated samples adhere more strictly to the specified color attributes and spatial ordering, demonstrating enhanced compositional fidelity. Similar to additional experiments on other task… view at source ↗

**Figure 6.** Figure 6: Inference time versus number of particles for the Gaussian Deblurring task [PITH_FULL_IMAGE:figures/full_fig_p035_6.png] view at source ↗

**Figure 8.** Figure 8: Inference time for the inverse problems tasks on FFHQ-256 and ImageNet-256. Results report runtimes where AFDPS-ODE is run with k = 5 particles while all other methods use k = 10 particles. 35 [PITH_FULL_IMAGE:figures/full_fig_p035_8.png] view at source ↗

**Figure 9.** Figure 9: Inference time for the text-to-image tasks on 20 prompts. FKG refers to FK-Steering with gradient guidance. Base is ran with only 1 particle. Prompt a photo of ... ... a blue laptop and a brown bear. ... a purple elephant and a brown sports ball. ... a white dining table and a red car. ... a blue cell phone and a green apple. Base Model SDv1.5 Base Model SDXL FKSteering SDv1.5, N = 4 URGE SDv1.5 N = 4 [P… view at source ↗

**Figure 10.** Figure 10: Additional visual examples for the Box inpainting problem and the Gaussian deblurring problem on FFHQ [PITH_FULL_IMAGE:figures/full_fig_p037_10.png] view at source ↗

**Figure 11.** Figure 11: Additional visual examples for the Motion deblurring problem and the Super resolution problem on FFHQ. 37 [PITH_FULL_IMAGE:figures/full_fig_p037_11.png] view at source ↗

read the original abstract

iffusion-based generative models increasingly rely on inference-time guidance, adding a drift term or reweighting mixture of experts, to improve sample quality on task-specific objectives. However, most existing techniques require repeated score or gradient evaluations, introducing bias, high computational overhead, or both. We introduce \texttt{URGE}, Unbiased Resampling via Girsanov Estimation, a derivative-free inference-time scaling algorithm that performs path-wise importance reweighting via a Girsanov change of measure. Instead of computing gradient-based particle weights in previous work, \texttt{URGE} attaches a simple multiplicative weight to each simulated trajectory and periodically resamples. No score, no Hessian, and no PDE evaluation is required. We establish an equivalence between path-wise and particle-wise SMC: the Girsanov path weight admits a backward conditional expectation that recovers the previous particle-level weights, guaranteeing that both schemes produce the same unbiased terminal law. Empirically, \texttt{URGE} outperforms existing inference-time guidance baselines on synthetic tests and diffusion-model benchmarks, achieving better generation quality, while being significantly simpler to implement and fully gradient-free.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

URGE gives a simpler gradient-free path for inference-time scaling in diffusions via Girsanov-weighted SMC and claims an equivalence to particle schemes, but the continuous-time unbiasedness argument needs checking against the discrete steps actually used in code.

read the letter

The main thing to know is that this paper introduces URGE, a derivative-free algorithm that attaches Girsanov path weights to diffusion trajectories and resamples periodically to improve task-specific generation without gradients or scores. They also prove an equivalence showing that the path-wise weights recover the earlier particle-wise weights through a backward conditional expectation, so both approaches target the same unbiased terminal law. That combination is the core offering. What the work does well is keep the method simple to implement and demonstrate empirical gains over existing guidance baselines on synthetic examples and diffusion benchmarks, with better sample quality at lower overhead. The equivalence result is a clean theoretical link that could help unify some of the SMC literature on path measures. The soft spot is the discretization gap. The equivalence rests on continuous-time Girsanov change of measure and path measures, but diffusion models are always run with a finite number of Euler or similar steps. In discrete time the Radon-Nikodym derivative becomes a product of transition densities, and the backward conditional expectation does not automatically commute with that approximation. The abstract and stress-test note give no separate discrete proof or error bound, so the unbiasedness guarantee may not transfer exactly to the algorithm people would actually code and run. If the full paper supplies a discrete-time version or shows the bias is small in practice, that closes the issue; otherwise it is a real but fixable limitation rather than a fatal one. This paper is for people working on sampling improvements in generative models who want lighter-weight alternatives to gradient-based guidance. Readers focused on SMC applications or practical scaling would get the most out of the equivalence and the implementation details. It is coherent enough and novel enough to deserve a serious referee, even if revisions will likely be needed on the discrete analysis. I would send it to peer review.

Referee Report

1 major / 2 minor

Summary. The paper introduces URGE, an inference-time scaling method for diffusion models that performs path-wise importance reweighting via a Girsanov change of measure on path measures. It claims this approach is derivative-free (no score, Hessian, or PDE evaluations), establishes an equivalence between path-wise and particle-wise SMC via a backward conditional expectation that recovers particle weights, and guarantees the same unbiased terminal law. Empirical results show outperformance over existing guidance baselines on synthetic tests and diffusion benchmarks.

Significance. If the central equivalence and unbiasedness claims hold under the discrete-time discretizations actually used in implementations, the method provides a simpler, fully gradient-free alternative to gradient-based guidance techniques, reducing computational overhead while improving generation quality. The self-contained equivalence result (without reducing to fitted quantities) would be a useful theoretical contribution for SMC applications in generative modeling.

major comments (1)

[Theoretical analysis of equivalence (around the Girsanov path weight and backward conditional expectation)] The equivalence result (abstract and theoretical analysis) relies on a continuous-time Girsanov Radon-Nikodym derivative and backward conditional expectation to recover exact particle weights. However, all reported experiments and algorithms use finite-step Euler-Maruyama discretization. In discrete time the weight is a product of transition densities rather than an exponential martingale, and the conditional expectation does not necessarily commute with the discretization operator. No separate discrete-time proof, commutation argument, or error bound is provided to confirm that the unbiased terminal marginal is preserved exactly in the implemented algorithm.

minor comments (2)

[Abstract] The abstract refers to 'synthetic tests and diffusion-model benchmarks' without naming the specific datasets, metrics (e.g., FID, precision/recall), or number of diffusion steps; these details should be stated explicitly for reproducibility.
[Method section] Notation for the path measure and the Girsanov weight should be introduced with a clear definition of the underlying probability spaces before the equivalence statement.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their careful reading of the manuscript and for recognizing the potential value of URGE as a derivative-free inference-time scaling approach. We address the major comment on the theoretical equivalence below and outline the revisions we will make.

read point-by-point responses

Referee: The equivalence result (abstract and theoretical analysis) relies on a continuous-time Girsanov Radon-Nikodym derivative and backward conditional expectation to recover exact particle weights. However, all reported experiments and algorithms use finite-step Euler-Maruyama discretization. In discrete time the weight is a product of transition densities rather than an exponential martingale, and the conditional expectation does not necessarily commute with the discretization operator. No separate discrete-time proof, commutation argument, or error bound is provided to confirm that the unbiased terminal marginal is preserved exactly in the implemented algorithm.

Authors: We thank the referee for this precise observation. The core equivalence is established in continuous time via the Girsanov Radon-Nikodym derivative on path space and the tower property of conditional expectations, which recovers the particle weights exactly. In the discrete Euler-Maruyama setting actually implemented, the path weight is indeed the product of per-step transition Radon-Nikodym factors. Because the backward conditional expectation is applied recursively at each discrete time index using the same transition kernels that define the discretization, the equivalence between path-wise and particle-wise weights continues to hold exactly for the finite-step process; the terminal marginal therefore remains unbiased under the implemented algorithm. Nevertheless, we agree that an explicit discrete-time statement strengthens the presentation. We will add a new subsection (and corresponding appendix) that states the equivalence directly for the Euler-discretized process, provides the recursive commutation argument, and includes a brief discussion of the approximation error to the continuous-time limit. These additions will be self-contained and will not alter the existing continuous-time analysis. revision: yes

Circularity Check

0 steps flagged

No circularity: equivalence claim is a derived mathematical property

full rationale

The paper's central claim is an equivalence between path-wise and particle-wise SMC established via the Girsanov path weight admitting a backward conditional expectation that recovers particle weights, yielding the same unbiased terminal law. This is presented as a consequence of the change-of-measure construction rather than a self-definition or a fitted quantity renamed as a prediction. No equations in the abstract reduce the terminal law to its inputs by construction, and the method is described as derivative-free without invoking self-citations as load-bearing uniqueness theorems. The derivation remains self-contained against standard SMC and Girsanov theory; any discretization concerns affect correctness but do not indicate circularity in the claimed chain.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Only the abstract is available, so the ledger records the minimal structural assumptions needed for the stated claims to hold; no explicit free parameters or invented entities are mentioned.

axioms (1)

domain assumption The underlying diffusion process admits a Girsanov change of measure between the reference and guided path measures.
Required for the path-wise importance weights to be well-defined and for the backward conditional expectation equivalence to recover particle weights.

pith-pipeline@v0.9.0 · 5757 in / 1341 out tokens · 28958 ms · 2026-05-20T01:08:06.561230+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We establish an equivalence between path-wise and particle-wise SMC: the Girsanov path weight admits a backward conditional expectation that recovers the previous particle-level weights
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean J_uniquely_calibrated_via_higher_derivative unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

URGE attaches a simple multiplicative weight to each simulated trajectory and periodically resamples. No score, no Hessian, and no PDE evaluation is required.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

84 extracted references · 84 canonical work pages · 11 internal anchors

[1]

Hongrui Chen, Holden Lee, and Jianfeng Lu

Solving inverse problems via diffusion-based priors: An approximation-free ensemble sampling approach , author=. arXiv preprint arXiv:2506.03979 , year=

work page arXiv
[2]

and Lu, Y

On the Power of (Approximate) Reward Models for Inference-Time Scaling , author=. arXiv preprint arXiv:2602.01381 , year=

work page arXiv
[3]

arXiv preprint arXiv:2504.16172 , year=

Physics-Informed Inference Time Scaling via Simulation-Calibrated Scientific Machine Learning , author=. arXiv preprint arXiv:2504.16172 , year=

work page arXiv
[4]

Training-free adaptation of diffusion models via doob’sh-transform.arXiv preprint arXiv:2602.16198, 2026

Training-Free Adaptation of Diffusion Models via Doob's h -Transform , author=. arXiv preprint arXiv:2602.16198 , year=

work page arXiv
[5]

2026 , url=

Wei, Lifu Wei and Ren, Yinuo and Shi, Naichen and Lu, Yiping , booktitle=. 2026 , url=

work page 2026
[6]

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Flow straight and fast: Learning to generate and transfer data with rectified flow , author=. arXiv preprint arXiv:2209.03003 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[7]

arXiv preprint arXiv:2501.09685 , year=

Inference-time alignment in diffusion models with reward-guided generation: Tutorial and review , author=. arXiv preprint arXiv:2501.09685 , year=

work page arXiv
[8]

Flow Matching for Generative Modeling

Flow matching for generative modeling , author=. arXiv preprint arXiv:2210.02747 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[9]

Advances in neural information processing systems , volume=

Video diffusion models , author=. Advances in neural information processing systems , volume=

work page
[10]

Advances in neural information processing systems , volume=

Protein design with guided discrete diffusion , author=. Advances in neural information processing systems , volume=

work page
[11]

Large Language Diffusion Models

Large language diffusion models , author=. arXiv preprint arXiv:2502.09992 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[12]

Advances in neural information processing systems , volume=

Diffusion-lm improves controllable text generation , author=. Advances in neural information processing systems , volume=

work page
[13]

Advances in neural information processing systems , volume=

Denoising diffusion probabilistic models , author=. Advances in neural information processing systems , volume=

work page
[14]

International conference on machine learning , pages=

Deep unsupervised learning using nonequilibrium thermodynamics , author=. International conference on machine learning , pages=. 2015 , organization=

work page 2015
[15]

Score-Based Generative Modeling through Stochastic Differential Equations

Score-based generative modeling through stochastic differential equations , author=. arXiv preprint arXiv:2011.13456 , year=

work page internal anchor Pith review Pith/arXiv arXiv 2011
[16]

1999 , publisher=

Continuous Martingales and Brownian Motion , author=. 1999 , publisher=

work page 1999
[17]

The Annals of Probability , volume=

Time Reversal of Diffusions , author=. The Annals of Probability , volume=

work page
[18]

Feynman-kac correctors in diffusion: Annealing, guidance, and product of experts.arXiv preprint arXiv:2503.02819, 2025

Feynman-kac correctors in diffusion: Annealing, guidance, and product of experts , author=. arXiv preprint arXiv:2503.02819 , year=

work page arXiv
[19]

A general framework for inference-time scaling and steering of diffusion models.arXiv preprint arXiv:2501.06848, 2025

A general framework for inference-time scaling and steering of diffusion models , author=. arXiv preprint arXiv:2501.06848 , year=

work page arXiv
[20]

Journal of Mathematical Imaging and Vision , volume=

Sliced and radon wasserstein barycenters of measures , author=. Journal of Mathematical Imaging and Vision , volume=. 2015 , publisher=

work page 2015
[21]

The Fourteenth International Conference on Learning Representations , year=

Physics-Informed Inference Time Scaling for Solving High-Dimensional Partial Differential Equations , author=. The Fourteenth International Conference on Learning Representations , year=

work page
[22]

Guidance with spherical gaussian constraint for condi- tional diffusion.arXiv preprint arXiv:2402.03201,

Guidance with spherical gaussian constraint for conditional diffusion , author=. arXiv preprint arXiv:2402.03201 , year=

work page arXiv
[23]

arXiv preprint arXiv:2408.08252 , year =

Derivative-free guidance in continuous and discrete diffusion models with soft value-based decoding , author=. arXiv preprint arXiv:2408.08252 , year=

work page arXiv
[24]

13th international conference, ICONIP , volume=

Maximum mean discrepancy , author=. 13th international conference, ICONIP , volume=

work page
[25]

Proceedings of the 2021 conference on empirical methods in natural language processing , pages=

Clipscore: A reference-free evaluation metric for image captioning , author=. Proceedings of the 2021 conference on empirical methods in natural language processing , pages=

work page 2021
[26]

IEEE transactions on image processing , volume=

Image quality assessment: from error visibility to structural similarity , author=. IEEE transactions on image processing , volume=. 2004 , publisher=

work page 2004
[27]

Proceedings of the IEEE conference on computer vision and pattern recognition , pages=

The unreasonable effectiveness of deep features as a perceptual metric , author=. Proceedings of the IEEE conference on computer vision and pattern recognition , pages=

work page
[28]

Journal of Machine Learning Research , volume=

Stochastic interpolants: A unifying framework for flows and diffusions , author=. Journal of Machine Learning Research , volume=

work page
[29]

Albergo, Carles Domingo-Enrich, Nicholas M

Test-time scaling of diffusions with flow maps , author=. arXiv preprint arXiv:2511.22688 , year=

work page arXiv
[30]

Classifier-Free Diffusion Guidance

Classifier-Free Diffusion Guidance , author =. arXiv preprint arXiv:2207.12598 , year =

work page internal anchor Pith review Pith/arXiv arXiv
[31]

Advances in Neural Information Processing Systems (NeurIPS) , year =

Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention , author =. Advances in Neural Information Processing Systems (NeurIPS) , year =

work page
[32]

Advances in Neural Information Processing Systems (NeurIPS) , year =

Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models , author =. Advances in Neural Information Processing Systems (NeurIPS) , year =

work page
[33]

Trippe, Christian A

Practical and Asymptotically Exact Conditional Sampling in Diffusion Models , author =. arXiv preprint arXiv:2306.17775 , year =

work page arXiv
[34]

arXiv preprint arXiv:2312.12487 , year =

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models , author =. arXiv preprint arXiv:2312.12487 , year =

work page arXiv
[35]

Driftlite: Lightweight drift control for inference-time scaling of diffusion models.arXiv preprint arXiv:2509.21655, 2025

Driftlite: Lightweight drift control for inference-time scaling of diffusion models , author=. arXiv preprint arXiv:2509.21655 , year=

work page arXiv
[36]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page
[37]

Advances in Neural Information Processing Systems , volume=

DEFT: Efficient Fine-tuning of Diffusion Models by Learning the Generalised h -transform , author=. Advances in Neural Information Processing Systems , volume=

work page
[38]

Columbia University Preprint , year=

A stochastic analysis approach to conditional diffusion guidance , author=. Columbia University Preprint , year=

work page
[39]

No training, no problem: Rethinking classifier-free guidance for diffusion models.arXiv preprint arXiv:2407.02687, 2024

No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models , author =. arXiv preprint arXiv:2407.02687 , year =

work page arXiv
[40]

and Tang, S

Pardoux, E. and Tang, S. , title =. Probab. Theory Relat. Fields , volume =

work page
[41]

, title =

Karatzas, Ioannis and Shreve, Steven E. , title =. 1991 , publisher =

work page 1991
[42]

Rogers, L. C. G. and Williams, David , title =. 2000 , publisher =

work page 2000
[43]

and Kurtz, Thomas G

Ethier, Stewart N. and Kurtz, Thomas G. , title =. 1986 , publisher =

work page 1986
[44]

1964 , publisher =

Friedman, Avner , title =. 1964 , publisher =

work page 1964
[45]

, title =

Evans, Lawrence C. , title =. 2010 , publisher =

work page 2010
[46]

1981 , pages =

Ikeda, Nobuyuki and Watanabe, Shinzo , title =. 1981 , pages =

work page 1981
[47]

Advances in Neural Information Processing Systems (NeurIPS) 2023 , year =

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation , author =. Advances in Neural Information Processing Systems (NeurIPS) 2023 , year =

work page 2023
[48]

Advances in Neural Information Processing Systems (NeurIPS) 2022 , year =

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding , author =. Advances in Neural Information Processing Systems (NeurIPS) 2022 , year =

work page 2022
[49]

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Glide: Towards photorealistic image generation and editing with text-guided diffusion models , author=. arXiv preprint arXiv:2112.10741 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[50]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

High-resolution image synthesis with latent diffusion models , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page
[51]

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Sdxl: Improving latent diffusion models for high-resolution image synthesis , author=. arXiv preprint arXiv:2307.01952 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[52]

Advances in neural information processing systems , volume=

Diffusion models beat gans on image synthesis , author=. Advances in neural information processing systems , volume=

work page
[53]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =

Diffusion Model Alignment Using Direct Preference Optimization , author =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =

work page
[54]

Advances in Neural Information Processing Systems , volume =

Direct Preference Optimization: Your Language Model Is Secretly a Reward Model , author =. Advances in Neural Information Processing Systems , volume =

work page
[55]

International Conference on Learning Representations (ICLR) 2021 , year =

Denoising Diffusion Implicit Models , author =. International Conference on Learning Representations (ICLR) 2021 , year =

work page 2021
[56]

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2023 , year =

Better Aligning Text-to-Image Models with Human Preference , author =. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2023 , year =

work page 2023
[57]

arXiv preprint arXiv:2503.06884 , year =

Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help , author =. arXiv preprint arXiv:2503.06884 , year =

work page arXiv
[58]

NeurIPS Datasets and Benchmarks Track , year =

GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment , author =. NeurIPS Datasets and Benchmarks Track , year =

work page
[59]

Billingsley, Patrick , title =

work page
[60]

International Journal of Stochastic Analysis , year =

Yamada–Watanabe Results for Stochastic Differential Equations with Jumps , author =. International Journal of Stochastic Analysis , year =

work page
[61]

Stochastic Analysis with Applications to Mathematical Finance , editor =

Glasserman, Paul and Merener, Nicolas , title =. Stochastic Analysis with Applications to Mathematical Finance , editor =. 2004 , pages =

work page 2004
[62]

ACM Transactions on Modeling and Computer Simulation , volume =

Djehiche, Boualem and Hult, Henrik and Nyquist, Pierre , title =. ACM Transactions on Modeling and Computer Simulation , volume =

work page
[63]

Geometric ergodicity of Rao and Teh's algorithm for Markov jump processes , journal =

Miasojedow, B. Geometric ergodicity of Rao and Teh's algorithm for Markov jump processes , journal =

work page
[64]

Signal Processing , volume=

Effective sample size for importance sampling based on discrepancy measures , author=. Signal Processing , volume=. 2017 , publisher=

work page 2017
[65]

Journal of the Royal Statistical Society Series B , volume=

Sequential Monte Carlo samplers , author=. Journal of the Royal Statistical Society Series B , volume=. 2006 , publisher=

work page 2006
[66]

Bernoulli , volume=

On adaptive resampling strategies for sequential Monte Carlo methods , author=. Bernoulli , volume=. 2012 , publisher=

work page 2012
[67]

Feynman--Kac Formulae: Genealogical and Interacting Particle Systems with Applications , author =

work page
[68]

Del Moral, Pierre and Miclo, Laurent , title =. S. 2000 , publisher =

work page 2000
[69]

arXiv preprint arXiv:2410.03601 , year =

How discrete and continuous diffusion meet: Comprehensive analysis of discrete diffusion models via a stochastic integral framework , author =. arXiv preprint arXiv:2410.03601 , year =

work page arXiv
[70]

A Unified Approach to Analysis and Design of Denoising Markov Models

A unified approach to analysis and design of denoising markov models , author =. arXiv preprint arXiv:2504.01938 , year =

work page internal anchor Pith review Pith/arXiv arXiv
[71]

Applebaum, David , year =. L

work page
[72]

Adjoint sampling: Highly scalable diffusion samplers via adjoint matching.arXiv preprint arXiv:2504.11713, 2025

Adjoint sampling: Highly scalable diffusion samplers via adjoint matching , author =. arXiv preprint arXiv:2504.11713 , year =

work page arXiv
[73]

Adjoint Schr

Liu, Guan-Horng and Choi, Jaemoo and Chen, Yongxin and Miller, Benjamin Kurt and Chen, Ricky TQ , journal =. Adjoint Schr

work page
[74]

Adjoint matching: Fine- tuning flow and diffusion generative models with memoryless stochastic optimal control.arXiv preprint arXiv:2409.08861, 2024

Adjoint matching: Fine-tuning flow and diffusion generative models with memoryless stochastic optimal control , author =. arXiv preprint arXiv:2409.08861 , year =

work page arXiv
[75]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages =

Karras, Tero and Laine, Samuli and Aila, Timo , title =. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages =

work page
[76]

2009 IEEE conference on computer vision and pattern recognition , pages =

Deng, Jia and Dong, Wei and Socher, Richard and Li, Li-Jia and Li, Kai and Li, Fei-Fei , title =. 2009 IEEE conference on computer vision and pattern recognition , pages =

work page 2009
[77]

RNE: plug-and-play diffusion inference-time control and energy-based training

RNE: a plug-and-play framework for diffusion density estimation and inference-time control , author=. arXiv preprint arXiv:2506.05668 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[78]

Debiasing guidance for discrete diffusion with sequential monte carlo.arXiv preprint arXiv:2502.06079, 2025

Debiasing guidance for discrete diffusion with sequential monte carlo , author=. arXiv preprint arXiv:2502.06079 , year=

work page arXiv
[79]

Monge-Amp\`ere Flow for Generative Modeling

Monge-amp\`ere flow for generative modeling , author=. arXiv preprint arXiv:1809.10188 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[80]

Advances in neural information processing systems , volume=

Generative modeling by estimating gradients of the data distribution , author=. Advances in neural information processing systems , volume=

work page

Showing first 80 references.

[1] [1]

Hongrui Chen, Holden Lee, and Jianfeng Lu

Solving inverse problems via diffusion-based priors: An approximation-free ensemble sampling approach , author=. arXiv preprint arXiv:2506.03979 , year=

work page arXiv

[2] [2]

and Lu, Y

On the Power of (Approximate) Reward Models for Inference-Time Scaling , author=. arXiv preprint arXiv:2602.01381 , year=

work page arXiv

[3] [3]

arXiv preprint arXiv:2504.16172 , year=

Physics-Informed Inference Time Scaling via Simulation-Calibrated Scientific Machine Learning , author=. arXiv preprint arXiv:2504.16172 , year=

work page arXiv

[4] [4]

Training-free adaptation of diffusion models via doob’sh-transform.arXiv preprint arXiv:2602.16198, 2026

Training-Free Adaptation of Diffusion Models via Doob's h -Transform , author=. arXiv preprint arXiv:2602.16198 , year=

work page arXiv

[5] [5]

2026 , url=

Wei, Lifu Wei and Ren, Yinuo and Shi, Naichen and Lu, Yiping , booktitle=. 2026 , url=

work page 2026

[6] [6]

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

Flow straight and fast: Learning to generate and transfer data with rectified flow , author=. arXiv preprint arXiv:2209.03003 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[7] [7]

arXiv preprint arXiv:2501.09685 , year=

Inference-time alignment in diffusion models with reward-guided generation: Tutorial and review , author=. arXiv preprint arXiv:2501.09685 , year=

work page arXiv

[8] [8]

Flow Matching for Generative Modeling

Flow matching for generative modeling , author=. arXiv preprint arXiv:2210.02747 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[9] [9]

Advances in neural information processing systems , volume=

Video diffusion models , author=. Advances in neural information processing systems , volume=

work page

[10] [10]

Advances in neural information processing systems , volume=

Protein design with guided discrete diffusion , author=. Advances in neural information processing systems , volume=

work page

[11] [11]

Large Language Diffusion Models

Large language diffusion models , author=. arXiv preprint arXiv:2502.09992 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[12] [12]

Advances in neural information processing systems , volume=

Diffusion-lm improves controllable text generation , author=. Advances in neural information processing systems , volume=

work page

[13] [13]

Advances in neural information processing systems , volume=

Denoising diffusion probabilistic models , author=. Advances in neural information processing systems , volume=

work page

[14] [14]

International conference on machine learning , pages=

Deep unsupervised learning using nonequilibrium thermodynamics , author=. International conference on machine learning , pages=. 2015 , organization=

work page 2015

[15] [15]

Score-Based Generative Modeling through Stochastic Differential Equations

Score-based generative modeling through stochastic differential equations , author=. arXiv preprint arXiv:2011.13456 , year=

work page internal anchor Pith review Pith/arXiv arXiv 2011

[16] [16]

1999 , publisher=

Continuous Martingales and Brownian Motion , author=. 1999 , publisher=

work page 1999

[17] [17]

The Annals of Probability , volume=

Time Reversal of Diffusions , author=. The Annals of Probability , volume=

work page

[18] [18]

Feynman-kac correctors in diffusion: Annealing, guidance, and product of experts.arXiv preprint arXiv:2503.02819, 2025

Feynman-kac correctors in diffusion: Annealing, guidance, and product of experts , author=. arXiv preprint arXiv:2503.02819 , year=

work page arXiv

[19] [19]

A general framework for inference-time scaling and steering of diffusion models.arXiv preprint arXiv:2501.06848, 2025

A general framework for inference-time scaling and steering of diffusion models , author=. arXiv preprint arXiv:2501.06848 , year=

work page arXiv

[20] [20]

Journal of Mathematical Imaging and Vision , volume=

Sliced and radon wasserstein barycenters of measures , author=. Journal of Mathematical Imaging and Vision , volume=. 2015 , publisher=

work page 2015

[21] [21]

The Fourteenth International Conference on Learning Representations , year=

Physics-Informed Inference Time Scaling for Solving High-Dimensional Partial Differential Equations , author=. The Fourteenth International Conference on Learning Representations , year=

work page

[22] [22]

Guidance with spherical gaussian constraint for condi- tional diffusion.arXiv preprint arXiv:2402.03201,

Guidance with spherical gaussian constraint for conditional diffusion , author=. arXiv preprint arXiv:2402.03201 , year=

work page arXiv

[23] [23]

arXiv preprint arXiv:2408.08252 , year =

Derivative-free guidance in continuous and discrete diffusion models with soft value-based decoding , author=. arXiv preprint arXiv:2408.08252 , year=

work page arXiv

[24] [24]

13th international conference, ICONIP , volume=

Maximum mean discrepancy , author=. 13th international conference, ICONIP , volume=

work page

[25] [25]

Proceedings of the 2021 conference on empirical methods in natural language processing , pages=

Clipscore: A reference-free evaluation metric for image captioning , author=. Proceedings of the 2021 conference on empirical methods in natural language processing , pages=

work page 2021

[26] [26]

IEEE transactions on image processing , volume=

Image quality assessment: from error visibility to structural similarity , author=. IEEE transactions on image processing , volume=. 2004 , publisher=

work page 2004

[27] [27]

Proceedings of the IEEE conference on computer vision and pattern recognition , pages=

The unreasonable effectiveness of deep features as a perceptual metric , author=. Proceedings of the IEEE conference on computer vision and pattern recognition , pages=

work page

[28] [28]

Journal of Machine Learning Research , volume=

Stochastic interpolants: A unifying framework for flows and diffusions , author=. Journal of Machine Learning Research , volume=

work page

[29] [29]

Albergo, Carles Domingo-Enrich, Nicholas M

Test-time scaling of diffusions with flow maps , author=. arXiv preprint arXiv:2511.22688 , year=

work page arXiv

[30] [30]

Classifier-Free Diffusion Guidance

Classifier-Free Diffusion Guidance , author =. arXiv preprint arXiv:2207.12598 , year =

work page internal anchor Pith review Pith/arXiv arXiv

[31] [31]

Advances in Neural Information Processing Systems (NeurIPS) , year =

Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention , author =. Advances in Neural Information Processing Systems (NeurIPS) , year =

work page

[32] [32]

Advances in Neural Information Processing Systems (NeurIPS) , year =

Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models , author =. Advances in Neural Information Processing Systems (NeurIPS) , year =

work page

[33] [33]

Trippe, Christian A

Practical and Asymptotically Exact Conditional Sampling in Diffusion Models , author =. arXiv preprint arXiv:2306.17775 , year =

work page arXiv

[34] [34]

arXiv preprint arXiv:2312.12487 , year =

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models , author =. arXiv preprint arXiv:2312.12487 , year =

work page arXiv

[35] [35]

Driftlite: Lightweight drift control for inference-time scaling of diffusion models.arXiv preprint arXiv:2509.21655, 2025

Driftlite: Lightweight drift control for inference-time scaling of diffusion models , author=. arXiv preprint arXiv:2509.21655 , year=

work page arXiv

[36] [36]

Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform , author=. Proceedings of the Computer Vision and Pattern Recognition Conference , pages=

work page

[37] [37]

Advances in Neural Information Processing Systems , volume=

DEFT: Efficient Fine-tuning of Diffusion Models by Learning the Generalised h -transform , author=. Advances in Neural Information Processing Systems , volume=

work page

[38] [38]

Columbia University Preprint , year=

A stochastic analysis approach to conditional diffusion guidance , author=. Columbia University Preprint , year=

work page

[39] [39]

No training, no problem: Rethinking classifier-free guidance for diffusion models.arXiv preprint arXiv:2407.02687, 2024

No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models , author =. arXiv preprint arXiv:2407.02687 , year =

work page arXiv

[40] [40]

and Tang, S

Pardoux, E. and Tang, S. , title =. Probab. Theory Relat. Fields , volume =

work page

[41] [41]

, title =

Karatzas, Ioannis and Shreve, Steven E. , title =. 1991 , publisher =

work page 1991

[42] [42]

Rogers, L. C. G. and Williams, David , title =. 2000 , publisher =

work page 2000

[43] [43]

and Kurtz, Thomas G

Ethier, Stewart N. and Kurtz, Thomas G. , title =. 1986 , publisher =

work page 1986

[44] [44]

1964 , publisher =

Friedman, Avner , title =. 1964 , publisher =

work page 1964

[45] [45]

, title =

Evans, Lawrence C. , title =. 2010 , publisher =

work page 2010

[46] [46]

1981 , pages =

Ikeda, Nobuyuki and Watanabe, Shinzo , title =. 1981 , pages =

work page 1981

[47] [47]

Advances in Neural Information Processing Systems (NeurIPS) 2023 , year =

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation , author =. Advances in Neural Information Processing Systems (NeurIPS) 2023 , year =

work page 2023

[48] [48]

Advances in Neural Information Processing Systems (NeurIPS) 2022 , year =

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding , author =. Advances in Neural Information Processing Systems (NeurIPS) 2022 , year =

work page 2022

[49] [49]

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Glide: Towards photorealistic image generation and editing with text-guided diffusion models , author=. arXiv preprint arXiv:2112.10741 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[50] [50]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

High-resolution image synthesis with latent diffusion models , author=. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

work page

[51] [51]

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Sdxl: Improving latent diffusion models for high-resolution image synthesis , author=. arXiv preprint arXiv:2307.01952 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[52] [52]

Advances in neural information processing systems , volume=

Diffusion models beat gans on image synthesis , author=. Advances in neural information processing systems , volume=

work page

[53] [53]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =

Diffusion Model Alignment Using Direct Preference Optimization , author =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages =

work page

[54] [54]

Advances in Neural Information Processing Systems , volume =

Direct Preference Optimization: Your Language Model Is Secretly a Reward Model , author =. Advances in Neural Information Processing Systems , volume =

work page

[55] [55]

International Conference on Learning Representations (ICLR) 2021 , year =

Denoising Diffusion Implicit Models , author =. International Conference on Learning Representations (ICLR) 2021 , year =

work page 2021

[56] [56]

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2023 , year =

Better Aligning Text-to-Image Models with Human Preference , author =. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2023 , year =

work page 2023

[57] [57]

arXiv preprint arXiv:2503.06884 , year =

Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help , author =. arXiv preprint arXiv:2503.06884 , year =

work page arXiv

[58] [58]

NeurIPS Datasets and Benchmarks Track , year =

GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment , author =. NeurIPS Datasets and Benchmarks Track , year =

work page

[59] [59]

Billingsley, Patrick , title =

work page

[60] [60]

International Journal of Stochastic Analysis , year =

Yamada–Watanabe Results for Stochastic Differential Equations with Jumps , author =. International Journal of Stochastic Analysis , year =

work page

[61] [61]

Stochastic Analysis with Applications to Mathematical Finance , editor =

Glasserman, Paul and Merener, Nicolas , title =. Stochastic Analysis with Applications to Mathematical Finance , editor =. 2004 , pages =

work page 2004

[62] [62]

ACM Transactions on Modeling and Computer Simulation , volume =

Djehiche, Boualem and Hult, Henrik and Nyquist, Pierre , title =. ACM Transactions on Modeling and Computer Simulation , volume =

work page

[63] [63]

Geometric ergodicity of Rao and Teh's algorithm for Markov jump processes , journal =

Miasojedow, B. Geometric ergodicity of Rao and Teh's algorithm for Markov jump processes , journal =

work page

[64] [64]

Signal Processing , volume=

Effective sample size for importance sampling based on discrepancy measures , author=. Signal Processing , volume=. 2017 , publisher=

work page 2017

[65] [65]

Journal of the Royal Statistical Society Series B , volume=

Sequential Monte Carlo samplers , author=. Journal of the Royal Statistical Society Series B , volume=. 2006 , publisher=

work page 2006

[66] [66]

Bernoulli , volume=

On adaptive resampling strategies for sequential Monte Carlo methods , author=. Bernoulli , volume=. 2012 , publisher=

work page 2012

[67] [67]

Feynman--Kac Formulae: Genealogical and Interacting Particle Systems with Applications , author =

work page

[68] [68]

Del Moral, Pierre and Miclo, Laurent , title =. S. 2000 , publisher =

work page 2000

[69] [69]

arXiv preprint arXiv:2410.03601 , year =

How discrete and continuous diffusion meet: Comprehensive analysis of discrete diffusion models via a stochastic integral framework , author =. arXiv preprint arXiv:2410.03601 , year =

work page arXiv

[70] [70]

A Unified Approach to Analysis and Design of Denoising Markov Models

A unified approach to analysis and design of denoising markov models , author =. arXiv preprint arXiv:2504.01938 , year =

work page internal anchor Pith review Pith/arXiv arXiv

[71] [71]

Applebaum, David , year =. L

work page

[72] [72]

Adjoint sampling: Highly scalable diffusion samplers via adjoint matching.arXiv preprint arXiv:2504.11713, 2025

Adjoint sampling: Highly scalable diffusion samplers via adjoint matching , author =. arXiv preprint arXiv:2504.11713 , year =

work page arXiv

[73] [73]

Adjoint Schr

Liu, Guan-Horng and Choi, Jaemoo and Chen, Yongxin and Miller, Benjamin Kurt and Chen, Ricky TQ , journal =. Adjoint Schr

work page

[74] [74]

Adjoint matching: Fine- tuning flow and diffusion generative models with memoryless stochastic optimal control.arXiv preprint arXiv:2409.08861, 2024

Adjoint matching: Fine-tuning flow and diffusion generative models with memoryless stochastic optimal control , author =. arXiv preprint arXiv:2409.08861 , year =

work page arXiv

[75] [75]

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages =

Karras, Tero and Laine, Samuli and Aila, Timo , title =. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages =

work page

[76] [76]

2009 IEEE conference on computer vision and pattern recognition , pages =

Deng, Jia and Dong, Wei and Socher, Richard and Li, Li-Jia and Li, Kai and Li, Fei-Fei , title =. 2009 IEEE conference on computer vision and pattern recognition , pages =

work page 2009

[77] [77]

RNE: plug-and-play diffusion inference-time control and energy-based training

RNE: a plug-and-play framework for diffusion density estimation and inference-time control , author=. arXiv preprint arXiv:2506.05668 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[78] [78]

Debiasing guidance for discrete diffusion with sequential monte carlo.arXiv preprint arXiv:2502.06079, 2025

Debiasing guidance for discrete diffusion with sequential monte carlo , author=. arXiv preprint arXiv:2502.06079 , year=

work page arXiv

[79] [79]

Monge-Amp\`ere Flow for Generative Modeling

Monge-amp\`ere flow for generative modeling , author=. arXiv preprint arXiv:1809.10188 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[80] [80]

Advances in neural information processing systems , volume=

Generative modeling by estimating gradients of the data distribution , author=. Advances in neural information processing systems , volume=

work page