A note on connections between the F\"ollmer process and the denoising diffusion probabilistic model

Yuta Koike

arxiv: 2605.18040 · v1 · pith:NCBBW2MQnew · submitted 2026-05-18 · 📊 stat.ML · cs.LG· math.PR

A note on connections between the F\"ollmer process and the denoising diffusion probabilistic model

Yuta Koike This is my paper

Pith reviewed 2026-05-20 00:37 UTC · model grok-4.3

classification 📊 stat.ML cs.LGmath.PR

keywords Föllmer processDDPM samplerdiffusion probabilistic modelssampling error boundsreverse SDEconditioned Brownian motiondiscretization methods

0 comments

The pith

Discretized Föllmer processes supply natural hyper-parameter settings for the DDPM sampler.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper connects the Föllmer process, a Brownian motion forced to follow a chosen distribution at the end of its run, to the reverse dynamics in denoising diffusion probabilistic models. It shows that taking discrete steps along this conditioned process produces the same sampling procedure as DDPM when the noise schedule is chosen accordingly. This view lets researchers recover the best known guarantees on how close the samples stay to the target distribution, and even tighten them a little. The result matters because it replaces ad-hoc choices of step sizes and variances with a construction that comes directly from the conditioned Brownian motion.

Core claim

The discretized Föllmer process gives natural hyper-parameter settings of the DDPM sampler. This connection allows systematic recovery of state-of-the-art results on DDPM sampling error bounds with slight improvements.

What carries the argument

The Föllmer process, which is Brownian motion conditioned to a pre-specified distribution at time 1, acting as an augmented time-compressed version of the DDPM reverse SDE whose discretization matches the DDPM sampler.

If this is right

Hyper-parameters in DDPM arise directly from the discretization of the Föllmer process.
Sampling error bounds from prior work are recovered under the same assumptions on the target measure.
Slight improvements to those bounds follow from the unified analysis.
The DDPM reverse-time discretization is exactly reproduced by the Föllmer discretization without additional error.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the exact match holds, then results on Föllmer processes can be translated to give new insights into DDPM convergence rates.
Similar connections might be explored for other diffusion models that use reverse SDEs.
Practical implementations could test whether the improved bounds translate to better sample quality in finite steps.

Load-bearing premise

The continuous-time Föllmer process can be discretized to exactly reproduce the DDPM reverse-time discretization without introducing extra approximation error.

What would settle it

Verify whether the one-step transition distribution obtained by discretizing the Föllmer process coincides with the Gaussian transition used in the standard DDPM reverse sampler for a given noise level.

read the original abstract

The F\"ollmer process is a Brownian motion conditioned to have a pre-specified distribution at time 1. This process can be interpreted as an "augmented" time-compressed version of the reverse stochastic differential equation (SDE) for the denoising diffusion probabilistic model (DDPM). While this fact has been indirectly used to analyze DDPM sampling errors via discretization of the reverse SDE, connections between direct discretization of the F\"ollmer process and the DDPM sampler have not yet been fully explored. This note aims to clarify this point while surveying relevant results from existing work. We show that discretized F\"ollmer processes give natural hyper-parameter settings of the DDPM sampler. Moreover, this allows us to systematically recover state-of-the-art results on DDPM sampling error bounds with slight improvements.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript is a short note examining links between the Föllmer process (Brownian motion conditioned to a prescribed terminal distribution at t=1) and the reverse-time SDE underlying denoising diffusion probabilistic models (DDPM). It interprets the Föllmer process as an augmented, time-compressed version of the DDPM reverse dynamics and argues that direct discretization of the Föllmer process supplies natural hyper-parameter choices for the DDPM sampler. The note surveys existing DDPM error-bound literature and claims that this perspective systematically recovers state-of-the-art sampling error bounds while yielding modest improvements.

Significance. If the claimed exact equivalence between the discretized Föllmer process and the standard DDPM reverse kernel holds under the same assumptions used in prior DDPM analyses, the note would supply a useful organizing device for hyper-parameter selection and error-bound derivations in diffusion models. It could modestly strengthen the theoretical toolkit for analyzing sampling error without introducing new parameters or assumptions.

major comments (2)

[Section on discretization of the Föllmer process and hyper-parameter settings] The central claim that discretized Föllmer processes reproduce the DDPM reverse-time transition exactly (and thereby recover SOTA bounds with improvements) rests on the discretization step. The manuscript does not supply an explicit error analysis showing that any discretization error (e.g., from Euler–Maruyama on the Doob h-transform or truncation of the conditioned drift) is controlled by the regularity assumptions already present in the cited DDPM literature. This equivalence is load-bearing for both the hyper-parameter interpretation and the bound-recovery claim.
[Discussion of error bounds and comparison with prior work] The abstract states that the approach yields 'slight improvements' over existing DDPM sampling error bounds, yet the manuscript does not quantify these improvements, identify which prior bounds are being tightened, or provide a side-by-side comparison of the resulting constants or rates. Without such detail it is difficult to verify whether the improvements follow directly from the Föllmer perspective or from post-hoc parameter choices.

minor comments (2)

[Background section] Notation for the time-compressed versus original time scales should be introduced more explicitly when the Föllmer process is first defined, to avoid ambiguity when relating it to the standard DDPM reverse SDE.
A short table or explicit list of the recovered hyper-parameter settings (e.g., noise schedule, step sizes) would improve readability and make the 'natural' choices easier to compare with common DDPM implementations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thoughtful and constructive report. The note aims to connect the Föllmer process to DDPM reverse dynamics in order to motivate hyper-parameter choices. We address each major comment below and indicate where revisions will be made to strengthen the presentation.

read point-by-point responses

Referee: [Section on discretization of the Föllmer process and hyper-parameter settings] The central claim that discretized Föllmer processes reproduce the DDPM reverse-time transition exactly (and thereby recover SOTA bounds with improvements) rests on the discretization step. The manuscript does not supply an explicit error analysis showing that any discretization error (e.g., from Euler–Maruyama on the Doob h-transform or truncation of the conditioned drift) is controlled by the regularity assumptions already present in the cited DDPM literature. This equivalence is load-bearing for both the hyper-parameter interpretation and the bound-recovery claim.

Authors: We agree that the discretization step requires clearer justification. The manuscript treats the Föllmer process as the exact continuous-time reverse dynamics (via the Doob h-transform) and applies the same Euler–Maruyama discretization used in the cited DDPM analyses. Under the standard regularity assumptions (Lipschitz continuity of the score and linear growth) already invoked in those works, the local truncation error remains O(Δt) and the global discretization error bound carries over without modification. We will revise the discretization section to include a short remark explicitly stating this inheritance of the error analysis and confirming that no new assumptions are introduced. revision: yes
Referee: [Discussion of error bounds and comparison with prior work] The abstract states that the approach yields 'slight improvements' over existing DDPM sampling error bounds, yet the manuscript does not quantify these improvements, identify which prior bounds are being tightened, or provide a side-by-side comparison of the resulting constants or rates. Without such detail it is difficult to verify whether the improvements follow directly from the Föllmer perspective or from post-hoc parameter choices.

Authors: The claimed modest improvements stem directly from the Föllmer-derived schedule for the time steps and diffusion coefficients, which yields a slightly tighter control on the accumulated discretization and approximation errors compared with generic choices in the surveyed literature. We acknowledge that the current draft does not make the comparison explicit. In the revision we will add a brief table or paragraph that identifies the specific prior bounds (e.g., those obtained via standard DDPM reverse-kernel analyses) and shows the resulting improvement in the leading constants under identical assumptions on the data distribution and score regularity. revision: yes

Circularity Check

0 steps flagged

No circularity detected; Föllmer discretization acts as conceptual organizer for existing DDPM results

full rationale

The paper is a short note whose central contribution is to interpret the discretized Föllmer process as supplying natural hyper-parameter choices for the DDPM sampler and thereby recovering (with minor tightening) previously published sampling-error bounds. No load-bearing step reduces by construction to a fitted parameter, a self-defined quantity, or an unverified self-citation chain. The derivation chain rests on external prior analyses of DDPM reverse SDEs whose assumptions are taken as given; the Föllmer viewpoint is presented as an organizing lens rather than a source of new fitted values or a uniqueness theorem imported from the same authors. Consequently the claimed recovery of SOTA bounds does not collapse into a tautology or re-labeling of the paper’s own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Only the abstract is available; the work appears to rest on standard properties of Brownian motion and conditioned diffusions without introducing new fitted parameters or postulated entities.

axioms (1)

standard math Existence and basic properties of the Föllmer process as a conditioned Brownian motion
Invoked in the definition of the process that is then discretized.

pith-pipeline@v0.9.0 · 5664 in / 1078 out tokens · 35370 ms · 2026-05-20T00:37:33.574171+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We show that discretized Föllmer processes give natural hyper-parameter settings of the DDPM sampler... recover state-of-the-art results on DDPM sampling error bounds

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

61 extracted references · 61 canonical work pages · 2 internal anchors

[1]

Convergence of diffusion models under the manifold hypothesis in high-dimensions.arXiv preprint arXiv:2409.18804, 2024

Azangulov, I., Deligiannidis, G. & Rousseau, J. (2024). Convergence of diffusion models under the manifold hypothesis in high-dimensions. Preprint, arXiv: 2409.18804

work page arXiv 2024
[2]

& Ledoux, M

Bakry, D., Gentil, I. & Ledoux, M. (2014).Analysis and geometry of Markov diffusion operators. Springer

work page 2014
[3]

& Deligiannidis, G

Benton, J., De Bortoli, V ., Doucet, A. & Deligiannidis, G. (2024). Nearlyd-linear convergence bounds for diffu- sion models via stochastic localization. InThe Twelfth International Conference on Learning Representations

work page 2024
[4]

& Theodorou, E

Chen, T., Liu, G.-H. & Theodorou, E. (2022). Likelihood training of Schr¨odinger bridge using forward-backward SDEs theory. InInternational Conference on Learning Representations

work page 2022
[5]

S., Boffi, N

Chen, Y ., Goldstein, M., Hua, M., Albergo, M. S., Boffi, N. M. & Vanden-Eijnden, E. (2024). Probabilistic fore- casting with stochastic interpolants and F ¨ollmer processes. InProceedings of the 41st International Conference on Machine Learning, vol. 235. pp. 6728–6756

work page 2024
[6]

Variational Optimality of F\"ollmer Processes in Generative Diffusions

Chen, Y . & Vanden-Eijnden, E. (2026). Variational optimality of F ¨ollmer processes in generative diffusions. Preprint, arXiv:2602.10989

work page internal anchor Pith review Pith/arXiv arXiv 2026
[7]

Chung, K. L. & Walsh, J. B. (2005).Markov processes, Brownian motion, and time symmetry. Springer, 2nd edn

work page 2005
[8]

& Gentiloni Silveri, M

Conforti, G., Durmus, A. & Gentiloni Silveri, M. (2025). KL convergence guarantees for score diffusion models under minimal data assumptions.SIAM J. Math. Data Sci.7, 86–109

work page 2025
[9]

Dai, Y ., Jiao, Y ., Kang, L. & Lu, X. (2023). Global optimization via Schr ¨odinger–F¨ollmer diffusion.SIAM J. Optim.61, 2953–2980. 29

work page 2023
[10]

Dai Pra, P. (1991). A stochastic control approach to reciprocal diffusion processes.Appl. Math. Optim.23, 313–329

work page 1991
[11]

De Bortoli, V . (2022). Convergence of denoising diffusion models under the manifold hypothesis.Transactions on Machine Learning Research

work page 2022
[12]

& Doucet, A

De Bortoli, V ., Thornton, J., Heng, J. & Doucet, A. (2021). Diffusion Schr ¨odinger bridge with applications to score-based generative modeling. In M. Ranzato, A. Beygelzimer, Y . Dauphin, P. Liang & J. W. Vaughan, eds., Advances in Neural Information Processing Systems, vol. 34. Curran Associates, Inc., pp. 17695–17709

work page 2021
[13]

Efron, B. (2011). Tweedie’s formula and selection bias.J. Amer. Statist. Assoc.106, 1602–1614

work page 2011
[14]

Efron, B. (2024). Machine learning and the James–Stein estimator.Jpn. J. Stat. Data Sci.7, 257–266

work page 2024
[15]

Eldan, R. (2013). Thin shell implies spectral gap up to polylog via a stochastic localization scheme.Geom. Funct. Anal.23, 532–569

work page 2013
[16]

& Lee, J

Eldan, R. & Lee, J. R. (2018). Regularization under diffusion and anticoncentration of the information content. Duke Math. J.167, 969–993

work page 2018
[17]

& Shenfeld, Y

Eldan, R., Lehec, J. & Shenfeld, Y . (2020). Stability of the logarithmic Sobolev inequality via the F ¨ollmer process.Ann. Inst. Henri Poincar ´e Probab. Stat.56, 2253–2269

work page 2020
[18]

& Mikulincer, D

Eldan, R. & Mikulincer, D. (2020). Stability of the Shannon–Stam inequality via the F ¨ollmer process.Probab. Theory Related Fields177, 891–922

work page 2020
[19]

& Zhai, A

Eldan, R., Mikulincer, D. & Zhai, A. (2020). The CLT in high dimensions: quantitative bounds via martingale embedding.Ann. Probab.48, 2494–2524

work page 2020
[20]

& Nakano, Y

Endo, K. & Nakano, Y . (2024). Weak approximation of Schr ¨odinger–F¨ollmer diffusion.Statist. Probab. Lett. 213, 110171

work page 2024
[21]

& Koike, Y

Fang, X. & Koike, Y . (2024). Sharp high-dimensional central limit theorems for log-concave distributions.Ann. Inst. Henri Poincar´e Probab. Stat.60, 2129–2156

work page 2024
[22]

Central limit theorem for high temperature spin models via martingale embedding

Fang, X. & Zhao, Y .-K. (2025). Central limit theorem for high temperature Ising models via martingale embed- ding. Preprint, arXiv:2511.06196

work page internal anchor Pith review Pith/arXiv arXiv 2025
[23]

& Saumard, A

Fathi, M., Goldstein, L., Reinert, G. & Saumard, A. (2022). Relaxing the Gaussian assumption in shrinkage and SURE in high dimension.Ann. Statist.50, 2737–2766

work page 2022
[24]

F ¨ollmer, H. (1986). Time reversal on Wiener space. In S. A. Albeverio, P. Blanchard & L. Streit, eds.,Stochastic processes - mathematics and physics, vol. 1158 ofLecture Notes in Math.Springer, pp. 119–129

work page 1986
[25]

F ¨ollmer, H. (1988). Random fields and diffusion processes. In P.-L. Hennequin, ed., ´Ecole d’´et´e de Probabilit´es de Saint-Flour XV–XVII, 1985–87, vol. 1362 ofLecture Notes in Math.Springer, pp. 101–203

work page 1988
[26]

& Gantert, N

F ¨ollmer, H. & Gantert, N. (1997). Entropy minimization and Schr ¨odinger processes in infinite dimensions.Ann. Probab.25, 901–926

work page 1997
[27]

& Chen, M

Fu, H., Yang, Z., Wang, M. & Chen, M. (2024). Unveil conditional diffusion models with classifier-free guidance: A sharp statistical theory. Preprint, arXiv: 2403.11968

work page arXiv 2024
[28]

Haussmann, U. G. & Pardoux, E. (1986). Time reversal of diffusions.Ann. Probab.14, 1188–1205

work page 1986
[29]

& Abbeel, P

Ho, J., Jain, A. & Abbeel, P. (2020). Denoising diffusion probabilistic models. InAdvances in Neural Information Processing Systems, vol. 33. pp. 6840–6851

work page 2020
[30]

& Liu, Y

Huang, J., Jiao, Y ., Kang, L., Liao, X., Liu, J. & Liu, Y . (2025). Schr ¨odinger–F¨ollmer sampler.IEEE Trans. Inform. Theory71, 1283–1299. 30

work page 2025
[31]

& Chen, Y

Huang, Z., Wei, Y . & Chen, Y . (2026). Denoising diffusion probabilistic models are optimally adaptive to un- known low dimensionality.Math. Oper. Res. (forthcoming)

work page 2026
[32]

& Zhang, T

Jain, N. & Zhang, T. (2026). A sharp KL convergence analysis for diffusion models under minimal assumptions. InThe Fourteenth International Conference on Learning Representations

work page 2026
[33]

Jamison, B. (1975). The Markov processes of Schr ¨odinger.Zeitschrift f ¨ur Wahrscheinlichkeitstheorie und Ver- wandte Gebiete32, 323–331

work page 1975
[34]

Jiao, Y ., Zhou, Y . & Li, G. (2025). Optimal convergence analysis of DDPM for general distributions. Preprint, arXiv:2510.27562

work page arXiv 2025
[35]

& Shreve, S

Karatzas, I. & Shreve, S. E. (1998).Brownian motion and stochastic calculus. Springer, 2nd edn

work page 1998
[36]

& Lehec, J

Klartag, B. & Lehec, J. (2025). Affirmative resolution of Bourgain’s slicing problem using Guan’s bound.Geom. Funct. Anal.35, 1147–1168

work page 2025
[37]

& Putterman, E

Klartag, B. & Putterman, E. (2023). Spectral monotonicity under Gaussian convolution.Ann. Fac. Sci. Toulouse Math. (6)32, 939–967

work page 2023
[38]

Lehec, J. (2013). Representation formula for the entropy and functional inequalities.Ann. Inst. Henri Poincar ´e Probab. Stat.49, 885–899

work page 2013
[39]

L ´eonard, C. (2014). A survey of the Schr ¨odinger problem and some of its connections with optimal transport. Discrete Contin. Dyn. Syst.34, 1533–1574

work page 2014
[40]

& Yan, Y

Li, G. & Yan, Y . (2024). Adapting to unknown low-dimensional structures in score-based diffusion models. In The Thirty-eighth Annual Conference on Neural Information Processing Systems

work page 2024
[41]

Liptser, R. S. & Shiryaev, A. N. (2001).Statistics of random processes: I. general theory. Springer, 2nd edn

work page 2001
[42]

(1995).Geometry of sets and measures in Euclidean spaces

Mattila, P. (1995).Geometry of sets and measures in Euclidean spaces. Cambridge University Press

work page 1995
[43]

& Vargas, F

McGuinness, M., Fladmark, E. & Vargas, F. (2024). Path integral optimiser: Global optimisation via neural Schr¨odinger–F¨ollmer diffusion. InOpt 2024: Optimization for machine learning

work page 2024
[44]

(2021).Stochastic optimal transportation

Mikami, T. (2021).Stochastic optimal transportation. Springer

work page 2021
[45]

Mikulincer, D. (2021). Stability of Talagrand’s Gaussian transport-entropy inequality via the F ¨ollmer process. Israel J. Math.242, 215–241

work page 2021
[46]

& Shenfeld, Y

Mikulincer, D. & Shenfeld, Y . (2024). The Brownian transport map.Probab. Theory Related Fields190, 379– 444

work page 2024
[47]

Montanari, A. (2023). Sampling, diffusions, and stochastic localization. Preprint, arXiv: 2305.10690

work page arXiv 2023
[48]

& Suzuki, T

Oko, K., Akiyama, S. & Suzuki, T. (2023). Diffusion models are minimax optimal distribution estimators. In A. Krause, E. Brunskill, K. Cho, B. Engelhardt, S. Sabato & J. Scarlett, eds.,Proceedings of the 40th International Conference on Machine Learning, vol. 202 ofProceedings of Machine Learning Research. PMLR, pp. 26517– 26582

work page 2023
[49]

Peluchetti, S. (2023). Diffusion bridge mixture transports, Schr ¨odinger bridge problems and generative modeling. J. Mach. Learn. Res.24, 1–51

work page 2023
[50]

Polyanskiy, Y . & Wu, Y . (2025).Information theory: From coding to learning. Cambridge University Press

work page 2025
[51]

& Yor, M

Revuz, D. & Yor, M. (1999).Continuous martingales and Brownian motion. Springer, 3rd edn

work page 1999
[52]

& Kantas, N

Ruzayqat, H., Beskos, A., Crisan, D., Jasra, A. & Kantas, N. (2023). Unbiased estimation using a class of diffusion processes.J. Comput. Phys.472, 111643. 31

work page 2023
[53]

& Zhang, M

Shi, B., Tian, K. & Zhang, M. S. (2025). Perspectives on stochastic localization. Preprint, arXiv: 2510.04460

work page arXiv 2025
[54]

P., Kumar, A., Ermon, S

Song, Y ., Sohl-Dickstein, J., Kingma, D. P., Kumar, A., Ermon, S. & Poole, B. (2021). Score-based generative modeling through stochastic differential equations. InInternational Conference on Learning Representations

work page 2021
[55]

& Raginsky, M

Tzen, B. & Raginsky, M. (2019). Theoretical guarantees for sampling and inference in generative models with latent diffusions. InProceedings of the Thirty-Second Conference on Learning Theory, vol. 99. pp. 3084–3114

work page 2019
[56]

van der Vaart, A. W. & Wellner, J. A. (2023).Weak convergence and empirical processes. Springer, 2nd edn

work page 2023
[57]

Vargas, F., Ovsianas, A., Fernandes, D., Girolami, M., Lawrence, N. D. & N ¨usken, N. (2023). Bayesian learning via neural Schr¨odinger–F¨ollmer flows.Statist. Comput.33, 3

work page 2023
[58]

(2026).High-dimensional probability

Vershynin, R. (2026).High-dimensional probability. Cambridge University Press, 2nd edn

work page 2026
[59]

& Yang, C

Wang, G., Jiao, Y ., Xu, Q., Wang, Y . & Yang, C. (2021). Deep generative learning via Schr ¨odinger bridge. In Proceedings of the 38th International Conference on Machine Learning, vol. 139. pp. 10794–10804

work page 2021
[60]

(1991).Probability with martingales

Williams, D. (1991).Probability with martingales. Cambridge University Press

work page 1991
[61]

& Chen, Y

Zhang, Q. & Chen, Y . (2022). Path integral sampler: A stochastic control approach for sampling. InInternational Conference on Learning Representations. 32

work page 2022

[1] [1]

Convergence of diffusion models under the manifold hypothesis in high-dimensions.arXiv preprint arXiv:2409.18804, 2024

Azangulov, I., Deligiannidis, G. & Rousseau, J. (2024). Convergence of diffusion models under the manifold hypothesis in high-dimensions. Preprint, arXiv: 2409.18804

work page arXiv 2024

[2] [2]

& Ledoux, M

Bakry, D., Gentil, I. & Ledoux, M. (2014).Analysis and geometry of Markov diffusion operators. Springer

work page 2014

[3] [3]

& Deligiannidis, G

Benton, J., De Bortoli, V ., Doucet, A. & Deligiannidis, G. (2024). Nearlyd-linear convergence bounds for diffu- sion models via stochastic localization. InThe Twelfth International Conference on Learning Representations

work page 2024

[4] [4]

& Theodorou, E

Chen, T., Liu, G.-H. & Theodorou, E. (2022). Likelihood training of Schr¨odinger bridge using forward-backward SDEs theory. InInternational Conference on Learning Representations

work page 2022

[5] [5]

S., Boffi, N

Chen, Y ., Goldstein, M., Hua, M., Albergo, M. S., Boffi, N. M. & Vanden-Eijnden, E. (2024). Probabilistic fore- casting with stochastic interpolants and F ¨ollmer processes. InProceedings of the 41st International Conference on Machine Learning, vol. 235. pp. 6728–6756

work page 2024

[6] [6]

Variational Optimality of F\"ollmer Processes in Generative Diffusions

Chen, Y . & Vanden-Eijnden, E. (2026). Variational optimality of F ¨ollmer processes in generative diffusions. Preprint, arXiv:2602.10989

work page internal anchor Pith review Pith/arXiv arXiv 2026

[7] [7]

Chung, K. L. & Walsh, J. B. (2005).Markov processes, Brownian motion, and time symmetry. Springer, 2nd edn

work page 2005

[8] [8]

& Gentiloni Silveri, M

Conforti, G., Durmus, A. & Gentiloni Silveri, M. (2025). KL convergence guarantees for score diffusion models under minimal data assumptions.SIAM J. Math. Data Sci.7, 86–109

work page 2025

[9] [9]

Dai, Y ., Jiao, Y ., Kang, L. & Lu, X. (2023). Global optimization via Schr ¨odinger–F¨ollmer diffusion.SIAM J. Optim.61, 2953–2980. 29

work page 2023

[10] [10]

Dai Pra, P. (1991). A stochastic control approach to reciprocal diffusion processes.Appl. Math. Optim.23, 313–329

work page 1991

[11] [11]

De Bortoli, V . (2022). Convergence of denoising diffusion models under the manifold hypothesis.Transactions on Machine Learning Research

work page 2022

[12] [12]

& Doucet, A

De Bortoli, V ., Thornton, J., Heng, J. & Doucet, A. (2021). Diffusion Schr ¨odinger bridge with applications to score-based generative modeling. In M. Ranzato, A. Beygelzimer, Y . Dauphin, P. Liang & J. W. Vaughan, eds., Advances in Neural Information Processing Systems, vol. 34. Curran Associates, Inc., pp. 17695–17709

work page 2021

[13] [13]

Efron, B. (2011). Tweedie’s formula and selection bias.J. Amer. Statist. Assoc.106, 1602–1614

work page 2011

[14] [14]

Efron, B. (2024). Machine learning and the James–Stein estimator.Jpn. J. Stat. Data Sci.7, 257–266

work page 2024

[15] [15]

Eldan, R. (2013). Thin shell implies spectral gap up to polylog via a stochastic localization scheme.Geom. Funct. Anal.23, 532–569

work page 2013

[16] [16]

& Lee, J

Eldan, R. & Lee, J. R. (2018). Regularization under diffusion and anticoncentration of the information content. Duke Math. J.167, 969–993

work page 2018

[17] [17]

& Shenfeld, Y

Eldan, R., Lehec, J. & Shenfeld, Y . (2020). Stability of the logarithmic Sobolev inequality via the F ¨ollmer process.Ann. Inst. Henri Poincar ´e Probab. Stat.56, 2253–2269

work page 2020

[18] [18]

& Mikulincer, D

Eldan, R. & Mikulincer, D. (2020). Stability of the Shannon–Stam inequality via the F ¨ollmer process.Probab. Theory Related Fields177, 891–922

work page 2020

[19] [19]

& Zhai, A

Eldan, R., Mikulincer, D. & Zhai, A. (2020). The CLT in high dimensions: quantitative bounds via martingale embedding.Ann. Probab.48, 2494–2524

work page 2020

[20] [20]

& Nakano, Y

Endo, K. & Nakano, Y . (2024). Weak approximation of Schr ¨odinger–F¨ollmer diffusion.Statist. Probab. Lett. 213, 110171

work page 2024

[21] [21]

& Koike, Y

Fang, X. & Koike, Y . (2024). Sharp high-dimensional central limit theorems for log-concave distributions.Ann. Inst. Henri Poincar´e Probab. Stat.60, 2129–2156

work page 2024

[22] [22]

Central limit theorem for high temperature spin models via martingale embedding

Fang, X. & Zhao, Y .-K. (2025). Central limit theorem for high temperature Ising models via martingale embed- ding. Preprint, arXiv:2511.06196

work page internal anchor Pith review Pith/arXiv arXiv 2025

[23] [23]

& Saumard, A

Fathi, M., Goldstein, L., Reinert, G. & Saumard, A. (2022). Relaxing the Gaussian assumption in shrinkage and SURE in high dimension.Ann. Statist.50, 2737–2766

work page 2022

[24] [24]

F ¨ollmer, H. (1986). Time reversal on Wiener space. In S. A. Albeverio, P. Blanchard & L. Streit, eds.,Stochastic processes - mathematics and physics, vol. 1158 ofLecture Notes in Math.Springer, pp. 119–129

work page 1986

[25] [25]

F ¨ollmer, H. (1988). Random fields and diffusion processes. In P.-L. Hennequin, ed., ´Ecole d’´et´e de Probabilit´es de Saint-Flour XV–XVII, 1985–87, vol. 1362 ofLecture Notes in Math.Springer, pp. 101–203

work page 1988

[26] [26]

& Gantert, N

F ¨ollmer, H. & Gantert, N. (1997). Entropy minimization and Schr ¨odinger processes in infinite dimensions.Ann. Probab.25, 901–926

work page 1997

[27] [27]

& Chen, M

Fu, H., Yang, Z., Wang, M. & Chen, M. (2024). Unveil conditional diffusion models with classifier-free guidance: A sharp statistical theory. Preprint, arXiv: 2403.11968

work page arXiv 2024

[28] [28]

Haussmann, U. G. & Pardoux, E. (1986). Time reversal of diffusions.Ann. Probab.14, 1188–1205

work page 1986

[29] [29]

& Abbeel, P

Ho, J., Jain, A. & Abbeel, P. (2020). Denoising diffusion probabilistic models. InAdvances in Neural Information Processing Systems, vol. 33. pp. 6840–6851

work page 2020

[30] [30]

& Liu, Y

Huang, J., Jiao, Y ., Kang, L., Liao, X., Liu, J. & Liu, Y . (2025). Schr ¨odinger–F¨ollmer sampler.IEEE Trans. Inform. Theory71, 1283–1299. 30

work page 2025

[31] [31]

& Chen, Y

Huang, Z., Wei, Y . & Chen, Y . (2026). Denoising diffusion probabilistic models are optimally adaptive to un- known low dimensionality.Math. Oper. Res. (forthcoming)

work page 2026

[32] [32]

& Zhang, T

Jain, N. & Zhang, T. (2026). A sharp KL convergence analysis for diffusion models under minimal assumptions. InThe Fourteenth International Conference on Learning Representations

work page 2026

[33] [33]

Jamison, B. (1975). The Markov processes of Schr ¨odinger.Zeitschrift f ¨ur Wahrscheinlichkeitstheorie und Ver- wandte Gebiete32, 323–331

work page 1975

[34] [34]

Jiao, Y ., Zhou, Y . & Li, G. (2025). Optimal convergence analysis of DDPM for general distributions. Preprint, arXiv:2510.27562

work page arXiv 2025

[35] [35]

& Shreve, S

Karatzas, I. & Shreve, S. E. (1998).Brownian motion and stochastic calculus. Springer, 2nd edn

work page 1998

[36] [36]

& Lehec, J

Klartag, B. & Lehec, J. (2025). Affirmative resolution of Bourgain’s slicing problem using Guan’s bound.Geom. Funct. Anal.35, 1147–1168

work page 2025

[37] [37]

& Putterman, E

Klartag, B. & Putterman, E. (2023). Spectral monotonicity under Gaussian convolution.Ann. Fac. Sci. Toulouse Math. (6)32, 939–967

work page 2023

[38] [38]

Lehec, J. (2013). Representation formula for the entropy and functional inequalities.Ann. Inst. Henri Poincar ´e Probab. Stat.49, 885–899

work page 2013

[39] [39]

L ´eonard, C. (2014). A survey of the Schr ¨odinger problem and some of its connections with optimal transport. Discrete Contin. Dyn. Syst.34, 1533–1574

work page 2014

[40] [40]

& Yan, Y

Li, G. & Yan, Y . (2024). Adapting to unknown low-dimensional structures in score-based diffusion models. In The Thirty-eighth Annual Conference on Neural Information Processing Systems

work page 2024

[41] [41]

Liptser, R. S. & Shiryaev, A. N. (2001).Statistics of random processes: I. general theory. Springer, 2nd edn

work page 2001

[42] [42]

(1995).Geometry of sets and measures in Euclidean spaces

Mattila, P. (1995).Geometry of sets and measures in Euclidean spaces. Cambridge University Press

work page 1995

[43] [43]

& Vargas, F

McGuinness, M., Fladmark, E. & Vargas, F. (2024). Path integral optimiser: Global optimisation via neural Schr¨odinger–F¨ollmer diffusion. InOpt 2024: Optimization for machine learning

work page 2024

[44] [44]

(2021).Stochastic optimal transportation

Mikami, T. (2021).Stochastic optimal transportation. Springer

work page 2021

[45] [45]

Mikulincer, D. (2021). Stability of Talagrand’s Gaussian transport-entropy inequality via the F ¨ollmer process. Israel J. Math.242, 215–241

work page 2021

[46] [46]

& Shenfeld, Y

Mikulincer, D. & Shenfeld, Y . (2024). The Brownian transport map.Probab. Theory Related Fields190, 379– 444

work page 2024

[47] [47]

Montanari, A. (2023). Sampling, diffusions, and stochastic localization. Preprint, arXiv: 2305.10690

work page arXiv 2023

[48] [48]

& Suzuki, T

Oko, K., Akiyama, S. & Suzuki, T. (2023). Diffusion models are minimax optimal distribution estimators. In A. Krause, E. Brunskill, K. Cho, B. Engelhardt, S. Sabato & J. Scarlett, eds.,Proceedings of the 40th International Conference on Machine Learning, vol. 202 ofProceedings of Machine Learning Research. PMLR, pp. 26517– 26582

work page 2023

[49] [49]

Peluchetti, S. (2023). Diffusion bridge mixture transports, Schr ¨odinger bridge problems and generative modeling. J. Mach. Learn. Res.24, 1–51

work page 2023

[50] [50]

Polyanskiy, Y . & Wu, Y . (2025).Information theory: From coding to learning. Cambridge University Press

work page 2025

[51] [51]

& Yor, M

Revuz, D. & Yor, M. (1999).Continuous martingales and Brownian motion. Springer, 3rd edn

work page 1999

[52] [52]

& Kantas, N

Ruzayqat, H., Beskos, A., Crisan, D., Jasra, A. & Kantas, N. (2023). Unbiased estimation using a class of diffusion processes.J. Comput. Phys.472, 111643. 31

work page 2023

[53] [53]

& Zhang, M

Shi, B., Tian, K. & Zhang, M. S. (2025). Perspectives on stochastic localization. Preprint, arXiv: 2510.04460

work page arXiv 2025

[54] [54]

P., Kumar, A., Ermon, S

Song, Y ., Sohl-Dickstein, J., Kingma, D. P., Kumar, A., Ermon, S. & Poole, B. (2021). Score-based generative modeling through stochastic differential equations. InInternational Conference on Learning Representations

work page 2021

[55] [55]

& Raginsky, M

Tzen, B. & Raginsky, M. (2019). Theoretical guarantees for sampling and inference in generative models with latent diffusions. InProceedings of the Thirty-Second Conference on Learning Theory, vol. 99. pp. 3084–3114

work page 2019

[56] [56]

van der Vaart, A. W. & Wellner, J. A. (2023).Weak convergence and empirical processes. Springer, 2nd edn

work page 2023

[57] [57]

Vargas, F., Ovsianas, A., Fernandes, D., Girolami, M., Lawrence, N. D. & N ¨usken, N. (2023). Bayesian learning via neural Schr¨odinger–F¨ollmer flows.Statist. Comput.33, 3

work page 2023

[58] [58]

(2026).High-dimensional probability

Vershynin, R. (2026).High-dimensional probability. Cambridge University Press, 2nd edn

work page 2026

[59] [59]

& Yang, C

Wang, G., Jiao, Y ., Xu, Q., Wang, Y . & Yang, C. (2021). Deep generative learning via Schr ¨odinger bridge. In Proceedings of the 38th International Conference on Machine Learning, vol. 139. pp. 10794–10804

work page 2021

[60] [60]

(1991).Probability with martingales

Williams, D. (1991).Probability with martingales. Cambridge University Press

work page 1991

[61] [61]

& Chen, Y

Zhang, Q. & Chen, Y . (2022). Path integral sampler: A stochastic control approach for sampling. InInternational Conference on Learning Representations. 32

work page 2022