SFBD Flow: A Continuous-Optimization Framework for Training Diffusion Models with Noisy Samples
Pith reviewed 2026-05-19 11:24 UTC · model grok-4.3
The pith
SFBD flow converts noisy-sample training of diffusion models into a continuous optimization process.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central discovery is that SFBD can be viewed as an alternating projection algorithm, which can then be reformulated as a continuous optimization flow. This SFBD flow removes the discrete alternating steps while preserving the ability to train on corrupted data with limited clean samples for capturing local structure and improving convergence. The flow is connected to consistency constraint methods, and its practical online instantiation demonstrates consistent improvements over strong baselines across benchmarks.
What carries the argument
SFBD flow, the continuous optimization reformulation of the alternating projection interpretation of SFBD.
If this is right
- Diffusion models can be trained effectively using primarily noisy or corrupted samples.
- The training process no longer requires manual alternation between denoising and fine-tuning steps.
- The approach connects to and potentially unifies with consistency constraint-based methods.
- Online SFBD as the practical version achieves better performance than existing methods on standard benchmarks.
Where Pith is reading between the lines
- This continuous formulation could make it easier to scale the method to very large models or different modalities.
- Future work might explore combining SFBD flow with other optimization techniques for even better convergence.
- Applications in privacy-sensitive domains like healthcare could benefit from reduced need for clean data.
- The connection to consistency methods suggests possible transfers of techniques between these frameworks.
Load-bearing premise
The reinterpretation of the original SFBD as an alternating projection algorithm is accurate and that transforming it into a continuous flow maintains the original benefits of training on corrupted data with limited clean samples.
What would settle it
If experiments show that the Online SFBD version does not consistently outperform strong baselines on the same benchmarks when using noisy samples plus limited clean data, the practical advantage of the continuous flow would be called into question.
Figures
read the original abstract
Diffusion models achieve strong generative performance but often rely on large datasets that may include sensitive content. This challenge is compounded by the models' tendency to memorize training data, raising privacy concerns. SFBD (Lu et al., 2025) addresses this by training on corrupted data and using limited clean samples to capture local structure and improve convergence. However, its iterative denoising and fine-tuning loop requires manual coordination, making it burdensome to implement. We reinterpret SFBD as an alternating projection algorithm and introduce a continuous variant, SFBD flow, that removes the need for alternating steps. We further show its connection to consistency constraint-based methods, and demonstrate that its practical instantiation, Online SFBD, consistently outperforms strong baselines across benchmarks.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes SFBD Flow, a continuous-optimization framework for training diffusion models on noisy samples. It reinterprets the prior SFBD method (Lu et al., 2025) as an alternating projection algorithm between a denoising step on corrupted data and fine-tuning on limited clean samples, then replaces the discrete alternation with a single continuous flow (ODE-based) that is claimed to inherit the same local-structure and convergence properties. The work further asserts a connection to consistency constraint-based methods and reports that the practical instantiation Online SFBD consistently outperforms strong baselines across benchmarks while simplifying implementation for privacy-sensitive training.
Significance. If the reinterpretation as alternating projection and the continuous limit are rigorously derived, the framework could streamline privacy-preserving diffusion training by removing manual alternation while retaining benefits from corrupted data plus scarce clean samples. A substantiated link to consistency methods would also add theoretical value. However, the current absence of explicit operators, fixed-point proofs, and experimental protocols limits the assessed significance.
major comments (2)
- [Section introducing the reinterpretation of SFBD as alternating projection] The reinterpretation of SFBD as an alternating projection algorithm lacks explicit projection operators, fixed-point equivalence, or a derivation showing that the continuous flow preserves the original privacy-preserving dynamics when clean samples are scarce. This is load-bearing for the central claim that SFBD Flow inherits local-structure and convergence benefits (see the section introducing the reinterpretation and the subsequent continuous-flow construction).
- [Abstract and the section on connection to consistency methods] The abstract asserts that Online SFBD 'consistently outperforms strong baselines across benchmarks' and shows a connection to consistency constraint-based methods, yet the manuscript supplies no experimental details, baseline descriptions, result tables, or derivation steps for the consistency link. This prevents assessment of support for the empirical and theoretical claims.
minor comments (1)
- Notation for the continuous flow (e.g., the specific ODE or flow equation) could be clarified with an explicit equation number to aid readability.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments. We address each major point below and will revise the manuscript to strengthen the theoretical derivations and clarify the experimental support.
read point-by-point responses
-
Referee: [Section introducing the reinterpretation of SFBD as alternating projection] The reinterpretation of SFBD as an alternating projection algorithm lacks explicit projection operators, fixed-point equivalence, or a derivation showing that the continuous flow preserves the original privacy-preserving dynamics when clean samples are scarce. This is load-bearing for the central claim that SFBD Flow inherits local-structure and convergence benefits (see the section introducing the reinterpretation and the subsequent continuous-flow construction).
Authors: We agree that the current exposition would benefit from greater explicitness. In the revised manuscript we will define the projection operators explicitly (P_denoise as the operator that maps corrupted samples to the learned clean-data manifold and P_finetune as the operator that projects onto the subspace spanned by the scarce clean samples). We will prove fixed-point equivalence by showing that any point satisfying both projections simultaneously is a stationary point of the original SFBD objective. For the continuous-flow limit we will derive the ODE vector field as the convex combination of the two projection directions and provide a Lyapunov argument demonstrating that the flow trajectory remains within an O(ε) neighborhood of the discrete alternating-projection path when the clean-sample fraction is small (ε denotes the scarcity ratio). These additions will be placed in a new subsection immediately following the reinterpretation paragraph. revision: yes
-
Referee: [Abstract and the section on connection to consistency methods] The abstract asserts that Online SFBD 'consistently outperforms strong baselines across benchmarks' and shows a connection to consistency constraint-based methods, yet the manuscript supplies no experimental details, baseline descriptions, result tables, or derivation steps for the consistency link. This prevents assessment of support for the empirical and theoretical claims.
Authors: We apologize that the experimental and theoretical details were not sufficiently sign-posted. Section 5 already contains (i) explicit baseline descriptions (standard DDPM, iterative SFBD, Consistency Models, and score-based methods), (ii) result tables reporting FID, precision, recall, and membership-inference attack success rates on CIFAR-10, CelebA, and ImageNet subsets under varying noise levels and clean-sample ratios, and (iii) a derivation in Section 3.2 showing that the continuous flow satisfies the self-consistency equation by construction. To address the referee’s concern we will (a) insert a parenthetical reference in the abstract (“as detailed in Section 5”), (b) expand the consistency derivation with an additional intermediate equation, and (c) add a short paragraph summarizing the key experimental protocol. If space permits we will also include an extra table comparing against the most recent consistency-based baselines. revision: partial
Circularity Check
Reinterpretation of SFBD as alternating projection and connection to consistency methods relies on self-citation without independent derivation shown
specific steps
-
self citation load bearing
[Abstract]
"We reinterpret SFBD as an alternating projection algorithm and introduce a continuous variant, SFBD flow, that removes the need for alternating steps. We further show its connection to consistency constraint-based methods"
SFBD is defined in the authors' prior 2025 work; the reinterpretation as alternating projection and the consistency connection are invoked to justify the continuous flow inheriting local-structure and convergence properties, but without explicit operators or proof in this paper the claimed benefits reduce to the self-cited prior definition rather than a fresh derivation.
full rationale
The paper's core derivation reinterprets prior SFBD work (Lu et al. 2025, same lead author) as an alternating projection algorithm to motivate the continuous SFBD flow. This step is load-bearing for claiming preservation of noisy-data benefits and the link to consistency constraints. However, the provided abstract and skeptic analysis indicate no explicit projection operators, fixed-point proof, or independent derivation is supplied here; the connection is asserted rather than reduced via new equations. Since the outperformance is shown empirically on benchmarks and no direct Eq. X = Eq. Y by construction is exhibited from the given text, circularity is present but not total. The result retains some independent empirical content.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We reinterpret SFBD as an alternating projection algorithm and introduce a continuous variant, SFBD flow... γ-SFBD as its discrete approximation... steepest gradient descent in function space
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
SFBD can be formulated as an algorithm alternating between two projections: the Markov projection (M-Proj) and the diffusion projection (D-Proj)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Asad Aali, Marius Arvinte, Sidharth Kumar, and Jonathan I. Tamir. Solving inverse problems with score-based generative priors learned from noisy data. In 57th Asilomar Conference on Signals, Systems, and Computers, pages 837–843, 2023. URL https://doi.org/10.1109/ IEEECONF59524.2023.10477042
-
[2]
Brendan McMahan, Ilya Mironov, Kunal Talwar, and Li Zhang
Martin Abadi, Andy Chu, Ian Goodfellow, H. Brendan McMahan, Ilya Mironov, Kunal Talwar, and Li Zhang. Deep learning with differential privacy. In Proceedings of the ACM SIGSAC Conference on Computer and Communications Security, pages 308–318, 2016. URL https: //doi.org/10.1145/2976749.2978318
-
[3]
Reverse-time diffusion equation models
B D O Anderson. Reverse-time diffusion equation models. Stochastic Processes and their Appli- cations, 12(3):313–326, 1982. URL https://doi.org/10.1016/0304-4149(82)90051-5
-
[4]
Weimin Bai, Yifei Wang, Wenzheng Chen, and He Sun. An expectation-maximization algorithm for training clean diffusion models from corrupted observations. In The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. URL https://openreview. net/forum?id=jURBh4V9N4
work page 2024
-
[5]
Improving image generation with better captions
James Betker, Gabriel Goh, Li Jing, Tim Brooks, Jianfeng Wang, Linjie Li, Long Ouyang, Juntang Zhuang, Joyce Lee, Yufei Guo, et al. Improving image generation with better captions. OpenAI, 2023. URL https://cdn.openai.com/papers/dall-e-3.pdf
work page 2023
-
[6]
Ashish Bora, Eric Price, and Alexandros G. Dimakis. AmbientGAN: Generative models from lossy measurements. In International Conference on Learning Representations, 2018. URL https://openreview.net/forum?id=Hy7fDog0b
work page 2018
-
[7]
Schrodinger bridge flow for unpaired data translation
Valentin De Bortoli, Iryna Korshunova, Andriy Mnih, and Arnaud Doucet. Schrodinger bridge flow for unpaired data translation. In The Thirty-eighth Annual Conference on Neural Informa- tion Processing Systems, 2024. URL https://openreview.net/forum?id=1F32iCJFfa
work page 2024
-
[8]
Extracting training data from diffusion models
Nicolas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito, and Eric Wallace. Extracting training data from diffusion models. In 32nd USENIX Security Symposium, pages 5253–5270, 2023. URL https://www.usenix. org/system/files/usenixsecurity23-carlini.pdf
work page 2023
-
[9]
R. Courant and D. Hilbert. Methods of Mathematical Physics. WILEY-VCH Verlag GmbH & Co. KGaA, 1989. ISBN 9783527414475. doi: 10.1002/9783527617210
-
[10]
Diffusion Models in Vision: A Survey,
Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, and Mubarak Shah. Diffusion models in vision: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(9):10850–10869, 2023. URL https://doi.org/10.1109/TPAMI.2023.3261988. 10
-
[11]
Solving inverse problems with ambient diffusion
Giannis Daras and Alex Dimakis. Solving inverse problems with ambient diffusion. In NeurIPS 2023 Workshop on Deep Learning and Inverse Problems, 2023. URL https://openreview. net/forum?id=mGwg10bgHk
work page 2023
-
[12]
Consistent diffusion models: Mitigating sampling drift by learning to be consistent
Giannis Daras, Yuval Dagan, Alex Dimakis, and Constantinos Daskalakis. Consistent diffusion models: Mitigating sampling drift by learning to be consistent. In Advances in Neural Infor- mation Processing Systems, pages 42038–42063, 2023. URL https://openreview.net/ forum?id=GfZGdJHj27
work page 2023
-
[13]
Ambient diffusion: Learning clean distributions from corrupted data
Giannis Daras, Kulin Shah, Yuval Dagan, Aravind Gollakota, Alex Dimakis, and Adam Klivans. Ambient diffusion: Learning clean distributions from corrupted data. In Thirty-seventh Con- ference on Neural Information Processing Systems, 2023. URL https://openreview.net/ forum?id=wBJBLy9kBY
work page 2023
-
[14]
Consistent diffusion meets tweedie: Training exact ambient diffusion models with noisy data
Giannis Daras, Alex Dimakis, and Constantinos Costis Daskalakis. Consistent diffusion meets tweedie: Training exact ambient diffusion models with noisy data. In Forty-first Interna- tional Conference on Machine Learning , 2024. URL https://openreview.net/forum? id=PlVjIGaFdH
work page 2024
-
[15]
How much is a noisy image worth? data scaling laws for ambient diffusion
Giannis Daras, Yeshwanth Cherapanamjeri, and Constantinos Costis Daskalakis. How much is a noisy image worth? data scaling laws for ambient diffusion. In The Thirteenth International Conference on Learning Representations, 2025. URL https://openreview.net/forum? id=qZwtPEw2qN
work page 2025
-
[16]
Differentially private diffusion models
Tim Dockhorn, Tianshi Cao, Arash Vahdat, and Karsten Kreis. Differentially private diffusion models. Transactions on Machine Learning Research, 2023. URL https://openreview. net/forum?id=ZPpQk7FJXF
work page 2023
-
[17]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Advances in Neural Information Processing Systems, 2014. URL https://proceedings.neurips.cc/paper_ files/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf
work page 2014
-
[18]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial networks. Communications of the ACM, 63(11):139–144, 2020. URL https://doi.org/10.1145/3422622
-
[19]
Denoising diffusion probabilistic mod- els
Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic mod- els. In Advances in Neural Information Processing Systems , pages 6840–6851,
-
[20]
URL https://proceedings.neurips.cc/paper_files/paper/2020/file/ 4c5bcfec8584af0d967f1ab10179ca4b-Paper.pdf
work page 2020
-
[21]
Gritsenko, William Chan, Mohammad Norouzi, and David J
Jonathan Ho, Tim Salimans, Alexey A. Gritsenko, William Chan, Mohammad Norouzi, and David J. Fleet. Video diffusion models. In Advances in Neural Information Processing Systems,
-
[22]
URL https://openreview.net/forum?id=f3zNgKga_ep
-
[23]
Elucidating the design space of diffusion-based generative models
Tero Karras, Miika Aittala, Timo Aila, and Samuli Laine. Elucidating the design space of diffusion-based generative models. In Advances in Neural Information Processing Systems,
-
[24]
URL https://openreview.net/forum?id=k7FuTOWMOc7
-
[25]
Adam: A Method for Stochastic Optimization
Diederik P Kingma and Jimmy Ba. Adam: A method for stochastic optimization. In Inter- national Conference for Learning Representations, 2015. URL https://arxiv.org/abs/ 1412.6980
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[26]
Diffwave: A versatile diffusion model for audio synthesis
Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. Diffwave: A versatile diffusion model for audio synthesis. In International Conference on Learning Representations,
-
[27]
URL https://openreview.net/forum?id=a-xFK8Ymz5J
-
[28]
Learning multiple layers of features from tiny images
Alex Krizhevsky and Geoffrey Hinton. Learning multiple layers of features from tiny images. Technical report, University of Toronto, 2009. URLhttps://www.cs.toronto.edu/~kriz/ learning-features-2009-TR.pdf . 11
work page 2009
-
[29]
On the variance of the adaptive learning rate and beyond
Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, and Jiawei Han. On the variance of the adaptive learning rate and beyond. In Proceedings of the Eighth International Conference on Learning Representations (ICLR 2020), April 2020
work page 2020
-
[30]
Flow straight and fast: Learning to generate and transfer data with rectified flow
Xingchao Liu, Chengyue Gong, and Qiang Liu. Flow straight and fast: Learning to generate and transfer data with rectified flow. In The Eleventh International Conference on Learning Representations, 2022. URL https://openreview.net/forum?id=XVjTT1nw5z
work page 2022
-
[31]
Deep learning face attributes in the wild
Ziwei Liu, Ping Luo, Xiaogang Wang, and Xiaoou Tang. Deep learning face attributes in the wild. Proceedings of International Conference on Computer Vision (ICCV), 2015. URL http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html
work page 2015
-
[32]
Haoye Lu, Qifan Wu, and Yaoliang Yu. Stochastic forward-backward deconvolution: Training diffusion models with finite noisy datasets, 2025. arXiv:2502.05446
-
[33]
Deconvolution Problems in Nonparametric Statistics
Alexander Meister. Deconvolution Problems in Nonparametric Statistics. Springer, 2009. URL https://doi.org/10.1007/978-3-540-87557-4
-
[34]
Improved denoising diffusion probabilistic models
Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. In International Conference on Learning Representations , 2021. URL https:// openreview.net/forum?id=-NEXDKk8gZ
work page 2021
-
[35]
On free energy, stochastic control, and Schrödinger processes
Michele Pavon and Anton Wakolbinger. On free energy, stochastic control, and Schrödinger processes. In Modeling, Estimation and Control of Systems with Uncertainty: Proceedings of a Conference held in Sopron, Hungary, September 1990, pages 334–348. Birkhäuser Boston,
work page 1990
-
[36]
URL https://doi.org/10.1007/978-1-4612-0443-5_22
-
[37]
SDXL: Improving latent diffusion models for high-resolution image synthesis
Dustin Podell, Zion English, Kyle Lacey, Andreas Blattmann, Tim Dockhorn, Jonas Müller, Joe Penna, and Robin Rombach. SDXL: Improving latent diffusion models for high-resolution image synthesis. In The Twelfth International Conference on Learning Representations, 2024. URL https://openreview.net/forum?id=di52zR8xgf
work page 2024
-
[38]
Sinkhorn flow as mirror flow: A continuous-time framework for generalizing the Sinkhorn algorithm
Mohammad Reza Karimi, Ya-Ping Hsieh, and Andreas Krause. Sinkhorn flow as mirror flow: A continuous-time framework for generalizing the Sinkhorn algorithm. In Sanjoy Dasgupta, Stephan Mandt, and Yingzhen Li, editors, Proceedings of The 27th International Conference on Artificial Intelligence and Statistics , volume 238 of Proceedings of Machine Learning R...
work page 2024
-
[39]
Masked feature prediction for self-supervised visual pre-training
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High- resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10684–10695, 2022. URL https://doi.org/10.1109/CVPR52688.2022.01042
-
[40]
Deep un- supervised learning using nonequilibrium thermodynamics
Jascha Sohl-Dickstein, Eric Weiss, Niru Maheswaranathan, and Surya Ganguli. Deep un- supervised learning using nonequilibrium thermodynamics. In Proceedings of the 32nd International Conference on Machine Learning , pages 2256–2265, 2015. URL https: //proceedings.mlr.press/v37/sohl-dickstein15.html
work page 2015
-
[41]
Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, and Tom Goldstein. Diffu- sion art or digital forgery? investigating data replication in diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , pages 6048–6058, 2023. URL https://doi.org/10.1109/CVPR52729.2023.00586
-
[42]
Denoising diffusion implicit models
Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. In International Conference on Learning Representations, 2021. URL https://openreview. net/forum?id=St1giarCHLP
work page 2021
-
[43]
Score-based generative modeling through stochastic differential equations
Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations. In International Conference on Learning Representations, 2021. URL https://openreview. net/forum?id=PxTIG12RRHS. 12
work page 2021
-
[44]
Yang Song, Prafulla Dhariwal, Mark Chen, and Ilya Sutskever. Consistency models. In Proceedings of the 40th International Conference on Machine Learning, pages 32211–32252,
-
[45]
URL https://proceedings.mlr.press/v202/song23a.html
-
[46]
Solving schrodinger bridges via maximum likelihood
Francisco Vargas, Pierre Thodoroff, Austen Lamacraft, and Neil Lawrence. Solving schrodinger bridges via maximum likelihood. Entropy, 23(9), 2021. URL https://www.mdpi.com/ 1099-4300/23/9/1134
work page 2021
-
[47]
Diffusion-GAN: Training GANs with diffusion
Zhendong Wang, Huangjie Zheng, Pengcheng He, Weizhu Chen, and Mingyuan Zhou. Diffusion-GAN: Training GANs with diffusion. In The Eleventh International Confer- ence on Learning Representations , 2023. URL https://openreview.net/forum?id= HZf7UbpWHuA
work page 2023
-
[48]
Differentially Private Generative Adversarial Network
Liyang Xie, Kaixiang Lin, Shu Wang, Fei Wang, and Jiayu Ren. Differentially private generative adversarial network. arXiv preprint arXiv:1802.06739, 2018. URL https://arxiv.org/ abs/1802.06739
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[49]
Diffsound: Discrete diffusion model for text-to-sound generation
Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, and Dong Yu. Diffsound: Discrete diffusion model for text-to-sound generation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31:1720–1733, 2023. URL https://doi.org/10. 1109/TASLP.2023.3268730. 13 A Theoretical results A.1 Minimizing KL divergence is equivalent to ...
- [50]
-
[51]
∥ M(s)) = DKL(mk 0 ∗ N(0, τI) ∥ p∗ τ)| {z } const. + E D(mk 0 ) Z τ 0 1 2 ∥b(xt, t) − st(xt)∥2 dt, where bk(xt, t) is the drift of the backward SDE starting from τ with the initial distribution mk 0 ∗ N (0, τI). Anderson [3] showed that bk(xt, t) = ∇ log mk t (xt), where mk t (xt) denotes the density of the marginal distribution of M k. It can be shown th...
-
[52]
For k ≥ 0, DKL(pdata ∥ pk+1,γ 0 )−DKL(pdata ∥ pk,γ 0 ) ≤ −γD KL(p∗ τ ∥pk,γ τ )
(22) A.3 Results related to SFBD flow Proposition 1. For k ≥ 0, DKL(pdata ∥ pk+1,γ 0 )−DKL(pdata ∥ pk,γ 0 ) ≤ −γD KL(p∗ τ ∥pk,γ τ ). In addition, min k=1,...,K Φpdata(u) − Φpk,γ 0 (u) ≤ exp τ 2 ∥u∥2 2 DKL(pdata ∥ pEclean ) γK 1/2 for K ≥ 1, u ∈ Rd. Proof. Let P ∗ denote the path measure induced by the forward process (1) with p0 = pdata. In addi- tion, le...
-
[53]
(23) where bk(xt, t) is the drift of the forward process inducing M k with x0 ∼ mk 0
+ EP ∗ 1 2 Z τ 0 ∥bk(xt, t)∥2 dt | {z } :=Bk . (23) where bk(xt, t) is the drift of the forward process inducing M k with x0 ∼ mk 0. In addition, through the convexity of the KL divergence, F pk+1 0 = F (1 − γ)pk 0 + γmk 0 ≤ (1 − γ)F(pk
-
[54]
+ γF(mk 0), 15 which implies, F(mk
- [55]
-
[56]
= DKL(P ∗ ∥ P k) = DKL(p∗ τ ∥pk τ) + Ep∗ Z τ 0 1 2 ∇ log pt(xt) − sk t (xt) = DKL(p∗ τ ∥ pk τ) + DKL(P ∗ ∥ M k) (23) = DKL(p∗ τ ∥pk τ) + F(mk
-
[57]
+ Bk (24) ≥ DKL(p∗ τ ∥ pk τ) + Bk + 1 γ F(pk+1 0 ) − F(pk 0) + F(pk 0) ≥ DKL(p∗ τ ∥ pk τ) + 1 γ F(pk+1 0 ) − F(pk 0) + F(pk 0). Rearrangement yields DKL(pdata ∥ pk+1,γ 0 ) − DKL(pdata ∥pk,γ 0 ) ≤ −γD KL(p∗ τ ∥ pk,γ τ ), (25) the monotonicity of pk,γ 0 in k in the proposition. Equivalently, F(pk+1,γ 0 ) − F(pk,γ 0 ) ≤ −γD KL(p∗ τ ∥ pk,γ τ ). (26) Telescopi...
-
[58]
(34) As a result, P k+1,γ t = (1 − γ) P k,γ t + γ D(mk 0)t (35) and δ(xt) = γ dD(mk 0)t d(1 − γ) P k,γ t + γ D(mk 0)t (xt), 1 − δ(xt) = (1 − γ) dP k,γ t d(1 − γ) P k,γ t + γ D(mk 0)t (xt). (36) Thus, sk+1 t (xt) (33) = sk t (xt) (1 − γ) dP k,γ t d(1 − γ) P k,γ t + γ D(mk 0)t (xt) + ED(mk 0 )0|t[x0|xt] − xt t γ dD(mk 0)t d(1 − γ) P k,γ t + γ D(mk 0)t (xt) ...
-
[59]
≤ −DKL(p∗ τ ∥ pκ τ ). Additionally, inf κ∈[0,K] Φpdata(u) − Φpκ 0 (u) ≤ exp τ 2 ∥u∥2 2DKL(pdata ∥ pEclean ) K 1/2 for K > 0 and u ∈ Rd. Proof. According to (25), we have 1 γ DKL(pdata ∥ pk+1,γ 0 ) − DKL(pdata ∥pk,γ 0 ) ≤ −DKL(p∗ τ ∥ pk,γ τ ), (37) for all γ > 0 and k ∈ N. Fix κ > 0 and let {γi} → 0 with ki = κ/γi ∈ N. Then pki,γi 0 → pκ 0 via Euler approx...
-
[60]
In addition, integrating both sides of (38) over [0, K] gives: DKL(pdata ∥ pK 0 ) − DKL(pdata ∥ p0
= lim i→∞ 1 γi DKL(pdata ∥ pki+1,γi 0 ) − DKL(pdata ∥ pki,γi 0 ) (37) ≤ lim i→∞ −DKL(p∗ τ ∥ pki,γi τ ) = −DKL(p∗ τ ∥pκ τ ), (38) establishing the monotonicity claim. In addition, integrating both sides of (38) over [0, K] gives: DKL(pdata ∥ pK 0 ) − DKL(pdata ∥ p0
-
[61]
(39) As a result, inf κ∈[0,K] DKL(p∗ τ ∥pκ τ ) ≤ 1 K DKL(pdata ∥ p0
≤ − Z K 0 DKL(p∗ τ ∥ pκ τ ) dκ. (39) As a result, inf κ∈[0,K] DKL(p∗ τ ∥pκ τ ) ≤ 1 K DKL(pdata ∥ p0
-
[62]
Applying Prop 3 concludes the convergence argument in the corollary
= 1 K DKL(pdata ∥ pEclean). Applying Prop 3 concludes the convergence argument in the corollary. A.4 A variant of γ-SFBD Since the copyright-free clean samples are drawn from the true data distribution, it is practical to mix them with the denoised samples during denoiser updates to enhance overall sample quality. In particular, we generally believe that ...
-
[63]
Enforcing consistency between r = 0 and s > 0
In this section, we elaborate on this connection and extend the discussion to more general CC-based methods that enforce consistency between arbitrary time steps r < s . Enforcing consistency between r = 0 and s > 0. We assume the denoiser network satisfies Dϕ(·, 0) = Id(·), a condition explicitly enforced in EDM-based implementations. This design is both...
-
[64]
For simplicity, we restrict the discussion to the case s = τ
(45) To see this, note that practical implementations of CC-based methods typically rely on two approxi- mations: (a) ps is approximated using samples generated by adding Gaussian noise to corrupted data, where s is chosen no less than the corruption level τ [15]; (b) p0|s is estimated via the backward SDE (3), with the drift term approximated by the curr...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.