Where the Score Lives: A Wavelet View of Diffusion

Binxu Wang; Demba E. Ba; Emma Finn; T. Anderson Keller

arxiv: 2606.08309 · v1 · pith:ZHQNLHHUnew · submitted 2026-06-06 · 💻 cs.LG · cs.CV

Where the Score Lives: A Wavelet View of Diffusion

Emma Finn , Binxu Wang , T. Anderson Keller , Demba E. Ba This is my paper

Pith reviewed 2026-06-27 19:59 UTC · model grok-4.3

classification 💻 cs.LG cs.CV

keywords score-based generative modelsdiffusion modelswavelet basisscore functiondata momentsdenoisinginductive biasesgenerative behavior

0 comments

The pith

Expanding the score function in a 2D wavelet basis makes it analytically solvable in terms of the moments of the data distribution.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a parameterization of the score function for diffusion models using an expansion in a 2D orthogonal wavelet basis. This yields optimal score functions that are explicit functions of the moments of the data. A sympathetic reader would care because it offers an architecture-independent way to analyze which features of the data distribution are most important for effective denoising. It also shows how this approach can reproduce some behaviors of common networks like U-Nets without committing to their specific design. This helps explain why different score approximators produce different generative results.

Core claim

By expanding the score in a 2D orthogonal wavelet basis, the authors obtain an analytically solvable parameterization whose coefficients are directly determined by the moments of the data distribution. This moment-based form is interpretable and flexible enough to partially mimic the inductive biases of architectures such as U-Nets and CNNs, providing an architecture-agnostic view of what attributes matter most for denoising in score-based generative models.

What carries the argument

The 2D orthogonal wavelet basis expansion of the score function, which turns it into an explicit sum over moment-derived coefficients.

If this is right

The moment coefficients reveal which attributes of the data distribution matter most for denoising.
This parameterization can reproduce relevant inductive biases of U-Nets and CNNs.
Researchers can analyze how the data distribution interacts with the score network independently of architecture choice.
Distinct generative behaviors across architectures can be traced to differences in how they approximate these moment-based scores.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This approach might allow construction of new score networks that directly incorporate moment calculations for improved interpretability.
Testing the wavelet scores on datasets with known moment structures could show whether they match or exceed standard diffusion performance.
Connections to other basis expansions like Fourier might reveal similar moment-based insights in different domains.

Load-bearing premise

That expanding the score in wavelets produces coefficients based on moments that capture the data attributes essential for good denoising performance.

What would settle it

Running the wavelet-based score on standard image datasets and finding that the generated samples have significantly worse quality metrics than those from a U-Net score network would indicate the parameterization does not capture the necessary attributes.

Figures

Figures reproduced from arXiv: 2606.08309 by Binxu Wang, Demba E. Ba, Emma Finn, T. Anderson Keller.

**Figure 2.** Figure 2: Wavelet Expansion of the Score. The score of a given image Xt can be expanded in an orthonormal wavelet basis by representing it as a sum of coefficients ci(Xt) multiplied by corresponding wavelets wi (Equation 5). Since the wavelets come in many translated copies, we only display wavelets at a single spatial location k, and display coefficients in spatially ordered ‘feature maps’. The sum over spatial lo… view at source ↗

**Figure 3.** Figure 3: Wavelet Coefficient Approximation. We approximate the unknown true coefficients ci(Xt) of the true score function as an inner product of features φi(Xt) and parameters α (t) i . The features depicted here are independent degree 3 polynomial features. 2.2 Correlation Structures in the Data Natural images exhibit structured dependencies in the wavelet domain: heavy–tailed marginals per coefficient, co-activa… view at source ↗

**Figure 4.** Figure 4: Visualization of noise regimes. Clean image σ = 0 (left) to highly noised σ = 4 (right) mately orthonormal linear operator, B ∈ R p×d where p is the number of features and d = H × W is the flattened dimension of the image. B maps vectorized images Xt ∈ R d to wavelet coefficients. Noise Model We add Gaussian noise according to the variance exploding regime Xt = X0 + σZ for Z ∼ N (0, 1). We consider four no… view at source ↗

**Figure 6.** Figure 6: MNIST-64. Denoising MSE with (a) independent features, and (b) band-tied coupling on larger images. We see higher degree polynomials are more performant with larger images, compared with [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

**Figure 5.** Figure 5: MNIST-32. (a) Denoising MSE with independent monomial features across three sets of features: degrees 1, 2, & 3. (b) Same features with band-tied coupling, and (c) local coupling. (d) Comparable performance of a variety of trained U-Net score function approximators. (a) Independent Features (b) Band-Tied Coupling [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 7.** Figure 7: Comparison of linear baseline vs. wavelet models (D=3) on MNIST-32 and MNIST-64. We see the [PITH_FULL_IMAGE:figures/full_fig_p013_7.png] view at source ↗

**Figure 8.** Figure 8: Comparison of independent Hermite polynomials across different images sizes and degrees. Increasing [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗

**Figure 9.** Figure 9: Denoising results for the independent monomial model on two image sizes (columns) and across ranges [PITH_FULL_IMAGE:figures/full_fig_p015_9.png] view at source ↗

**Figure 10.** Figure 10: Denoising results for the band-tied model on 32 by 32 images, across ranges of noise levels (groups of 3 [PITH_FULL_IMAGE:figures/full_fig_p016_10.png] view at source ↗

**Figure 11.** Figure 11: MNIST samples generated by different UNet-CNN EDM diffusion model variants. Each row shows [PITH_FULL_IMAGE:figures/full_fig_p017_11.png] view at source ↗

**Figure 12.** Figure 12: Generated samples for Independent vs. Band models at varying polynomial degrees. Each column [PITH_FULL_IMAGE:figures/full_fig_p018_12.png] view at source ↗

read the original abstract

Score-based generative models have had remarkable success over the last decade in generating a diverse set of visually plausible images. A variety of architectures including CNNs, U-Nets, and Transformers have been used as the score-approximation network in such diffusion modeling; however, to date, relatively little is known about how these architectural choices impact generative behavior. In this work, to provide insight into this area, we propose an analytically solvable parameterization of the score function using an expansion in a 2D orthogonal wavelet basis. In particular, we derive interpretable optimal score functions in terms of the moments of the data distribution. We use this parametrization to provide an architecture-agnostic, moment-based analysis that reveals which attributes of the data distribution tend to matter most for denoising. Our score machine is flexible enough to partially mimic the relevant inductive biases of multiple architectures, including U-Nets, and CNNs, taking a step towards understanding why different score architectures can exhibit distinct generative behavior. Since our score is solvable in terms of the moments of the data, we can begin to understand how the data distribution interacts with the score network to produce the behavior we observe in diffusion models.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The wavelet parameterization claims an analytically solvable score from data moments, but the reduction from orthogonality alone needs explicit justification that the abstract does not supply.

read the letter

The main takeaway is that this paper tries to give an architecture-agnostic handle on score functions in diffusion by expanding them in a 2D orthogonal wavelet basis and tying the coefficients to moments of the data distribution. If the steps hold, it supplies a concrete way to see which distribution attributes drive denoising and why different networks produce different outputs.

What is actually new is the specific wavelet parameterization and the derivation of optimal scores expressed directly through moments. The paper uses this to analyze inductive biases across U-Nets and CNNs without committing to one architecture, which is a distinct move from the usual empirical comparisons.

The work does a reasonable job framing the open question about why architectural choices matter and offering a moment-based lens as a potential answer. The claim that the parameterization can partially mimic relevant biases is a useful direction.

The soft spot sits at the central claim. Orthogonality of the basis does not by itself make the projected score depend only on a finite set of moments; the forward process would have to cancel the rest, and the abstract gives no indication of why that happens here. The stress-test concern lands because standard wavelet and diffusion properties do not automatically deliver the reduction. Without the explicit derivation in hand it is difficult to judge whether the math closes or whether the result is more of a reparameterization. Experiments validating the moment-based analysis would also help.

This is for readers working on the theory of score-based models who want analytical tools rather than another architecture tweak. It deserves a serious referee to check the derivations and see whether the moment reduction actually works.

Referee Report

1 major / 0 minor

Summary. The paper proposes an analytically solvable parameterization of the score function ∇_x log p_t(x) in diffusion models by expanding it in a 2D orthogonal wavelet basis. It claims to derive interpretable optimal score functions whose coefficients depend only on the moments of the clean data distribution, enabling an architecture-agnostic moment-based analysis of which data attributes matter most for denoising and how the parameterization can partially mimic inductive biases of U-Nets and CNNs.

Significance. If the central derivation holds, the work would offer a concrete, moment-driven lens on score networks that is independent of specific architectures, potentially explaining observed differences in generative behavior across models. The explicit link to data moments and the claim of analytical solvability would be a notable contribution to the theoretical understanding of diffusion models.

major comments (1)

[Abstract / Central Claim] The central claim (abstract and introduction) that an expansion in a 2D orthogonal wavelet basis produces coefficients that are exactly functions of the (finite set of) moments of the data distribution, yielding an analytically solvable score, is not automatic from orthogonality or standard wavelet properties. The manuscript must provide the explicit derivation showing how the projection ∫ ∇_x log p_t(x) ψ_{j,k}(x) dx reduces to moment terms under the forward diffusion process; without this step the analytical solvability and moment-based analysis rest on an unproven reduction.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their careful reading and constructive feedback. We address the single major comment below.

read point-by-point responses

Referee: [Abstract / Central Claim] The central claim (abstract and introduction) that an expansion in a 2D orthogonal wavelet basis produces coefficients that are exactly functions of the (finite set of) moments of the data distribution, yielding an analytically solvable score, is not automatic from orthogonality or standard wavelet properties. The manuscript must provide the explicit derivation showing how the projection ∫ ∇_x log p_t(x) ψ_{j,k}(x) dx reduces to moment terms under the forward diffusion process; without this step the analytical solvability and moment-based analysis rest on an unproven reduction.

Authors: We agree that the reduction from the wavelet projection of the score to explicit moment terms requires a fully explicit derivation rather than relying on standard wavelet orthogonality alone. In the revised manuscript we will insert a new subsection that starts from the closed-form expression for log p_t under the Gaussian forward process, substitutes the wavelet expansion of the score, and shows term-by-term how each coefficient integral collapses to a finite combination of raw moments of the clean data distribution via the moment-generating function of the diffusion kernel. revision: yes

Circularity Check

0 steps flagged

Wavelet expansion of score derived independently from data moments; no reduction to inputs by construction

full rationale

The paper presents a derivation of an analytically solvable score parameterization via 2D orthogonal wavelet basis expansion, with coefficients expressed as functions of data distribution moments. No load-bearing self-citations, self-definitional steps, or fitted parameters renamed as predictions appear in the provided claims. The central result is framed as following from wavelet orthogonality and diffusion forward process properties, remaining self-contained against external benchmarks without circular reduction. This matches the common honest outcome of score 0-2.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities; the central claim rests on the unelaborated premise that the wavelet basis permits analytical solution in terms of moments.

pith-pipeline@v0.9.1-grok · 5742 in / 1013 out tokens · 14368 ms · 2026-06-27T19:59:45.360461+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

48 extracted references · 5 canonical work pages · 3 internal anchors

[1]

Allen , keywords =

Broughton, S. Allen , keywords =. 2009 , title =

2009
[2]

, author=

Natural image statistics and neural representation. , author=. Annual review of neuroscience , year=
[3]

Scale Mixtures of Gaussians and the Statistics of Natural Images , url =

Wainwright, Martin J and Simoncelli, Eero , booktitle =. Scale Mixtures of Gaussians and the Statistics of Natural Images , url =. 1999 , bdsk-url-1 =

1999
[4]

Ideal spatial adaptation by wavelet shrinkage , url =

Donoho, David L and Johnstone, Iain M , doi =. Ideal spatial adaptation by wavelet shrinkage , url =. Biometrika , month = sep, note =. 1994 , bdsk-url-1 =

1994
[5]

2021 , eprint=

Score-Based Generative Modeling through Stochastic Differential Equations , author=. 2021 , eprint=

2021
[6]

Advances in Neural Information Processing Systems , volume=

Generative Modeling by Estimating Gradients of the Data Distribution , author=. Advances in Neural Information Processing Systems , volume=. 2019 , url=

2019
[7]

Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI) , pages=

Sliced Score Matching: A Scalable Approach to Density and Score Estimation , author=. Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI) , pages=. 2019 , url=

2019
[8]

NeurIPS , year=

Elucidating the Design Space of Diffusion-Based Generative Models , author=. NeurIPS , year=
[9]

Denoising Diffusion Implicit Models

Denoising Diffusion Implicit Models , author=. arXiv preprint arXiv:2010.02502 , year=

work page internal anchor Pith review Pith/arXiv arXiv 2010
[10]

2024 , eprint=

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution , author=. 2024 , eprint=

2024
[11]

2023 , eprint=

Simple diffusion: End-to-end diffusion for high resolution images , author=. 2023 , eprint=

2023
[12]

Neural Computation , volume=

A Connection Between Score Matching and Denoising Autoencoders , author=. Neural Computation , volume=
[13]

1992 , publisher=

Ten Lectures on Wavelets , author=. 1992 , publisher=

1992
[14]

IEEE Transactions on Image Processing , volume=

Adaptive Wavelet Thresholding for Image Denoising and Compression , author=. IEEE Transactions on Image Processing , volume=
[15]

IEEE Transactions on Image Processing , volume=

Image Denoising Using Scale Mixtures of Gaussians in the Wavelet Domain , author=. IEEE Transactions on Image Processing , volume=
[16]

IEEE Transactions on Pattern Analysis and Machine Intelligence , volume=

Invariant Scattering Convolution Networks , author=. IEEE Transactions on Pattern Analysis and Machine Intelligence , volume=
[17]

The Steerable Pyramid: A Flexible Architecture for Multi-Scale Derivative Computation , author=. Proc. IEEE Intl. Conf. on Image Processing (ICIP) , volume=
[18]

IEEE Transactions on Image Processing , volume=

Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , author=. IEEE Transactions on Image Processing , volume=
[19]

Journal of the American Statistical Association , volume=

Tweedie's Formula and Selection Bias , author=. Journal of the American Statistical Association , volume=
[20]

Bulletin of the International Statistical Institute , volume=

An Empirical Bayes Estimator of the Mean of a Normal Population , author=. Bulletin of the International Statistical Institute , volume=
[21]

Technometrics , volume=

Ridge Regression: Biased Estimation for Nonorthogonal Problems , author=. Technometrics , volume=
[22]

2003 , eprint=

Wavelet Notes , author=. 2003 , eprint=

2003
[23]

Daubechies, Orthonormal bases of compactly supported wavelets, Communications on Pure and Applied Mathematics 41 (7) (1988) 909–996.doi:10.1002/cpa.3160410705

Daubechies, Ingrid , title =. Communications on Pure and Applied Mathematics , volume =. doi:https://doi.org/10.1002/cpa.3160410705 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1002/cpa.3160410705 , abstract =

work page doi:10.1002/cpa.3160410705
[24]

2024 , eprint=

Generalization in diffusion models arises from geometry-adaptive harmonic representations , author=. 2024 , eprint=

2024
[25]

Kevin P. Murphy. Probabilistic Machine Learning: Advanced Topics
[26]

2025 , eprint=

Towards a Mechanistic Explanation of Diffusion Model Generalization , author=. 2025 , eprint=

2025
[27]

2025 , eprint=

Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training , author=. 2025 , eprint=

2025
[28]

2024 , eprint=

An analytic theory of creativity in convolutional diffusion models , author=. 2024 , eprint=

2024
[29]

NeurIPS 2024 Workshop on Scientific Methods for Understanding Deep Learning , year=

Memorization to Generalization: The Emergence of Diffusion Models from Associative Memory , author=. NeurIPS 2024 Workshop on Scientific Methods for Understanding Deep Learning , year=

2024
[30]

2024 , eprint=

The Unreasonable Effectiveness of Gaussian Score Approximation for Diffusion Models and its Applications , author=. 2024 , eprint=

2024
[31]

Denoising Diffusion Probabilistic Models

Jonathan Ho and Ajay Jain and Pieter Abbeel , title =. CoRR , volume =. 2020 , url =. 2006.11239 , timestamp =

work page internal anchor Pith review Pith/arXiv arXiv 2020
[32]

1992 , title =

Daubechies, Ingrid , keywords =. 1992 , title =

1992
[33]

2023 , eprint=

Wavelet Diffusion Models are fast and scalable Image Generators , author=. 2023 , eprint=

2023
[34]

2024 , eprint=

A Good Score Does not Lead to A Good Generative Model , author=. 2024 , eprint=

2024
[35]

2022 , eprint=

Wavelet Score-Based Generative Modeling , author=. 2022 , eprint=

2022
[36]

Estimation of Non-Normalized Statistical Models by Score Matching , journal =

Aapo Hyv. Estimation of Non-Normalized Statistical Models by Score Matching , journal =. 2005 , volume =

2005
[37]

2023 , eprint=

A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEs , author=. 2023 , eprint=

2023
[38]

, journal=

Donoho, D.L. , journal=. De-noising by soft-thresholding , year=
[39]

Proceedings of the 32nd International Conference on Machine Learning , series =

Deep Unsupervised Learning using Nonequilibrium Thermodynamics , author =. Proceedings of the 32nd International Conference on Machine Learning , series =. 2015 , url =

2015
[40]

Advances in Neural Information Processing Systems , volume =

Denoising Diffusion Probabilistic Models , author =. Advances in Neural Information Processing Systems , volume =. 2020 , url =

2020
[41]

, journal=

Mallat, S.G. , journal=. A theory for multiresolution signal decomposition: the wavelet representation , year=
[42]

minimal-diffusion: A minimal yet resourceful implementation of diffusion models , howpublished =
[43]

2025 , eprint=

Wavelet Diffusion Neural Operator , author=. 2025 , eprint=

2025
[44]

2025 , eprint=

Latent Wavelet Diffusion For Ultra-High-Resolution Image Synthesis , author=. 2025 , eprint=

2025
[45]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =

Zhang, Jinjin and Huang, Qiuyu and Liu, Junjie and Guo, Xiefan and Huang, Di , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =. 2025 , pages =

2025
[46]

LeCun, Yann and Cortes, Corinna and Burges, Christopher J. C. , title =. 1998 , note =

1998
[47]

An Analytical Theory of Spectral Bias in the Learning Dynamics of Diffusion Models

Wang, Binxu and Pehlevan, Cengiz , title =. Advances in Neural Information Processing Systems , year =. 2503.03206 , archivePrefix =

work page internal anchor Pith review Pith/arXiv arXiv
[48]

arXiv preprint arXiv:2509.09672 , year=

Locality in image diffusion models emerges from data statistics , author=. arXiv preprint arXiv:2509.09672 , year=

work page arXiv

[1] [1]

Allen , keywords =

Broughton, S. Allen , keywords =. 2009 , title =

2009

[2] [2]

, author=

Natural image statistics and neural representation. , author=. Annual review of neuroscience , year=

[3] [3]

Scale Mixtures of Gaussians and the Statistics of Natural Images , url =

Wainwright, Martin J and Simoncelli, Eero , booktitle =. Scale Mixtures of Gaussians and the Statistics of Natural Images , url =. 1999 , bdsk-url-1 =

1999

[4] [4]

Ideal spatial adaptation by wavelet shrinkage , url =

Donoho, David L and Johnstone, Iain M , doi =. Ideal spatial adaptation by wavelet shrinkage , url =. Biometrika , month = sep, note =. 1994 , bdsk-url-1 =

1994

[5] [5]

2021 , eprint=

Score-Based Generative Modeling through Stochastic Differential Equations , author=. 2021 , eprint=

2021

[6] [6]

Advances in Neural Information Processing Systems , volume=

Generative Modeling by Estimating Gradients of the Data Distribution , author=. Advances in Neural Information Processing Systems , volume=. 2019 , url=

2019

[7] [7]

Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI) , pages=

Sliced Score Matching: A Scalable Approach to Density and Score Estimation , author=. Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI) , pages=. 2019 , url=

2019

[8] [8]

NeurIPS , year=

Elucidating the Design Space of Diffusion-Based Generative Models , author=. NeurIPS , year=

[9] [9]

Denoising Diffusion Implicit Models

Denoising Diffusion Implicit Models , author=. arXiv preprint arXiv:2010.02502 , year=

work page internal anchor Pith review Pith/arXiv arXiv 2010

[10] [10]

2024 , eprint=

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution , author=. 2024 , eprint=

2024

[11] [11]

2023 , eprint=

Simple diffusion: End-to-end diffusion for high resolution images , author=. 2023 , eprint=

2023

[12] [12]

Neural Computation , volume=

A Connection Between Score Matching and Denoising Autoencoders , author=. Neural Computation , volume=

[13] [13]

1992 , publisher=

Ten Lectures on Wavelets , author=. 1992 , publisher=

1992

[14] [14]

IEEE Transactions on Image Processing , volume=

Adaptive Wavelet Thresholding for Image Denoising and Compression , author=. IEEE Transactions on Image Processing , volume=

[15] [15]

IEEE Transactions on Image Processing , volume=

Image Denoising Using Scale Mixtures of Gaussians in the Wavelet Domain , author=. IEEE Transactions on Image Processing , volume=

[16] [16]

IEEE Transactions on Pattern Analysis and Machine Intelligence , volume=

Invariant Scattering Convolution Networks , author=. IEEE Transactions on Pattern Analysis and Machine Intelligence , volume=

[17] [17]

The Steerable Pyramid: A Flexible Architecture for Multi-Scale Derivative Computation , author=. Proc. IEEE Intl. Conf. on Image Processing (ICIP) , volume=

[18] [18]

IEEE Transactions on Image Processing , volume=

Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , author=. IEEE Transactions on Image Processing , volume=

[19] [19]

Journal of the American Statistical Association , volume=

Tweedie's Formula and Selection Bias , author=. Journal of the American Statistical Association , volume=

[20] [20]

Bulletin of the International Statistical Institute , volume=

An Empirical Bayes Estimator of the Mean of a Normal Population , author=. Bulletin of the International Statistical Institute , volume=

[21] [21]

Technometrics , volume=

Ridge Regression: Biased Estimation for Nonorthogonal Problems , author=. Technometrics , volume=

[22] [22]

2003 , eprint=

Wavelet Notes , author=. 2003 , eprint=

2003

[23] [23]

Daubechies, Orthonormal bases of compactly supported wavelets, Communications on Pure and Applied Mathematics 41 (7) (1988) 909–996.doi:10.1002/cpa.3160410705

Daubechies, Ingrid , title =. Communications on Pure and Applied Mathematics , volume =. doi:https://doi.org/10.1002/cpa.3160410705 , url =. https://onlinelibrary.wiley.com/doi/pdf/10.1002/cpa.3160410705 , abstract =

work page doi:10.1002/cpa.3160410705

[24] [24]

2024 , eprint=

Generalization in diffusion models arises from geometry-adaptive harmonic representations , author=. 2024 , eprint=

2024

[25] [25]

Kevin P. Murphy. Probabilistic Machine Learning: Advanced Topics

[26] [26]

2025 , eprint=

Towards a Mechanistic Explanation of Diffusion Model Generalization , author=. 2025 , eprint=

2025

[27] [27]

2025 , eprint=

Why Diffusion Models Don't Memorize: The Role of Implicit Dynamical Regularization in Training , author=. 2025 , eprint=

2025

[28] [28]

2024 , eprint=

An analytic theory of creativity in convolutional diffusion models , author=. 2024 , eprint=

2024

[29] [29]

NeurIPS 2024 Workshop on Scientific Methods for Understanding Deep Learning , year=

Memorization to Generalization: The Emergence of Diffusion Models from Associative Memory , author=. NeurIPS 2024 Workshop on Scientific Methods for Understanding Deep Learning , year=

2024

[30] [30]

2024 , eprint=

The Unreasonable Effectiveness of Gaussian Score Approximation for Diffusion Models and its Applications , author=. 2024 , eprint=

2024

[31] [31]

Denoising Diffusion Probabilistic Models

Jonathan Ho and Ajay Jain and Pieter Abbeel , title =. CoRR , volume =. 2020 , url =. 2006.11239 , timestamp =

work page internal anchor Pith review Pith/arXiv arXiv 2020

[32] [32]

1992 , title =

Daubechies, Ingrid , keywords =. 1992 , title =

1992

[33] [33]

2023 , eprint=

Wavelet Diffusion Models are fast and scalable Image Generators , author=. 2023 , eprint=

2023

[34] [34]

2024 , eprint=

A Good Score Does not Lead to A Good Generative Model , author=. 2024 , eprint=

2024

[35] [35]

2022 , eprint=

Wavelet Score-Based Generative Modeling , author=. 2022 , eprint=

2022

[36] [36]

Estimation of Non-Normalized Statistical Models by Score Matching , journal =

Aapo Hyv. Estimation of Non-Normalized Statistical Models by Score Matching , journal =. 2005 , volume =

2005

[37] [37]

2023 , eprint=

A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEs , author=. 2023 , eprint=

2023

[38] [38]

, journal=

Donoho, D.L. , journal=. De-noising by soft-thresholding , year=

[39] [39]

Proceedings of the 32nd International Conference on Machine Learning , series =

Deep Unsupervised Learning using Nonequilibrium Thermodynamics , author =. Proceedings of the 32nd International Conference on Machine Learning , series =. 2015 , url =

2015

[40] [40]

Advances in Neural Information Processing Systems , volume =

Denoising Diffusion Probabilistic Models , author =. Advances in Neural Information Processing Systems , volume =. 2020 , url =

2020

[41] [41]

, journal=

Mallat, S.G. , journal=. A theory for multiresolution signal decomposition: the wavelet representation , year=

[42] [42]

minimal-diffusion: A minimal yet resourceful implementation of diffusion models , howpublished =

[43] [43]

2025 , eprint=

Wavelet Diffusion Neural Operator , author=. 2025 , eprint=

2025

[44] [44]

2025 , eprint=

Latent Wavelet Diffusion For Ultra-High-Resolution Image Synthesis , author=. 2025 , eprint=

2025

[45] [45]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =

Zhang, Jinjin and Huang, Qiuyu and Liu, Junjie and Guo, Xiefan and Huang, Di , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =. 2025 , pages =

2025

[46] [46]

LeCun, Yann and Cortes, Corinna and Burges, Christopher J. C. , title =. 1998 , note =

1998

[47] [47]

An Analytical Theory of Spectral Bias in the Learning Dynamics of Diffusion Models

Wang, Binxu and Pehlevan, Cengiz , title =. Advances in Neural Information Processing Systems , year =. 2503.03206 , archivePrefix =

work page internal anchor Pith review Pith/arXiv arXiv

[48] [48]

arXiv preprint arXiv:2509.09672 , year=

Locality in image diffusion models emerges from data statistics , author=. arXiv preprint arXiv:2509.09672 , year=

work page arXiv