Multi-Scale Generative Modeling with Heat Dissipation Flow Matching

Hanquan Zhang; Haoyuan Guan; Jun Ma; Ke Zhang; Yanjun Qin

arxiv: 2605.19371 · v1 · pith:SHEKDA7Snew · submitted 2026-05-19 · 💻 cs.CV · cs.AI

Multi-Scale Generative Modeling with Heat Dissipation Flow Matching

Jun Ma , Hanquan Zhang , Yanjun Qin , Haoyuan Guan , Ke Zhang This is my paper

Pith reviewed 2026-05-20 06:20 UTC · model grok-4.3

classification 💻 cs.CV cs.AI

keywords Heat Dissipation Flow MatchingBlur-based generative modelsMulti-scale priorsFlow matchingImage generationInverse heat dissipationx-prediction

0 comments

The pith

Heat Dissipation Flow Matching integrates blur-based multi-scale priors into flow matching by aligning interpolated paths and using x-prediction.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Heat Dissipation Flow Matching to bring blur-based corruption into the flow matching framework, which normally relies on noise. Blur preserves color budgets and multi-scale details that noise-based methods lose, but it creates an ill-posed inverse process and hard high-dimensional regression under the data-manifold assumption. HDFM fixes the first problem by aligning an interpolated heat-dissipation path and the second by switching to x-prediction. Toy experiments and ablations show consistent gains from both changes, and the resulting models outperform most baselines across all tested datasets.

Core claim

Heat Dissipation Flow Matching introduces a continuous blurred process into Flow Matching to inject multi-scale priors, aligns an interpolated heat-dissipation path to address ill-posedness of the classical inverse heat-dissipation process, and adopts x-prediction to mitigate high-dimensional regression difficulty, yielding consistent benefits from both blur and x-prediction while outperforming most baseline methods on all datasets.

What carries the argument

The interpolated heat-dissipation path aligned inside the flow matching ODE, paired with x-prediction, which together inject multi-scale priors while resolving ill-posedness and regression hardness.

If this is right

Generative models can combine blur-derived multi-scale priors with ODE-based flow matching without staying inside SDE frameworks.
Ablation results indicate that both the blur corruption and the x-prediction choice contribute measurable gains on image tasks.
The method produces higher-quality outputs than most existing baselines across the evaluated datasets.
The approach keeps the training and sampling benefits of flow matching while adding multi-scale information.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar path-alignment tricks might adapt other corruption schedules, such as masking or downsampling, into flow matching.
If the data-manifold assumption weakens on non-natural-image domains, the regression advantage of x-prediction could shrink or disappear.
Hybrid models that alternate between noise and blur paths within one flow-matching run could test whether the two corruption types are complementary.
Scaling the method to video or 3D data would show whether the multi-scale priors transfer beyond static images.

Load-bearing premise

The data-manifold assumption holds and aligning an interpolated heat-dissipation path plus adopting x-prediction sufficiently resolves the ill-posedness of the classical inverse heat-dissipation process and the high-dimensional regression difficulty.

What would settle it

An experiment showing that HDFM performance drops to match or fall below non-blur flow matching baselines when the interpolated path alignment is removed on the same datasets would falsify the claim that the alignment plus x-prediction resolves the core difficulties.

Figures

Figures reproduced from arXiv: 2605.19371 by Hanquan Zhang, Haoyuan Guan, Jun Ma, Ke Zhang, Yanjun Qin.

**Figure 1.** Figure 1: Manifold Assumption View of Heat Dissipation Process: D(·) denotes the gap of dimensionality between the velocity and the data manifold. As forward heat dissipation proceeds, the data-manifold dimension further contracts, while noise and velocity still spread over the full high-dimensional space. Thus, in the heatdissipation setting, blur image prediction is different from noise or velocity prediction. B… view at source ↗

**Figure 2.** Figure 2: The Architecture of HDFM. The learned model can be viewed as a stack of ViT [Dosovitskiy et al., 2021] blocks, augmented with a lightweight LayerSync regularization constraint. 2.3 Heat-dissipation Diffusion Model IHDM [Rissanen et al., 2023] defines a multi-scale degradation process via the heat equation with Neumann boundary conditions: ∂ ∂tu(x, y, t) = ∆u(x, y, t). (4) As t → ∞, the image is progressi… view at source ↗

**Figure 3.** Figure 3: Toy Experiment in Heat Dissipation: a 2D dataset is “buried” into a D-dimensional space via a fixed orthogonal projection matrix. After applying 1D heat-dissipation blurring, a lightweight neural network is trained to reconstruct the data during inverse heat dissipation, and we visualize the resulting. 4 Experiment 4.1 Setup The compared methods fall into two categories. For heatdissipation-related appro… view at source ↗

**Figure 4.** Figure 4: HDFM transport trajectories in a toy experiment. Left: trajectories of particles in the data space, exhibiting curved paths. Right: trajectories of the same particles in the DCT (frequency) domain, which become approximately straight lines, consistent with the linear velocity property in the frequency domain (the proof see Linear velocity in the frequency domain). Time t Frequency Ratio 15 10 5 0 0.0 0.2 … view at source ↗

**Figure 5.** Figure 5: Spectral discrepancy during sampling. Left: Frequency ratio curves for JiT (full noise), HDFM, and HDFM-blur (pure blurring), where HDFM exhibits an intermediate behavior between the two extremes. Right: Attenuating the blur strength in HDFM by a factor r shifts the curve toward JiT, validating a smooth interpolation from blur-dominant to noise-dominant sampling. These suggest that HDFM can get a hybrid a… view at source ↗

**Figure 6.** Figure 6: Qualitative Results. Some examples are selected on ImageNet256. 4.4 Comparisons As shown in the [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗

read the original abstract

Diffusion models are widely used in image generation, with most relying on noise-based corruption and denoising. A distinct branch instead uses blur as the main corruption, preserving better color budgets and multi-scale detail by providing multi-scale priors. However, blur-based models remain in SDE-based frameworks and are not integrated into ODE-based frameworks, such as Flow Matching (FM). Meanwhile, in the blur-based formulation, the classical inverse heat-dissipation (IHD) process faces an ill-posed challenge. Moreover, under the data-manifold assumption, regressing blurred images from high-dimensional noise (or velocity) space is also difficult. We propose Heat Dissipation Flow Matching (HDFM), which introduces a continuous blurred (heat-dissipation) process into FM to inject multi-scale priors. HDFM aligns an interpolated heat-dissipation path to address ill-posedness and adopts $x$-prediction to mitigate high-dimensional regression difficulty. Toy experiments and ablation studies show that HDFM consistently benefits from both blur and $x$-prediction. The performance of HDFM outperforms most baseline methods on all datasets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

HDFM puts a continuous blur process into flow matching with an interpolated path and x-prediction, but the fix for ill-posedness is described without a derivation that shows stability or uniqueness.

read the letter

The paper's main move is to take the blur-based corruption that keeps multi-scale detail and color better than pure noise, and embed a continuous version of it inside an ODE flow-matching framework. They add an interpolated heat-dissipation path to handle the classical ill-posedness of inverse heat dissipation and switch to x-prediction to ease the high-dimensional regression under the manifold assumption. Toy ablations apparently show gains from both pieces, and the method beats most baselines on the datasets they tried. That combination is new enough to notice; prior blur work stayed in SDE territory and flow matching stayed noise-based, so bridging them is a reasonable next step. The construction itself is laid out clearly enough in the abstract and presumably the text. The soft spot is exactly where the stress-test flags it: the interpolated path is presented as the remedy, but there is no visible derivation showing that the resulting velocity field is Lipschitz or that solutions are unique from blurred observations. If the manifold assumption is only approximate, as it usually is for images, the regression target can still be under-determined and the claimed multi-scale benefit may not hold beyond the toy cases. No error bars or full dataset details appear in the summary, which makes the performance claims hard to weigh. This is the kind of paper that would interest people already working on ODE flows or multi-scale generative priors. A reader looking for concrete alternatives to noise diffusion might pick up usable ideas. It is coherent on its own terms and shows honest engagement with the known problems in blur models, so it deserves a serious referee to check the missing derivation and run fuller experiments.

Referee Report

3 major / 2 minor

Summary. The paper proposes Heat Dissipation Flow Matching (HDFM) to integrate a continuous blur-based heat-dissipation process into Flow Matching for image generation, injecting multi-scale priors. It claims to resolve the ill-posedness of classical inverse heat-dissipation via an aligned interpolated path and to mitigate high-dimensional regression via x-prediction under the data-manifold assumption. Toy experiments and ablations indicate consistent benefits from blur and x-prediction, with HDFM outperforming most baselines across datasets.

Significance. If the central claims hold with rigorous justification, HDFM would usefully bridge blur-based multi-scale modeling with ODE-based Flow Matching, potentially improving detail and color preservation over noise-only approaches. The explicit use of heat dissipation for priors and the x-prediction choice are concrete contributions, though significance is limited by the current lack of derivation for well-posedness and by reliance on toy-scale validation.

major comments (3)

[§3] §3 (Method): The interpolated heat-dissipation path alignment is described as addressing IHD ill-posedness, but no derivation is given showing that the resulting velocity field is Lipschitz continuous or that the ODE admits unique stable solutions from blurred observations. This leaves the resolution of ill-posedness as a heuristic rather than a proven property.
[§4] §4 (Experiments): Performance claims that HDFM 'outperforms most baseline methods on all datasets' are presented without error bars, statistical tests, or full dataset specifications, so it is impossible to verify whether the gains are robust or merely consistent with the toy ablations.
[§2–3] §2–3: The data-manifold assumption is invoked to justify both the regression difficulty and the benefit of the multi-scale prior, yet no analysis or sensitivity test is supplied for the case when the assumption holds only approximately (standard for natural images), leaving open whether the claimed resolution of high-dimensional regression actually materializes.

minor comments (2)

[§3] Notation for the heat-dissipation process and the interpolation parameter could be introduced with an explicit equation early in §3 to improve readability.
[§4] Figure captions for the toy experiments should explicitly state the metrics and number of runs used to generate the reported curves.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address each major comment below and describe the revisions we will incorporate to improve clarity and rigor.

read point-by-point responses

Referee: [§3] §3 (Method): The interpolated heat-dissipation path alignment is described as addressing IHD ill-posedness, but no derivation is given showing that the resulting velocity field is Lipschitz continuous or that the ODE admits unique stable solutions from blurred observations. This leaves the resolution of ill-posedness as a heuristic rather than a proven property.

Authors: We acknowledge that the current manuscript presents the path alignment primarily through motivation and empirical validation rather than a complete derivation of Lipschitz continuity or ODE uniqueness. The alignment is designed to keep trajectories near the data manifold, which in practice yields stable integration as shown in the toy experiments. In the revision we will expand §3 with a discussion of the well-posedness conditions under the data-manifold assumption and add a brief sketch of why the velocity field remains controlled, while noting that a fully rigorous proof is left for future work. revision: partial
Referee: [§4] §4 (Experiments): Performance claims that HDFM 'outperforms most baseline methods on all datasets' are presented without error bars, statistical tests, or full dataset specifications, so it is impossible to verify whether the gains are robust or merely consistent with the toy ablations.

Authors: We agree that the experimental section would be strengthened by greater statistical transparency. In the revised manuscript we will report means and standard deviations over multiple random seeds, provide complete dataset specifications (including exact sizes, splits, and preprocessing), and include paired statistical tests for the main comparisons. These additions will allow readers to assess the robustness of the reported improvements beyond the toy ablations. revision: yes
Referee: [§2–3] §2–3: The data-manifold assumption is invoked to justify both the regression difficulty and the benefit of the multi-scale prior, yet no analysis or sensitivity test is supplied for the case when the assumption holds only approximately (standard for natural images), leaving open whether the claimed resolution of high-dimensional regression actually materializes.

Authors: The data-manifold assumption underpins our choice of x-prediction and the multi-scale prior. While the current version does not contain an explicit sensitivity study for approximate manifolds, the image-dataset results already reflect performance under the approximate-manifold regime typical of natural images. We will add a short discussion in §2–3 that clarifies this point and, space permitting, include an additional ablation that perturbs the manifold assumption on the toy data to illustrate robustness. revision: partial

Circularity Check

0 steps flagged

No circularity in derivation; construction presented as addressing external ill-posedness

full rationale

The manuscript introduces HDFM by constructing an interpolated heat-dissipation path within Flow Matching and adopting x-prediction. No equations or self-referential definitions appear that reduce the claimed resolution of IHD ill-posedness or the multi-scale prior benefit to a fitted parameter or prior result by construction. The central steps are described as a heuristic alignment plus empirical validation via toy experiments and ablations on external datasets, without load-bearing self-citations or uniqueness theorems imported from the same authors. The derivation chain therefore remains independent of its own outputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the data-manifold assumption and the premise that the proposed alignment and prediction choices resolve the stated ill-posedness and regression difficulties; no free parameters or invented entities are mentioned in the abstract.

axioms (1)

domain assumption Data-manifold assumption
Explicitly invoked as the setting under which regressing blurred images from high-dimensional noise space is difficult.

pith-pipeline@v0.9.0 · 5728 in / 1258 out tokens · 39936 ms · 2026-05-20T06:20:30.672001+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

HDFM aligns an interpolated heat-dissipation path to address ill-posedness and adopts x-prediction to mitigate high-dimensional regression difficulty.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

41 extracted references · 41 canonical work pages · 2 internal anchors

[1]

Nature , volume=

De novo design of protein structure and function with RFdiffusion , author=. Nature , volume=. 2023 , publisher=

work page 2023
[2]

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Gr00t n1: An open foundation model for generalist humanoid robots , author=. arXiv preprint arXiv:2503.14734 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[3]

ICML , year=

Deep unsupervised learning using nonequilibrium thermodynamics , author=. ICML , year=

work page
[4]

ICLR , year=

Flow Matching for Generative Modeling , author=. ICLR , year=

work page
[5]

ICLR , year=

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow , author=. ICLR , year=

work page
[6]

ICLR , year=

Building Normalizing Flows with Stochastic Interpolants , author=. ICLR , year=

work page
[7]

Science , volume=

Scalable emulation of protein equilibrium ensembles with generative deep learning , author=. Science , volume=. 2025 , publisher=

work page 2025
[8]

IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

Diffusion models in low-level vision: A survey , author=. IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

work page
[9]

ICLR , year=

Generative Modelling with Inverse Heat Dissipation , author=. ICLR , year=

work page
[10]

ICLR , year=

Blurring Diffusion Models , author=. ICLR , year=

work page
[11]

NeurIPS , year=

Cold diffusion: Inverting arbitrary image transforms without noise , author=. NeurIPS , year=

work page
[12]

NeurIPS , year=

Denoising diffusion probabilistic models , author=. NeurIPS , year=

work page
[13]

ICLR , year=

Score-Based Generative Modeling through Stochastic Differential Equations , author=. ICLR , year=

work page
[14]

MIT Press , year=

SemiSupervised Learning , author=. MIT Press , year=

work page
[15]

Back to Basics: Let Denoising Generative Models Denoise

Back to Basics: Let Denoising Generative Models Denoise , author=. arXiv preprint arXiv:2511.13720 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[16]

ICCV , year=

Beyond Blur: A Fluid Perspective on Generative Diffusion Models , author=. ICCV , year=

work page
[17]

science , volume=

Nonlinear dimensionality reduction by locally linear embedding , author=. science , volume=. 2000 , publisher=

work page 2000
[18]

science , volume=

A global geometric framework for nonlinear dimensionality reduction , author=. science , volume=. 2000 , publisher=

work page 2000
[19]

Transactions on Machine Learning Research , year=

Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections , author=. Transactions on Machine Learning Research , year=

work page
[20]

ICLR , year=

What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models , author=. ICLR , year=

work page
[21]

ICLR , year=

Deconstructing Denoising Diffusion Models for Self-Supervised Learning , author=. ICLR , year=

work page
[22]

, author=

Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion. , author=. CVPR , year=

work page
[23]

ICLR , year=

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , author=. ICLR , year=

work page
[24]

NeurIPS , volume=

Neural ordinary differential equations , author=. NeurIPS , volume=

work page
[25]

ICLR , year=

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation , author=. ICLR , year=

work page
[26]

arXiv , year=

On the spectral bias of neural networks: International Conference on Machine Learning , author=. arXiv , year=

work page
[27]

Communications in Computational Physics , volume=

Frequency principle: Fourier analysis sheds light on deep neural networks , author=. Communications in Computational Physics , volume=. 2020 , publisher=

work page 2020
[28]

arXiv preprint arXiv:2510.12581 , year=

LayerSync: Self-aligning Intermediate Layers , author=. arXiv preprint arXiv:2510.12581 , year=

work page arXiv
[29]

ImageNet Large Scale Visual Recognition Challenge,

Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael Bernstein and Alexander C. Berg and Li Fei-Fei , Title =. 2015 , journal =. doi:10.1007/s11263-015-0816-y , volume=

work page doi:10.1007/s11263-015-0816-y 2015
[30]

IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

Places: A 10 million Image Database for Scene Recognition , author=. IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

work page
[31]

Journal of the Optical Society of America A , volume=

Relations between the statistics of natural images and the response properties of cortical cells , author=. Journal of the Optical Society of America A , volume=. 1987 , publisher=

work page 1987
[32]

NeurIPS , year=

Diffusion models beat gans on image synthesis , author=. NeurIPS , year=

work page
[33]

NeurIPS , year=

Classifier-Free Diffusion Guidance , author=. NeurIPS , year=

work page
[34]

Structure and Interpretation of Computer Programs

Harold Abelson and Gerald Jay Sussman and Julie Sussman. Structure and Interpretation of Computer Programs. 1985

work page 1985
[35]

Visual Information Extraction with Lixto

Robert Baumgartner and Georg Gottlob and Sergio Flesca. Visual Information Extraction with Lixto. Proceedings of the 27th International Conference on Very Large Databases. 2001

work page 2001
[36]

Brachman and James G

Ronald J. Brachman and James G. Schmolze. An overview of the KL-ONE knowledge representation system. Cognitive Science. 1985

work page 1985
[37]

Complexity results for nonmonotonic logics

Georg Gottlob. Complexity results for nonmonotonic logics. Journal of Logic and Computation. 1992

work page 1992
[38]

Hypertree Decompositions and Tractable Queries

Georg Gottlob and Nicola Leone and Francesco Scarcello. Hypertree Decompositions and Tractable Queries. Journal of Computer and System Sciences. 2002

work page 2002
[39]

Levesque

Hector J. Levesque. Foundations of a functional approach to knowledge representation. Artificial Intelligence. 1984

work page 1984
[40]

Levesque

Hector J. Levesque. A logic of implicit and explicit belief. Proceedings of the Fourth National Conference on Artificial Intelligence. 1984

work page 1984
[41]

On the compilability and expressive power of propositional planning formalisms

Bernhard Nebel. On the compilability and expressive power of propositional planning formalisms. Journal of Artificial Intelligence Research. 2000

work page 2000

[1] [1]

Nature , volume=

De novo design of protein structure and function with RFdiffusion , author=. Nature , volume=. 2023 , publisher=

work page 2023

[2] [2]

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Gr00t n1: An open foundation model for generalist humanoid robots , author=. arXiv preprint arXiv:2503.14734 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[3] [3]

ICML , year=

Deep unsupervised learning using nonequilibrium thermodynamics , author=. ICML , year=

work page

[4] [4]

ICLR , year=

Flow Matching for Generative Modeling , author=. ICLR , year=

work page

[5] [5]

ICLR , year=

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow , author=. ICLR , year=

work page

[6] [6]

ICLR , year=

Building Normalizing Flows with Stochastic Interpolants , author=. ICLR , year=

work page

[7] [7]

Science , volume=

Scalable emulation of protein equilibrium ensembles with generative deep learning , author=. Science , volume=. 2025 , publisher=

work page 2025

[8] [8]

IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

Diffusion models in low-level vision: A survey , author=. IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

work page

[9] [9]

ICLR , year=

Generative Modelling with Inverse Heat Dissipation , author=. ICLR , year=

work page

[10] [10]

ICLR , year=

Blurring Diffusion Models , author=. ICLR , year=

work page

[11] [11]

NeurIPS , year=

Cold diffusion: Inverting arbitrary image transforms without noise , author=. NeurIPS , year=

work page

[12] [12]

NeurIPS , year=

Denoising diffusion probabilistic models , author=. NeurIPS , year=

work page

[13] [13]

ICLR , year=

Score-Based Generative Modeling through Stochastic Differential Equations , author=. ICLR , year=

work page

[14] [14]

MIT Press , year=

SemiSupervised Learning , author=. MIT Press , year=

work page

[15] [15]

Back to Basics: Let Denoising Generative Models Denoise

Back to Basics: Let Denoising Generative Models Denoise , author=. arXiv preprint arXiv:2511.13720 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[16] [16]

ICCV , year=

Beyond Blur: A Fluid Perspective on Generative Diffusion Models , author=. ICCV , year=

work page

[17] [17]

science , volume=

Nonlinear dimensionality reduction by locally linear embedding , author=. science , volume=. 2000 , publisher=

work page 2000

[18] [18]

science , volume=

A global geometric framework for nonlinear dimensionality reduction , author=. science , volume=. 2000 , publisher=

work page 2000

[19] [19]

Transactions on Machine Learning Research , year=

Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections , author=. Transactions on Machine Learning Research , year=

work page

[20] [20]

ICLR , year=

What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models , author=. ICLR , year=

work page

[21] [21]

ICLR , year=

Deconstructing Denoising Diffusion Models for Self-Supervised Learning , author=. ICLR , year=

work page

[22] [22]

, author=

Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion. , author=. CVPR , year=

work page

[23] [23]

ICLR , year=

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , author=. ICLR , year=

work page

[24] [24]

NeurIPS , volume=

Neural ordinary differential equations , author=. NeurIPS , volume=

work page

[25] [25]

ICLR , year=

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation , author=. ICLR , year=

work page

[26] [26]

arXiv , year=

On the spectral bias of neural networks: International Conference on Machine Learning , author=. arXiv , year=

work page

[27] [27]

Communications in Computational Physics , volume=

Frequency principle: Fourier analysis sheds light on deep neural networks , author=. Communications in Computational Physics , volume=. 2020 , publisher=

work page 2020

[28] [28]

arXiv preprint arXiv:2510.12581 , year=

LayerSync: Self-aligning Intermediate Layers , author=. arXiv preprint arXiv:2510.12581 , year=

work page arXiv

[29] [29]

ImageNet Large Scale Visual Recognition Challenge,

Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael Bernstein and Alexander C. Berg and Li Fei-Fei , Title =. 2015 , journal =. doi:10.1007/s11263-015-0816-y , volume=

work page doi:10.1007/s11263-015-0816-y 2015

[30] [30]

IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

Places: A 10 million Image Database for Scene Recognition , author=. IEEE Transactions on Pattern Analysis and Machine Intelligence , year=

work page

[31] [31]

Journal of the Optical Society of America A , volume=

Relations between the statistics of natural images and the response properties of cortical cells , author=. Journal of the Optical Society of America A , volume=. 1987 , publisher=

work page 1987

[32] [32]

NeurIPS , year=

Diffusion models beat gans on image synthesis , author=. NeurIPS , year=

work page

[33] [33]

NeurIPS , year=

Classifier-Free Diffusion Guidance , author=. NeurIPS , year=

work page

[34] [34]

Structure and Interpretation of Computer Programs

Harold Abelson and Gerald Jay Sussman and Julie Sussman. Structure and Interpretation of Computer Programs. 1985

work page 1985

[35] [35]

Visual Information Extraction with Lixto

Robert Baumgartner and Georg Gottlob and Sergio Flesca. Visual Information Extraction with Lixto. Proceedings of the 27th International Conference on Very Large Databases. 2001

work page 2001

[36] [36]

Brachman and James G

Ronald J. Brachman and James G. Schmolze. An overview of the KL-ONE knowledge representation system. Cognitive Science. 1985

work page 1985

[37] [37]

Complexity results for nonmonotonic logics

Georg Gottlob. Complexity results for nonmonotonic logics. Journal of Logic and Computation. 1992

work page 1992

[38] [38]

Hypertree Decompositions and Tractable Queries

Georg Gottlob and Nicola Leone and Francesco Scarcello. Hypertree Decompositions and Tractable Queries. Journal of Computer and System Sciences. 2002

work page 2002

[39] [39]

Levesque

Hector J. Levesque. Foundations of a functional approach to knowledge representation. Artificial Intelligence. 1984

work page 1984

[40] [40]

Levesque

Hector J. Levesque. A logic of implicit and explicit belief. Proceedings of the Fourth National Conference on Artificial Intelligence. 1984

work page 1984

[41] [41]

On the compilability and expressive power of propositional planning formalisms

Bernhard Nebel. On the compilability and expressive power of propositional planning formalisms. Journal of Artificial Intelligence Research. 2000

work page 2000