Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution

Chao Zhou; Hongbo Wang; Huaibo Huang; Jinhua Hao; Pin Wang; Ran He

arxiv: 2605.23264 · v1 · pith:U2FVKLGTnew · submitted 2026-05-22 · 💻 cs.CV · cs.AI

Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution

Hongbo Wang , Huaibo Huang , Pin Wang , Jinhua Hao , Chao Zhou , Ran He This is my paper

Pith reviewed 2026-05-25 04:38 UTC · model grok-4.3

classification 💻 cs.CV cs.AI

keywords image super-resolutiongenerative modelssobolev geometryspectral alignmentadversarial trainingnoise kernelriesz theoremartifact reduction

0 comments

The pith

Recasting generative super-resolution into Sobolev Riemannian geometry by coloring noise to match spectral decay yields more faithful restorations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Generative image super-resolution often introduces artifacts because isotropic Gaussian noise does not match the spectral decay of natural images. The paper claims this misalignment can be fixed by coloring the noise kernel to follow natural spectra inside a Sobolev-induced geometry. An adversary based on the Riesz theorem then provides worst-case gradients to guide the flow toward realistic structures. This setup is said to outperform standard generative methods in spectral consistency and fidelity. If true, it would allow more trustworthy high-resolution image generation without relying on post-hoc fixes.

Core claim

The paper establishes that driving the generative flow in a Sobolev-induced Riemannian geometry, with the noise transition kernel colored to mirror natural spectral decay, and using a Riesz Representation Theorem-based parametric adversary to synthesize worst-case Sobolev gradients, aligns the process to the tangent space of the natural image manifold, resulting in improved spectral consistency and structural fidelity over baselines.

What carries the argument

Colored noise transition kernel in Sobolev-induced Riemannian geometry with Riesz-based parametric adversary for worst-case gradient synthesis.

If this is right

ASASR outperforms generative baselines in spectral consistency.
It better preserves structural fidelity in super-resolved images.
The method mitigates artifacts through geometric alignment.
Optimization is directed along plausible structural failure tangents.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The colored noise approach may extend to other frequency-sensitive generation tasks like video upscaling.
Combining this with direct preference optimization could further refine alignment.
Empirical tests on out-of-distribution images would check if the manifold alignment generalizes.

Load-bearing premise

Recasting the generative flow into Sobolev-induced Riemannian geometry by coloring the noise will bridge the spectral misalignment with the natural image manifold.

What would settle it

A side-by-side comparison on standard benchmarks where ASASR shows no gain in spectral metrics or introduces more artifacts than baselines would disprove the effectiveness of the alignment.

Figures

Figures reproduced from arXiv: 2605.23264 by Chao Zhou, Hongbo Wang, Huaibo Huang, Jinhua Hao, Pin Wang, Ran He.

**Figure 1.** Figure 1: Visual comparison with state-of-the-art SR methods. The proposed ASASR achieves superior perceptual quality, generating more realistic textures and faithful structural details from the low-quality input. shapes the optimization objective, mathematically evolving the implicit distance metric into the Sobolev norm Hs . By traversing the solution space within this Sobolev-induced Riemannian geometry, the mode… view at source ↗

**Figure 2.** Figure 2: Conceptual illustration of spectral misalignment and our proposed ASASR. (a) Standard SR frameworks always assume an isotropic Euclidean space, a simplification that neglects the intrinsic spectral nature of real-world data. This geometric mismatch projects the generated candidate xhq onto a manifold disjoint from the ground truth xgt, resulting in the significant spectral discrepancy (hatched region). (b)… view at source ↗

**Figure 3.** Figure 3: Power Spectral Density analysis relative to the Natural Dataset. The ℓ 2 baseline exhibits noticeable decay in high frequencies, illustrating the spectral bias inherent to Euclidean constraint. In contrast, our Sobolev constraint closely aligns with the empirical distribution, effectively preserving fine-grained structural fidelity. final objective, the S-DPO: LS-DPO(θ) = −E(c,xw 1 ,xl 1 )∼D,t∼U(0,T) h lo… view at source ↗

**Figure 4.** Figure 4: Visualization of targeted negatives synthesized. These samples constitute realistic structural artifacts, such as text deformations and architectural distortions, serving as hard negatives. derive: xb a 1 = x w t + (1 − t) · vϕ(x w t , t, c), (16) Critically, to enforce semantic alignment, we re-project this degraded estimate back to the flow state x a t using the identical noise realization x0 from the … view at source ↗

**Figure 5.** Figure 5: Qualitative comparisons on both synthetic (the first two rows) and real-world (the last two rows) benchmarks. formed YCbCr space) as reference-based distortion metrics, LPIPS (Zhang et al., 2018) and DISTS (Ding et al., 2020) as reference-based perceptual metrics, MANIQA (Yang et al., 2022), MUSIQ (Ke et al., 2021) and CLIPIQA (Wang et al., 2022) as no-reference metrics. Baselines. We evaluate our proposed… view at source ↗

**Figure 6.** Figure 6: Visual and Spectral Fidelity. Top: Super-resolution results. Bottom: FFT spectra with GT-difference insets annotated with LSD scores. ASASR achieves the lowest LSD (27.35), quantitatively confirming its superior alignment with the ground truth spectral distribution, evidenced by minimal residuals compared to baselines [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

**Figure 7.** Figure 7: Impact analysis on the Sobolev index s. We select s = 1.5 (marked by star) as the optimal trade-off between structural fidelity and texture realism. ometry that regularizes its optimization trajectory. As a result, applying AMG without SSR may introduce aggressive high-frequency details that benefit perceptual realism less consistently and can impair distortion-oriented metrics, while their combination en… view at source ↗

**Figure 8.** Figure 8: Visualization of user study results. (a) Top-1 selection ratio, where ASASR secures a dominant 91.1% of user votes. (b) Top-K cumulative rankings, demonstrating that our method is consistently favored as the highest-quality restoration among competing baselines across all K levels. 18 [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗

**Figure 9.** Figure 9: Visual comparisons on synthesis datasets. We compare ASASR against state-of-the-art GAN-based and diffusion-based methods. As observed, our method achieves superior structural fidelity, effectively reconstructing complex architectural geometries (1st row) and legible text (3rd row) while maintaining natural textures (2nd row), avoiding the structural distortions and hallucinations common in competing gener… view at source ↗

**Figure 10.** Figure 10: Visual comparisons on real-world datasets. ASASR demonstrates robust generalization capabilities on challenging real-world scenes. Unlike baselines that often produce over-smoothed textures (e.g., SwinIR) or hallucinated artifacts (e.g., StableSR), our method successfully restores intricate high-frequency details, such as feather textures (1st & 3rd rows) and distant architectural features (2nd row), stri… view at source ↗

**Figure 11.** Figure 11: Visualization of OCR results on the RoadText1K dataset. To evaluate semantic preservation, we apply a pre-trained text detector on images restored by different methods. ASASR reconstructs clearer, sharper text characters compared to other generative models, enabling more accurate text detection (red bounding boxes) that closely aligns with the Ground Truth, whereas competing methods often lead to missed d… view at source ↗

**Figure 12.** Figure 12: Visualization of object detection and instance segmentation on the COCO dataset. We visualize the detection bounding boxes and segmentation masks predicted on restored images. ASASR preserves the structural integrity of objects, such as human limbs (1st & 2nd rows) and animal boundaries (3rd row), resulting in more precise segmentation masks and higher confidence scores compared to baselines that suffer f… view at source ↗

**Figure 13.** Figure 13: Visualization of semantic segmentation on the ADE20K dataset. The results illustrate the impact of restoration quality on scene parsing. ASASR effectively recovers distinct object boundaries and consistent semantic regions (e.g., the car in the 2nd row and furniture in the 3rd row), leading to cleaner segmentation maps with fewer artifacts compared to other diffusion-based counterparts, which often introd… view at source ↗

read the original abstract

Generative priors in Image Super-Resolution (SR) often compromise faithful restoration, we attribute this limitation to a fundamental spectral misalignment between isotropic objectives and the intrinsic natural image manifold. While Direct Preference Optimization offers a path to alignment, its reliance on spectrally flat Gaussian noise fails to distinguish authentic high-frequency details from hallucinations. To bridge this geometric gap, we propose ASASR, a theoretically grounded framework that recasts the generative flow into a Sobolev-induced Riemannian geometry by explicitly coloring the noise transition kernel to mirror natural spectral decay. Driving this geometric alignment, we integrate a parametric adversary grounded in the Riesz Representation Theorem, which synthesizes targeted negative samples equivalent to worst-case Sobolev gradients to direct optimization along the tangent space of plausible structural failures. Extensive evaluations demonstrate that ASASR outperforms leading generative baselines, particularly in preserving spectral consistency and structural fidelity, offering a robust solution that effectively mitigates artifacts.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The abstract frames a geometric fix for spectral misalignment in generative SR via colored noise kernels and a Riesz adversary in Sobolev space, but provides no equations or results to check whether the approach actually delivers.

read the letter

The main contribution seems to be recasting the generative flow as a Sobolev-induced Riemannian geometry by coloring the noise transition kernel to match natural spectral decay, then using a parametric adversary based on the Riesz Representation Theorem to synthesize worst-case negative samples. This targets the gap between flat Gaussian noise and the structured high-frequency content in real images, which the authors link to hallucinations in current methods like DPO-based alignment. The framing of the problem as a mismatch between isotropic objectives and the natural image manifold is clear and directly addresses a practical issue in faithful super-resolution. The integration of colored kernels with the adversary for tangent-space optimization looks like the specific new piece they are putting forward. It is presented as theoretically grounded, which is a reasonable direction if the derivations hold. The abstract also claims better spectral consistency and structural fidelity than leading generative baselines, which would matter for downstream vision tasks where artifacts are costly. The soft spot is that none of this can be checked from the given material. There are no equations showing how the coloring is implemented, how the adversary is parameterized, or how the optimization avoids circularity in the alignment step. The central assumption that mirroring spectral decay will bridge the geometric gap is stated but not demonstrated, and the outperformance claim sits without tables or controls. This leaves the soundness hard to assess. The work is aimed at researchers in generative computer vision who already work on diffusion or adversarial SR and want to explore differential geometry tools for alignment. A reader looking for new ways to think about noise modeling in restoration could extract value from the ideas even before the details are verified. It deserves peer review because the problem is real and the proposed direction is distinct enough to warrant checking the math and experiments in full.

Referee Report

0 major / 2 minor

Summary. The manuscript proposes ASASR, a framework for image super-resolution that addresses spectral misalignment between isotropic generative objectives and the natural image manifold. It recasts the generative flow into a Sobolev-induced Riemannian geometry by coloring the noise transition kernel to match natural spectral decay and integrates a parametric adversary based on the Riesz Representation Theorem to generate worst-case negative samples for directing optimization. The central empirical claim is that ASASR outperforms leading generative baselines in spectral consistency and structural fidelity while mitigating artifacts.

Significance. If the theoretical recasting and empirical outperformance hold, the work could offer a principled geometric approach to alignment in generative SR, potentially improving faithfulness over standard diffusion or GAN-based methods that rely on spectrally flat noise.

minor comments (2)

The abstract is dense with specialized terminology (Sobolev-induced Riemannian geometry, Riesz-based adversary); expanding the introduction with a brief intuitive overview of the noise-coloring step would improve accessibility.
No equations, pseudocode, or high-level algorithm box appear in the provided abstract; including these in §3 or §4 would clarify how the colored kernel is constructed and how the adversary is parameterized.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their summary of our work and for acknowledging the potential of recasting generative super-resolution into Sobolev Riemannian geometry with spectrally colored noise and Riesz-based adversaries. We are happy to provide clarifications on any points that contributed to the 'uncertain' recommendation.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The abstract presents the ASASR framework as recasting generative flow into Sobolev-induced Riemannian geometry via colored noise kernels and a Riesz-based adversary, with empirical claims of outperformance on spectral consistency. No equations, derivations, fitted parameters presented as predictions, or self-citations appear in the provided text. Without load-bearing steps that reduce by construction to inputs (such as self-definitional alignments or ansatzes smuggled via prior work), the derivation chain is self-contained against external benchmarks and cannot be shown to contain circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only; ledger is minimal. The core premise that natural images possess an intrinsic spectral decay that can be directly mirrored by a colored noise kernel is treated as a domain assumption without independent evidence supplied.

axioms (1)

domain assumption Natural images possess an intrinsic spectral decay that isotropic Gaussian noise fails to match, creating a geometric misalignment.
Invoked in the first sentence of the abstract as the root cause of hallucinations.

pith-pipeline@v0.9.0 · 5691 in / 1052 out tokens · 40634 ms · 2026-05-25T04:38:49.138716+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

Σ_s = F^{-1} diag((1 + ||ω||²_2)^{-s}) F ... recovers the Sobolev inner product ... lifting the optimization from flat Euclidean space to the weighted Sobolev manifold H^s(Ω)
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean absolute_floor_iff_bare_distinguishability echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

Riesz Representation Theorem ... worst-case Sobolev gradients ... tangent space of plausible structural failures

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

39 extracted references · 39 canonical work pages

[1]

Podell, Dustin and English, Zion and Lacey, Kyle and Blattmann, Andreas and Dockhorn, Tim and M

work page
[2]

Azar, Mohammad Gheshlaghi and Rowland, Mark and Piot, Bilal and Guo, Daniel and Calandriello, Daniele and Valko, Michal and Munos, R. A

work page
[3]

2024 , howpublished=

Black Forest Labs , title=. 2024 , howpublished=

work page 2024
[4]

Effective

Cheng, Kun and Yu, Lei and Tu, Zhijun and He, Xiao and Chen, Liyu and Guo, Yong and Zhu, Mingrui and Wang, Nannan and Gao, Xinbo and Hu, Jie , year = 2024, number =. Effective

work page 2024
[5]

, year = 2020, journal =

Ding, Keyan and Ma, Kede and Wang, Shiqi and Simoncelli, Eero P. , year = 2020, journal =. Image

work page 2020
[6]

Learning a

Dong, Chao and Loy, Chen Change and He, Kaiming and Tang, Xiaoou , editor =. Learning a

work page
[7]

Esser, Patrick and Kulal, Sumith and Blattmann, Andreas and Entezari, Rahim and M. Scaling

work page
[8]

Relations between the Statistics of Natural Images and the Response Properties of Cortical Cells , author =. J. Opt. Soc. Am. A , volume =

work page
[9]

Alireza and Dadsetan, Saba and Kitani, Kris M

Golestaneh, S. Alireza and Dadsetan, Saba and Kitani, Kris M. , year = 2022, number =. No-

work page 2022
[10]

Goodfellow, Ian J. and. Generative

work page
[11]

Gu, Jinjin and Cai, Haoming and Chen, Haoyu and Ye, Xiaoxing and Ren, Jimmy and Dong, Chao , year = 2020, number =

work page 2020
[12]

He, Kaiming and Gkioxari, Georgia and Doll. Mask

work page
[13]

and Shen, Yelong and Wallis, Phillip and

Hu, Edward J. and Shen, Yelong and Wallis, Phillip and

work page
[14]

Jiang, Yuxuan and Zeng, Chengxi and Teng, Siyue and Zhang, Fan and Zhu, Xiaoqing and Sole, Joel and Bull, David , year = 2025, number =

work page 2025
[15]

Li, Xiaohui and Liu, Yihao and Cao, Shuo and Chen, Ziyan and Zhuang, Shaobin and Chen, Xiangyu and He, Yinan and Wang, Yi and Qiao, Yu , year = 2025, number =

work page 2025
[16]

Li, Xinrui and Wu, Jianlong and Huang, Xinchuan and Chen, Chong and Guan, Weili and Hua, Xian-Sheng and Nie, Liqiang , year = 2025, number =

work page 2025
[17]

Lin, Tsung-Yi and Doll. Feature

work page
[18]

Lawrence and Doll

Lin, Tsung-Yi and Maire, Michael and Belongie, Serge and Bourdev, Lubomir and Girshick, Ross and Hays, James and Perona, Pietro and Ramanan, Deva and Zitnick, C. Lawrence and Doll. Microsoft

work page
[19]

Lipman, Yaron and Chen, Ricky T. Q. and. Flow

work page
[20]

Li, Chenxia and Liu, Weiwei and Guo, Ruoyu and Yin, Xiaoting and Jiang, Kaitao and Du, Yongkun and Du, Yuning and Zhu, Lingfeng and Lai, Baohua and Hu, Xiaoguang and Yu, Dianhai and Ma, Yanjun , year = 2022, journal =

work page 2022
[21]

Li, Weiqi and Zhang, Xuanyu and Zhao, Shijie and Zhang, Yabin and Li, Junlin and Zhang, Li and Zhang, Jian , year = 2025, number =. Q-

work page 2025
[22]

Li, Zekun and Liu, Hongying and Shang, Fanhua and Liu, Yuanyuan and Wan, Liang and Feng, Wei , year = 2024, volume =

work page 2024
[23]

Lu, Yiting and Li, Xin and Pei, Yajing and Yuan, Kun and Xie, Qizhi and Qu, Yunpeng and Sun, Ming and Zhou, Chao and Chen, Zhibo , year = 2024, number =

work page 2024
[24]

Advances in Neural Information Processing Systems , author =

work page
[25]

High-Resolution Image Synthesis with Latent Diffusion Models , booktitle =

Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj. High-Resolution Image Synthesis with Latent Diffusion Models , booktitle =

work page
[26]

Photorealistic

Saharia, Chitwan and Chan, William and Saxena, Saurabh and Li, Lala and Whang, Jay and Denton, Emily and Ghasemipour, Seyed Kamyar Seyed and Ayan, Burcu Karagol and Mahdavi, S Sara and. Photorealistic

work page
[27]

Improving the

Sun, Lingchen and Wu, Rongyuan and Liang, Jie and Zhang, Zhengqiang and Yong, Hongwei and Zhang, Lei , year = 2024, number =. Improving the

work page 2024
[28]

Su, Jianlin and Lu, Yu and Pan, Shengfeng and Murtadha, Ahmed and Wen, Bo and Liu, Yunfeng , year = 2024, journal =

work page 2024
[29]

Tan, Zhenxiong and Xue, Qiaochu and Yang, Xingyi and Liu, Songhua and Wang, Xinchao , year = 2025, number =

work page 2025
[30]

Tan, Zhenxiong and Liu, Songhua and Yang, Xingyi and Xue, Qiaochu and Wang, Xinchao , year = 2025, number =

work page 2025
[31]

Diffusion

Wallace, Bram and Dang, Meihua and Rafailov, Rafael and Zhou, Linqi and Lou, Aaron and Purushwalkam, Senthil and Ermon, Stefano and Xiong, Caiming and Joty, Shafiq and Naik, Nikhil , year = 2023, number =. Diffusion

work page 2023
[32]

Enhanced

Wang, Yiwen and Liang, Ying and Zhang, Yuxuan and Chai, Xinning and Cheng, Zhengxue and Qin, Yingsheng and Yang, Yucai and Xie, Rong and Song, Li , year = 2025, number =. Enhanced

work page 2025
[33]

Proceedings of the 32nd

Large. Proceedings of the 32nd

work page
[34]

Perceive,

Wei, Hongyang and Liu, Shuaizheng and Yuan, Chun and Zhang, Lei , year = 2025, number =. Perceive,

work page 2025
[35]

Yang, Sidi and Wu, Tianhe and Shi, Shuwei and Lao, Shanshan and Gong, Yuan and Cao, Mingdeng and Wang, Jiahao and Yang, Yujiu , year = 2022, number =

work page 2022
[36]

Yang, Hao and Yang, Yan and Zhang, Ruikun and Pan, Liyuan , year = 2025, number =. A

work page 2025
[37]

Improved

Yin, Tianwei and Gharbi, Micha. Improved

work page
[38]

Teaching

You, Zhiyuan and Cai, Xin and Gu, Jinjin and Xue, Tianfan and Dong, Chao , year = 2025, number =. Teaching

work page 2025
[39]

Adaptive

Zhu, Hanwei and Wu, Haoning and Li, Yixuan and Zhang, Zicheng and Chen, Baoliang and Zhu, Lingyu and Fang, Yuming and Zhai, Guangtao and Lin, Weisi and Wang, Shiqi , year = 2024, number =. Adaptive

work page 2024

[1] [1]

Podell, Dustin and English, Zion and Lacey, Kyle and Blattmann, Andreas and Dockhorn, Tim and M

work page

[2] [2]

Azar, Mohammad Gheshlaghi and Rowland, Mark and Piot, Bilal and Guo, Daniel and Calandriello, Daniele and Valko, Michal and Munos, R. A

work page

[3] [3]

2024 , howpublished=

Black Forest Labs , title=. 2024 , howpublished=

work page 2024

[4] [4]

Effective

Cheng, Kun and Yu, Lei and Tu, Zhijun and He, Xiao and Chen, Liyu and Guo, Yong and Zhu, Mingrui and Wang, Nannan and Gao, Xinbo and Hu, Jie , year = 2024, number =. Effective

work page 2024

[5] [5]

, year = 2020, journal =

Ding, Keyan and Ma, Kede and Wang, Shiqi and Simoncelli, Eero P. , year = 2020, journal =. Image

work page 2020

[6] [6]

Learning a

Dong, Chao and Loy, Chen Change and He, Kaiming and Tang, Xiaoou , editor =. Learning a

work page

[7] [7]

Esser, Patrick and Kulal, Sumith and Blattmann, Andreas and Entezari, Rahim and M. Scaling

work page

[8] [8]

Relations between the Statistics of Natural Images and the Response Properties of Cortical Cells , author =. J. Opt. Soc. Am. A , volume =

work page

[9] [9]

Alireza and Dadsetan, Saba and Kitani, Kris M

Golestaneh, S. Alireza and Dadsetan, Saba and Kitani, Kris M. , year = 2022, number =. No-

work page 2022

[10] [10]

Goodfellow, Ian J. and. Generative

work page

[11] [11]

Gu, Jinjin and Cai, Haoming and Chen, Haoyu and Ye, Xiaoxing and Ren, Jimmy and Dong, Chao , year = 2020, number =

work page 2020

[12] [12]

He, Kaiming and Gkioxari, Georgia and Doll. Mask

work page

[13] [13]

and Shen, Yelong and Wallis, Phillip and

Hu, Edward J. and Shen, Yelong and Wallis, Phillip and

work page

[14] [14]

Jiang, Yuxuan and Zeng, Chengxi and Teng, Siyue and Zhang, Fan and Zhu, Xiaoqing and Sole, Joel and Bull, David , year = 2025, number =

work page 2025

[15] [15]

Li, Xiaohui and Liu, Yihao and Cao, Shuo and Chen, Ziyan and Zhuang, Shaobin and Chen, Xiangyu and He, Yinan and Wang, Yi and Qiao, Yu , year = 2025, number =

work page 2025

[16] [16]

Li, Xinrui and Wu, Jianlong and Huang, Xinchuan and Chen, Chong and Guan, Weili and Hua, Xian-Sheng and Nie, Liqiang , year = 2025, number =

work page 2025

[17] [17]

Lin, Tsung-Yi and Doll. Feature

work page

[18] [18]

Lawrence and Doll

Lin, Tsung-Yi and Maire, Michael and Belongie, Serge and Bourdev, Lubomir and Girshick, Ross and Hays, James and Perona, Pietro and Ramanan, Deva and Zitnick, C. Lawrence and Doll. Microsoft

work page

[19] [19]

Lipman, Yaron and Chen, Ricky T. Q. and. Flow

work page

[20] [20]

Li, Chenxia and Liu, Weiwei and Guo, Ruoyu and Yin, Xiaoting and Jiang, Kaitao and Du, Yongkun and Du, Yuning and Zhu, Lingfeng and Lai, Baohua and Hu, Xiaoguang and Yu, Dianhai and Ma, Yanjun , year = 2022, journal =

work page 2022

[21] [21]

Li, Weiqi and Zhang, Xuanyu and Zhao, Shijie and Zhang, Yabin and Li, Junlin and Zhang, Li and Zhang, Jian , year = 2025, number =. Q-

work page 2025

[22] [22]

Li, Zekun and Liu, Hongying and Shang, Fanhua and Liu, Yuanyuan and Wan, Liang and Feng, Wei , year = 2024, volume =

work page 2024

[23] [23]

Lu, Yiting and Li, Xin and Pei, Yajing and Yuan, Kun and Xie, Qizhi and Qu, Yunpeng and Sun, Ming and Zhou, Chao and Chen, Zhibo , year = 2024, number =

work page 2024

[24] [24]

Advances in Neural Information Processing Systems , author =

work page

[25] [25]

High-Resolution Image Synthesis with Latent Diffusion Models , booktitle =

Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj. High-Resolution Image Synthesis with Latent Diffusion Models , booktitle =

work page

[26] [26]

Photorealistic

Saharia, Chitwan and Chan, William and Saxena, Saurabh and Li, Lala and Whang, Jay and Denton, Emily and Ghasemipour, Seyed Kamyar Seyed and Ayan, Burcu Karagol and Mahdavi, S Sara and. Photorealistic

work page

[27] [27]

Improving the

Sun, Lingchen and Wu, Rongyuan and Liang, Jie and Zhang, Zhengqiang and Yong, Hongwei and Zhang, Lei , year = 2024, number =. Improving the

work page 2024

[28] [28]

Su, Jianlin and Lu, Yu and Pan, Shengfeng and Murtadha, Ahmed and Wen, Bo and Liu, Yunfeng , year = 2024, journal =

work page 2024

[29] [29]

Tan, Zhenxiong and Xue, Qiaochu and Yang, Xingyi and Liu, Songhua and Wang, Xinchao , year = 2025, number =

work page 2025

[30] [30]

Tan, Zhenxiong and Liu, Songhua and Yang, Xingyi and Xue, Qiaochu and Wang, Xinchao , year = 2025, number =

work page 2025

[31] [31]

Diffusion

Wallace, Bram and Dang, Meihua and Rafailov, Rafael and Zhou, Linqi and Lou, Aaron and Purushwalkam, Senthil and Ermon, Stefano and Xiong, Caiming and Joty, Shafiq and Naik, Nikhil , year = 2023, number =. Diffusion

work page 2023

[32] [32]

Enhanced

Wang, Yiwen and Liang, Ying and Zhang, Yuxuan and Chai, Xinning and Cheng, Zhengxue and Qin, Yingsheng and Yang, Yucai and Xie, Rong and Song, Li , year = 2025, number =. Enhanced

work page 2025

[33] [33]

Proceedings of the 32nd

Large. Proceedings of the 32nd

work page

[34] [34]

Perceive,

Wei, Hongyang and Liu, Shuaizheng and Yuan, Chun and Zhang, Lei , year = 2025, number =. Perceive,

work page 2025

[35] [35]

Yang, Sidi and Wu, Tianhe and Shi, Shuwei and Lao, Shanshan and Gong, Yuan and Cao, Mingdeng and Wang, Jiahao and Yang, Yujiu , year = 2022, number =

work page 2022

[36] [36]

Yang, Hao and Yang, Yan and Zhang, Ruikun and Pan, Liyuan , year = 2025, number =. A

work page 2025

[37] [37]

Improved

Yin, Tianwei and Gharbi, Micha. Improved

work page

[38] [38]

Teaching

You, Zhiyuan and Cai, Xin and Gu, Jinjin and Xue, Tianfan and Dong, Chao , year = 2025, number =. Teaching

work page 2025

[39] [39]

Adaptive

Zhu, Hanwei and Wu, Haoning and Li, Yixuan and Zhang, Zicheng and Chen, Baoliang and Zhu, Lingyu and Fang, Yuming and Zhai, Guangtao and Lin, Weisi and Wang, Shiqi , year = 2024, number =. Adaptive

work page 2024