Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models

Guo Li; Weihong Chen; Yongfu Fan

arXiv: 2510.21783 · v2 · submitted 2025-10-18 · cs.CV · cs.AI · cs.CR


Pith reviewed 2026-05-18 05:59 UTC · model grok-4.3

classification cs.CV · cs.AI · cs.CR

keywords membership inference · diffusion models · privacy attacks · noise injection · noise prediction consistency · Stable Diffusion · machine learning security · data leakage


The pith

Diffusion models leak training membership through the consistency of their noise predictions, and a single low-intensity noise injection amplifies that signal enough to cut query counts and raise accuracy.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a membership inference attack that targets how diffusion models predict noise at each step of the reverse process. Existing attacks measure either direct loss differences or full-image reconstruction differences, and as a result they either miss fine-grained signals or require many expensive model calls. The proposed method instead aggregates noise predictions across a short diffusion trajectory and injects only a small amount of noise in a single step, which widens the consistency gap between training-set samples and unseen samples and makes it easier to detect. According to the authors, this yields both lower query budgets and improved detection rates on models such as Stable Diffusion.
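A minimal sketch of the idea in this paragraph, under loud assumptions. This page does not give the paper's exact consistency metric or aggregation rule, so the score below (variance of the noise predictions across a few timesteps, averaged over dimensions) is a hypothetical stand-in. The names `inject_noise`, `consistency_score`, and `make_toy_predictor`, the linear beta schedule, and every timestep value are illustrative and not taken from the paper. A real audit would replace the toy predictor with the target model's noise-prediction network.

```python
import numpy as np

NUM_STEPS = 1000
# Linear beta schedule (an assumption; the paper's schedule is not stated here).
BETAS = np.linspace(1e-4, 0.02, NUM_STEPS)
ALPHA_BAR = np.cumprod(1.0 - BETAS)


def inject_noise(x0, t, rng):
    """Jump straight to timestep t: x_t = sqrt(ab_t) * x0 + sqrt(1 - ab_t) * eps."""
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(ALPHA_BAR[t]) * x0 + np.sqrt(1.0 - ALPHA_BAR[t]) * eps


def consistency_score(x0, predict_noise, t_inject=50, timesteps=(50, 100, 150), seed=0):
    """Hypothetical score: spread of noise predictions over a few timesteps.

    One low-intensity injection, then one model query per timestep.
    Lower values mean more consistent predictions.
    """
    rng = np.random.default_rng(seed)
    x_t = inject_noise(x0, t_inject, rng)
    preds = np.stack([predict_noise(x_t, t) for t in timesteps])
    return float(preds.var(axis=0).mean())


def make_toy_predictor(x_train):
    """Toy stand-in for a model that has memorised a single sample, x_train.

    It inverts the forward formula as if every input originated from x_train.
    """
    def predict_noise(x_t, t):
        return (x_t - np.sqrt(ALPHA_BAR[t]) * x_train) / np.sqrt(1.0 - ALPHA_BAR[t])
    return predict_noise


rng = np.random.default_rng(1)
member = rng.standard_normal(64)      # the memorised "training" sample
non_member = rng.standard_normal(64)  # an unseen sample
model = make_toy_predictor(member)

member_score = consistency_score(member, model)
non_member_score = consistency_score(non_member, model)
print(member_score, non_member_score)
```

In this toy setup the member score comes out much smaller than the non-member score, but only because the predictor memorises the member by construction. Whether a trained diffusion model shows a comparable gap is exactly what the paper has to demonstrate. Note that the query cost is one model call per entry in `timesteps`.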

Core claim

The authors claim that the consistency of noise predictions across diffusion steps differs systematically between samples that were seen during training and those that were not, and that a single-step low-intensity noise injection suffices to enlarge this difference enough for reliable inference while sharply reducing the number of model evaluations required.

What carries the argument

Noise aggregation analysis driven by single-step low-intensity noise injection, which amplifies consistency differences in the model's noise predictions between member and non-member samples.
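For orientation, "low-intensity noise injection" can be read against the standard DDPM forward process (Ho, Jain, and Abbeel, 2020). This is a sketch under the assumption that the paper uses the usual parameterisation; the symbols below are the conventional ones and are not quoted from the paper.

```latex
% Standard DDPM forward process; the paper may parameterise it differently.
\[
  x_t = \sqrt{\bar{\alpha}_t}\, x_0 + \sqrt{1 - \bar{\alpha}_t}\, \epsilon,
  \qquad \epsilon \sim \mathcal{N}(0, I),
  \qquad \bar{\alpha}_t = \prod_{s=1}^{t} (1 - \beta_s)
\]

% Signal-to-noise ratio of x_t: a low-intensity injection corresponds
% to a high SNR, i.e. a small timestep t.
\[
  \mathrm{SNR}(t) = \frac{\bar{\alpha}_t}{1 - \bar{\alpha}_t}
\]
```

Under this reading, injecting at a small timestep keeps $x_t$ close to the original sample $x_0$, which is consistent with the claim that the injection should perturb the input only slightly.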

If this is right

Fewer model queries are needed to run the attack compared with loss-based or reconstruction-based baselines.
Higher membership inference accuracy is achieved by exploiting noise-prediction consistency rather than scalar loss or pixel-level reconstruction error.
The attack applies directly to text-to-image diffusion models such as Stable Diffusion without requiring access to intermediate activations.
Privacy auditing of released diffusion models becomes cheaper and therefore more practical for model owners and regulators.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

Diffusion models may leak membership information through many intermediate denoising steps, not only through final output loss or reconstruction quality.
Defenses that add noise to training gradients or randomize the diffusion schedule might reduce the consistency gap the attack exploits.
The same single-step injection idea could be tested on other generative models whose training involves iterative refinement, such as score-based models or flow-matching networks.

Load-bearing premise

The consistency characteristics of noise prediction during the diffusion process differ meaningfully between member and non-member samples.

What would settle it

Apply the noise-aggregation procedure to a diffusion model with a known training set and a disjoint test set; if the aggregated noise-consistency scores show no statistically significant separation between the two groups, the central claim is false.
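The check described above can be sketched as a two-sample test on the per-sample scores. The functions `auc` and `permutation_p_value` are hypothetical helpers written for this illustration, the score arrays are synthetic placeholders rather than results from the paper, and the convention that lower scores indicate members is an assumption.

```python
import numpy as np


def auc(member_scores, non_member_scores):
    """Probability that a random non-member scores higher than a random member.

    Ties count half. A value of 0.5 means no separation between the groups.
    """
    m = np.asarray(member_scores, dtype=float)[:, None]
    n = np.asarray(non_member_scores, dtype=float)[None, :]
    return float((n > m).mean() + 0.5 * (n == m).mean())


def permutation_p_value(member_scores, non_member_scores, num_perm=2000, seed=0):
    """Two-sided permutation test on the absolute difference of group means."""
    rng = np.random.default_rng(seed)
    m = np.asarray(member_scores, dtype=float)
    n = np.asarray(non_member_scores, dtype=float)
    observed = abs(m.mean() - n.mean())
    pooled = np.concatenate([m, n])
    count = 0
    for _ in range(num_perm):
        perm = rng.permutation(pooled)  # shuffle the group labels
        diff = abs(perm[:len(m)].mean() - perm[len(m):].mean())
        count += int(diff >= observed)
    # Add-one correction keeps the estimate strictly positive.
    return (count + 1) / (num_perm + 1)


# Synthetic placeholder scores, NOT results from the paper.
members = np.linspace(0.0, 1.0, 20)
non_members = np.linspace(2.0, 3.0, 20)
print(auc(members, non_members), permutation_p_value(members, non_members))
```

A permutation test is used here because it makes no distributional assumption about the scores. An AUC near 0.5 together with a large p-value on a model with a known training split would count against the central claim.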

Figures

Figures reproduced from arXiv: 2510.21783 by Guo Li, Weihong Chen, Yongfu Fan.

**Figure 1.** Overview of our proposed membership inference attack pipeline. The approach injects small noise into test images, predicts noise at selected timesteps, …

**Figure 2.** Comparison of image diffusion effects under different noise intensities. The figure demonstrates how different noise levels affect image quality and …

**Figure 3.** Attack performance across different timestep parameters. The figure shows the relationship between timestep selection and attack effectiveness, with …

**Figure 4.** Impact of initial noise intensity on attack performance. The figure illustrates the optimal noise level that balances member/non-member distinguishability …

**Figure 5.** Impact of DDIM sampling step size on attack performance. The figure shows the relationship between sampling granularity and attack effectiveness.

Original abstract

Diffusion models have demonstrated powerful performance in generating high-quality images. A typical example is a text-to-image generator like Stable Diffusion. However, their widespread use also poses potential privacy risks. A key concern is membership inference attacks, which attempt to determine whether a particular data sample was used in the model training process. Existing membership inference attacks against diffusion models either directly exploit sample loss differences or rely on image-level reconstruction differences. Both approaches commonly ignore the consistency characteristics of noise prediction during the diffusion process, resulting in either low inference accuracy or high computational costs. To address these shortcomings, we propose a membership inference method based on noise aggregation analysis, and introduce a single-step, low-intensity noise injection diffusion strategy to amplify differences between member and non-member samples. Our proposed approach substantially reduces model query requirements while delivering more efficient and accurate membership inference.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note: private letter to a colleague

This paper offers a new angle on membership inference for diffusion models via noise aggregation and single-step low-intensity injection, but the abstract supplies no numbers or justification for why the consistency differences are membership-specific.


The main point here is a proposed membership inference attack on diffusion models. It looks at the consistency of noise predictions across the diffusion process and adds a single low-intensity noise injection step to widen the gap between member and non-member samples, with the goal of using fewer queries than prior methods while getting better accuracy. What is new is the explicit focus on those consistency characteristics, which the authors say existing loss-based and reconstruction-based attacks ignore, plus the specific choice of a one-step low-intensity injection to amplify them.

The paper does a reasonable job laying out the shortcomings of the two main existing families of attacks and sketching a lighter-weight alternative that could matter for practical privacy auditing of models like Stable Diffusion. That framing is useful even if the details still need checking.

The soft spots are clear and fairly large. The abstract asserts efficiency and accuracy improvements but contains no results, no baseline numbers, no error bars, and no description of the experimental setup or how the aggregation thresholds are set. The load-bearing assumption, that noise consistency differs between members and non-members in a way that one low-intensity injection selectively enlarges, is stated without derivation and without evidence that it is not driven instead by sample difficulty, generalization gaps, or plain stochasticity. If that assumption does not hold, the claimed reductions in query count and gains in accuracy cannot be credited to the proposed analysis. The full methods and results sections would be needed to judge whether the math is sound or whether any circularity crept into the metric choices.

This kind of work is mainly for researchers in machine learning security and privacy who care about data leakage in generative models. A reader already following membership inference papers would get some value from the new technical direction, provided the experiments later show the gains are real and not artifacts. I would send it to peer review. The topic is timely and the idea is distinct enough from the cited priors that referees should have a chance to evaluate the evidence once it is presented.

Referee Report

2 major / 1 minor

Summary. The paper proposes a membership inference attack on diffusion models (e.g., Stable Diffusion) that exploits consistency characteristics of noise predictions during the diffusion process. It introduces a noise aggregation analysis method driven by a single-step, low-intensity noise injection strategy intended to amplify differences between member and non-member samples, claiming this yields substantially lower model query counts and higher efficiency/accuracy than prior loss-based or reconstruction-based attacks.

Significance. If the empirical claims hold, the work would supply a query-efficient membership inference technique that could improve privacy auditing for large-scale diffusion models. The emphasis on noise-prediction consistency rather than final loss or full reconstruction is a potentially useful angle, but its value depends on whether the observed differences are genuinely membership-driven and whether the single-step injection reliably isolates them.

major comments (2)

[Abstract] Abstract: the central claim that the method 'substantially reduces model query requirements while delivering more efficient and accurate membership inference' is presented without any quantitative results, baseline comparisons, error bars, dataset details, or experimental protocol. This absence makes it impossible to evaluate whether the efficiency and accuracy gains are real or attributable to the proposed noise aggregation analysis.
[Abstract] Abstract (and implied methods): the load-bearing assumption that 'consistency characteristics of noise prediction during the diffusion process differ meaningfully between member and non-member samples' and that single-step low-intensity injection 'can reliably amplify these differences' receives no derivation, equation, or theoretical justification. Without an explicit consistency metric, aggregation rule, or argument showing why this strategy outperforms multi-step or higher-intensity alternatives, the attribution of any gains to the new analysis remains ungrounded.

minor comments (1)

[Abstract] Abstract: the phrase 'noise aggregation analysis' is introduced without a concise definition or reference to the precise aggregation operation (e.g., mean, variance, or threshold) that readers would need to understand the contribution.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments. We agree that the abstract requires strengthening to better support its claims with quantitative evidence and explicit methodological justification. We address each point below and will incorporate revisions in the next version of the manuscript.


Referee: [Abstract] Abstract: the central claim that the method 'substantially reduces model query requirements while delivering more efficient and accurate membership inference' is presented without any quantitative results, baseline comparisons, error bars, dataset details, or experimental protocol. This absence makes it impossible to evaluate whether the efficiency and accuracy gains are real or attributable to the proposed noise aggregation analysis.

Authors: We acknowledge that the current abstract is overly concise and does not include supporting quantitative details. The full manuscript reports these results in the experimental sections, including direct comparisons against loss-based and reconstruction-based baselines, query-count reductions, accuracy metrics with standard deviations across runs, and the datasets and protocols used. To address the referee's concern directly, we will revise the abstract to incorporate a concise summary of the key empirical outcomes and experimental setup so that the efficiency and accuracy claims can be evaluated on first reading.

revision: yes

Referee: [Abstract] Abstract (and implied methods): the load-bearing assumption that 'consistency characteristics of noise prediction during the diffusion process differ meaningfully between member and non-member samples' and that single-step low-intensity injection 'can reliably amplify these differences' receives no derivation, equation, or theoretical justification. Without an explicit consistency metric, aggregation rule, or argument showing why this strategy outperforms multi-step or higher-intensity alternatives, the attribution of any gains to the new analysis remains ungrounded.

Authors: We agree that the abstract does not supply the requested derivation or explicit definitions. The manuscript's methods section introduces a consistency metric defined on the variance of the model's noise predictions and an aggregation rule that combines predictions from the single low-intensity injection step. We argue that this single-step, low-intensity regime is sufficient because it isolates early-stage prediction discrepancies caused by training-set overfitting while avoiding the computational overhead of multi-step trajectories; empirical ablations in the paper compare it against higher-intensity and multi-step variants. To make this grounding visible at the abstract level, we will add a brief clause stating the consistency metric and the rationale for the injection strategy.

revision: partial

Circularity Check

0 steps flagged

No circularity: proposed method introduces new analysis without reducing to fitted inputs or self-referential definitions


The paper proposes a membership inference approach based on noise aggregation analysis combined with a single-step low-intensity noise injection strategy. The abstract explicitly frames this as a novel way to exploit previously ignored consistency characteristics of noise predictions, addressing shortcomings of prior loss-based or reconstruction-based attacks. No equations, parameter-fitting procedures, or self-citations are shown that would make any claimed prediction or result equivalent to its inputs by construction. The derivation chain remains self-contained because the central contribution is an empirical strategy whose validity can be tested against external data and benchmarks rather than being forced by internal definitions or prior author work.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The work is an empirical attack proposal; the abstract mentions no explicit free parameters, mathematical axioms, or newly invented entities. The central claim rests on the unstated assumption that noise-prediction consistency differs systematically between training and non-training samples.




[1] [1]

Diffusion models beat gans on image synthesis,

P. Dhariwal and A. Nichol, “Diffusion models beat gans on image synthesis,”Advances in neural information processing systems, vol. 34, pp. 8780–8794, 2021

work page 2021

[2] [2]

More control for free! image synthesis with semantic diffusion guidance,

X. Liu, D. H. Park, S. Azadi, G. Zhang, A. Chopikyan, Y . Hu, H. Shi, A. Rohrbach, and T. Darrell, “More control for free! image synthesis with semantic diffusion guidance,” inProceedings of the IEEE/CVF winter conference on applications of computer vision, 2023, pp. 289– 299

work page 2023

[3] [3]

Hierarchical Text-Conditional Image Generation with CLIP Latents

A. Ramesh, P. Dhariwal, A. Nichol, C. Chu, and M. Chen, “Hierarchical text-conditional image generation with clip latents,”arXiv preprint arXiv:2204.06125, vol. 1, no. 2, p. 3, 2022

work page internal anchor Pith review Pith/arXiv arXiv 2022

[4] [4]

Dreambooth: Fine tuning text-to-image diffusion models for subject- driven generation,

N. Ruiz, Y . Li, V . Jampani, Y . Pritch, M. Rubinstein, and K. Aberman, “Dreambooth: Fine tuning text-to-image diffusion models for subject- driven generation,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2023, pp. 22 500–22 510

work page 2023

[5] [5]

Photorealistic text-to-image diffusion models with deep language understanding,

C. Saharia, W. Chan, S. Saxena, L. Li, J. Whang, E. L. Denton, K. Ghasemipour, R. Gontijo Lopes, B. Karagol Ayan, T. Salimans et al., “Photorealistic text-to-image diffusion models with deep language understanding,”Advances in neural information processing systems, vol. 35, pp. 36 479–36 494, 2022

work page 2022

[6] [6]

High- resolution image synthesis with latent diffusion models,

R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, “High- resolution image synthesis with latent diffusion models,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2022, pp. 10 684–10 695

work page 2022

[7] [7]

Classifier-Free Diffusion Guidance

J. Ho and T. Salimans, “Classifier-free diffusion guidance,” arXiv preprint arXiv:2207.12598, 2022. [Online]. Available: https://arxiv.org/abs/2207.12598

work page internal anchor Pith review Pith/arXiv arXiv 2022

[8] [8]

Elucidating the design space of classifier-guided diffusion generation,

J. Ma, T. Hu, W. Wang, and J. Sun, “Elucidating the design space of classifier-guided diffusion generation,” inNeurIPS 2023 Workshop on Score-Based Methods, 2023. [Online]. Available: https://arxiv.org/abs/2310.11311

work page arXiv 2023

[9] [9]

Membership inference attacks against machine learning models,

R. Shokri, M. Stronati, C. Song, and V . Shmatikov, “Membership inference attacks against machine learning models,” in2017 IEEE symposium on security and privacy (SP). IEEE, 2017, pp. 3–18

work page 2017

[10] [10]

Privacy risk in machine learning: Analyzing the connection to overfitting,

S. Yeom, I. Giacomelli, M. Fredrikson, and S. Jha, “Privacy risk in machine learning: Analyzing the connection to overfitting,” in2018 IEEE 31st computer security foundations symposium (CSF). IEEE, 2018, pp. 268–282

work page 2018

[11] [11]

ML-Leaks: Model and Data Independent Membership Inference Attacks and Defenses on Machine Learning Models

A. Salem, Y . Zhang, M. Humbert, P. Berrang, M. Fritz, and M. Backes, “Ml-leaks: Model and data independent membership inference at- tacks and defenses on machine learning models,”arXiv preprint arXiv:1806.01246, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[12] [12]

Understanding Membership Inferences on Well-Generalized Learning Models

Y . Long, V . Bindschaedler, L. Wang, D. Bu, X. Wang, H. Tang, C. A. Gunter, and K. Chen, “Understanding membership inferences on well- generalized learning models,”arXiv preprint arXiv:1802.04889, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[13] Y. Long, L. Wang, D. Bu, V. Bindschaedler, X. Wang, H. Tang, C. A. Gunter, and K. Chen, “A pragmatic approach to membership inferences on machine learning models,” in 2020 IEEE European Symposium on Security and Privacy (EuroS&P). IEEE, 2020, pp. 521–534

[14] “Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data (General Data Protection Regulation),” https://eur-lex.europa.eu/eli/reg/2016/679/oj, 2016, Official Journal of the European Union, L119, 1-88

[15] “California Consumer Privacy Act of 2018 (CCPA),” https://oag.ca.gov/privacy/ccpa, 2018, Cal. Civ. Code §§1798.100–1798.199

[16] T. Matsumoto, T. Miura, and N. Yanai, “Membership inference attacks against diffusion models,” in 2023 IEEE Security and Privacy Workshops (SPW). IEEE, 2023, pp. 77–83

[17] H. Hu and J. Pang, “Membership inference of diffusion models,” arXiv preprint arXiv:2301.09956, 2023

[18] J. Duan, F. Kong, S. Wang, X. Shi, and K. Xu, “Are diffusion models vulnerable to membership inference attacks?” in International Conference on Machine Learning. PMLR, 2023, pp. 8717–8730

[19] X. Fu, X. Wang, Q. Li, J. Liu, J. Dai, J. Han, and X. Gao, “Unlocking generative priors: A new membership inference framework for diffusion models,” IEEE Transactions on Information Forensics and Security, 2025

[20] S. Zhai, H. Chen, Y. Dong, J. Li, Q. Shen, Y. Gao, H. Su, and Y. Liu, “Membership inference on text-to-image diffusion models via conditional likelihood discrepancy,” Advances in Neural Information Processing Systems, vol. 37, pp. 74 122–74 146, 2024

[21] Q. Li, X. Fu, X. Wang, J. Liu, X. Gao, J. Dai, and J. Han, “Unveiling structural memorization: Structural membership inference attack for text-to-image diffusion models,” in Proceedings of the 32nd ACM International Conference on Multimedia, 2024, pp. 10 554–10 562

[22] J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in neural information processing systems, vol. 33, pp. 6840–6851, 2020

[23] J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli, “Deep unsupervised learning using nonequilibrium thermodynamics,” in International conference on machine learning. PMLR, 2015, pp. 2256–2265

[24] A. Q. Nichol and P. Dhariwal, “Improved denoising diffusion probabilistic models,” in Proceedings of the 38th International Conference on Machine Learning, ser. Proceedings of Machine Learning Research, vol. 139. PMLR, 2021, pp. 8162–8171. [Online]. Available: http://proceedings.mlr.press/v139/nichol21a.html

[25] I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, “Generative adversarial nets,” Advances in neural information processing systems, vol. 27, 2014

[26] D. P. Kingma and M. Welling, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114, 2013

[27] O. Bar-Tal, H. Chefer, O. Tov, C. Herrmann, R. Paiss, S. Zada, A. Ephrat, J. Hur, G. Liu, A. Raj et al., “Lumiere: A space-time diffusion model for video generation,” in SIGGRAPH Asia 2024 Conference Papers, 2024, pp. 1–11

[28] X. Ma, Y. Wang, G. Jia, X. Chen, Z. Liu, Y.-F. Li, C. Chen, and Y. Qiao, “Latte: Latent diffusion transformer for video generation,” arXiv preprint arXiv:2401.03048, 2024

[29] V. Popov, I. Vovk, V. Gogoryan, T. Sadekova, and M. Kudinov, “Grad-tts: A diffusion probabilistic model for text-to-speech,” in International conference on machine learning. PMLR, 2021, pp. 8599–8608

[30] M. Jeong, H. Kim, S. J. Cheon, B. J. Choi, and N. S. Kim, “Diff-tts: A denoising diffusion model for text-to-speech,” arXiv preprint arXiv:2104.01409, 2021

[31] Y. Long, K. Yang, Y. Ma, and Y. Yang, “Mixdiff-tts: Mixture alignment and diffusion model for text-to-speech,” Applied Sciences, vol. 15, no. 9, p. 4810, 2025

[32] L. Huang, T. Xu, Y. Yu, P. Zhao, X. Chen, J. Han, Z. Xie, H. Li, W. Zhong, K.-C. Wong et al., “A dual diffusion model enables 3d molecule generation and lead optimization based on target pockets,” Nature Communications, vol. 15, no. 1, p. 2657, 2024

[33] M. Oestreich, E. Merdivan, M. Lee, J. L. Schultze, M. Piraud, and M. Becker, “Drugdiff: small molecule diffusion model with flexible guidance towards molecular properties,” Journal of cheminformatics, vol. 17, no. 1, p. 23, 2025

[34] J. Song, C. Meng, and S. Ermon, “Denoising diffusion implicit models,” arXiv preprint arXiv:2010.02502, 2020

[35] A. Ramesh, M. Pavlov, G. Goh, S. Gray, C. Voss, A. Radford, M. Chen, and I. Sutskever, “Dall·e 2: Hierarchical text-conditional image generation with clip latents,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

[36] J. Hayes, L. Melis, G. Danezis, and E. De Cristofaro, “Logan: Membership inference attacks against generative models,” arXiv preprint arXiv:1705.07663, 2017

[37] B. Hilprecht, M. Härterich, and D. Bernau, “Monte carlo and reconstruction membership inference attacks against generative models,” Proceedings on Privacy Enhancing Technologies, 2019

[38] K. S. Liu, C. Xiao, B. Li, and J. Gao, “Performing co-membership attacks against deep generative models,” in 2019 IEEE International Conference on Data Mining (ICDM). IEEE, 2019, pp. 459–467

[39] D. Chen, N. Yu, Y. Zhang, and M. Fritz, “Gan-leaks: A taxonomy of membership inference attacks against generative models,” in Proceedings of the 2020 ACM SIGSAC conference on computer and communications security, 2020, pp. 343–362