Cyclic Denoising Reveals Ultrastable Memories in Diffusion Models

Rishabh Sharma; Stefano Martiniani

arXiv: 2606.24000 · v1 · pith:F4UBLMQ3new · submitted 2026-06-22 · 💻 cs.LG · cond-mat.dis-nn · cs.CR · cs.CV


Pith reviewed 2026-06-26 08:26 UTC · model grok-4.3

classification 💻 cs.LG · cond-mat.dis-nn · cs.CR · cs.CV

keywords diffusion models · memorization · cyclic denoising · ultrastable attractors · extraction attack · generative landscapes · privacy auditing · model fingerprinting


The pith

Cyclic denoising extracts ultrastable attractors that match memorized training images from diffusion models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents cyclic denoising, which consists of repeated forward and reverse diffusion steps at fixed noise amplitudes, as a way to reach regions of the model's output distribution that standard sampling misses. Samples are driven toward attractors whose stability varies widely, with the deepest ones regenerating after near-total corruption and surviving thousands of cycles. Many of these attractors match specific training images such as stock photographs and watermarks. The method needs only sampler control and no prompts or training-data knowledge, so it supplies a practical route for auditing memorization with direct consequences for privacy and copyright questions.

Core claim

Cyclic denoising exposes ultrastable attractors in diffusion models that regenerate after near-total corruption and persist through thousands of noising-denoising cycles. Many of these attractors correspond to memorized training images. The protocol works in both latent-space models such as Stable Diffusion v1.4 and pixel-space DDPMs, requires no gradients or conditioning, and displays a yielding-like transition: low noise amplitudes produce trivial fixed points while larger amplitudes produce basin hopping and long-lived trapping in structured memorized basins.

What carries the argument

Cyclic denoising: repeated forward and reverse diffusion at controlled noise amplitudes that drives samples toward attractors with a broad stability spectrum.
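As a minimal sketch of the cycling loop described above (not the authors' implementation), the Python below alternates a variance-preserving noising step at fixed amplitude `gamma` with a one-shot denoising step. The denoiser is a stand-in assumption: it is the exact posterior mean for a toy "model" whose learned distribution is a handful of stored points, so the attractors are the stored points by construction. The names `toy_denoiser` and `cyclic_denoise`, the noising rule, and all parameter values are illustrative; a real audit would call the model's own sampler for the reverse step.

```python
import numpy as np

def toy_denoiser(x_noisy, train, gamma):
    """Posterior mean E[x0 | x_noisy] for a toy model that stores `train` exactly.

    ASSUMPTION: stands in for a trained diffusion model's reverse process.
    """
    a = np.sqrt(1.0 - gamma**2)                      # signal scale after noising
    d2 = np.sum((x_noisy - a * train) ** 2, axis=1)  # squared distance to each scaled point
    logits = -d2 / (2.0 * gamma**2)
    w = np.exp(logits - logits.max())                # numerically stable softmax
    w /= w.sum()
    return w @ train

def cyclic_denoise(x0, train, gamma, n_cycles, rng):
    """Repeat (noise at amplitude gamma) -> (denoise) and record every cycle."""
    a = np.sqrt(1.0 - gamma**2)
    x = x0.copy()
    traj = [x.copy()]
    for _ in range(n_cycles):
        noisy = a * x + gamma * rng.standard_normal(x.shape)  # forward step
        x = toy_denoiser(noisy, train, gamma)                 # reverse step
        traj.append(x.copy())
    return np.stack(traj)

rng = np.random.default_rng(0)
train = rng.standard_normal((5, 64))   # hypothetical "training set"
traj = cyclic_denoise(rng.standard_normal(64), train,
                      gamma=0.5, n_cycles=200, rng=rng)
# Distance from the final state to the closest stored point.
nearest = np.linalg.norm(train - traj[-1], axis=1).min()
```

In this toy setting the trajectory collapses onto one stored point, which is an idealized version of trapping in a memorized basin. Whether a real model behaves this way is the empirical question the paper addresses.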

If this is right

Ultrastable attractors regenerate after near-total corruption and persist through thousands of cycles.
Many attractors correspond to memorized training images including stock photographs, brand watermarks, and web-crawl artifacts.
The attack works fully unconditioned and requires only sampler-level control with no gradients, weights, or prompts.
Noise amplitude controls a yielding-like transition from trivial fixed points to rearrangements and trapping in memorized basins.
The recovered attractor set shows hierarchical partial absorption, prompt-stabilized basins, and universality across different initial conditions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

Developers could run cyclic denoising on their own models before deployment to locate and remove memorized content.
The same cycling procedure might serve as a general probe for memorization in other generative architectures beyond diffusion.
Adding an explicit membership-inference verification step would make the attack more robust against false positives.
The observed cross-initial-condition universality suggests the memorized basins occupy a sizable fraction of the model's measure.

Load-bearing premise

The recovered ultrastable attractors are verifiably memorized training images rather than model-generated artifacts that happen to resemble training data.

What would settle it

The claim would be refuted if a direct comparison of the extracted attractor images against the full training set showed that none of them match any training example, or if the attractors failed to reappear after a second round of near-total corruption.

Figures

Figures reproduced from arXiv: 2606.24000 by Rishabh Sharma, Stefano Martiniani.

**Figure 2.** Basin hopping in latent space under unconditional cyclic denoising in Stable Diffusion v1.4.

**Figure 3.** Yielding diagram for cyclic denoising. Steady-state stroboscopic similarity ⟨cos(z_n, z_{n−1})⟩_ss between consecutive cycles versus the perturbation (cycling) amplitude γ. Each trajectory is run for 10,000 cycles; for each seed we average the similarity over the final 1000 cycles, then report the mean across seeds with ±SEM error bars. Stable Diffusion v1.4 is shown for ImageNet, model-generated, and Gaussian …

**Figure 4.** Moderate-amplitude cycling recovers memorized logos and web-crawl artifacts.

**Figure 5.** High-amplitude cycling isolates deep memorized attractors.

**Figure 6.** Unconditional cyclic denoising drives CIFAR-10 DDPM samples toward memorized attractors.

**Figure 7.** Stability of prompt-conditioned image attractors under cyclic denoising.

**Figure 8.** Trivial absorbing states from low-amplitude unconditional cycling in Stable Diffusion v1.4.

**Figure 9.** Two kinds of attractor reached by low-amplitude unconditional cycling in Stable Diffusion v1.4:

**Figure 10.** Multiple routes to the same attractors

**Figure 11.** How cyclic denoising traverses the generative landscape at intermediate and high noise

**Figure 12.** Selected CIFAR-10 attractors recovered by unconditional cyclic dynamics.

**Figure 13.** Prompt-conditioned cycling settling into two kinds of absorbing states: concept basins (a,b)

**Figure 14.** Decorrelation of prompt-conditioned absorbing states after prompt removal (
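The order parameter in the Figure 3 caption can be sketched as follows. This is an illustration under assumptions, not the authors' code: a trajectory is taken to be an array with one state per cycle, the similarity is the cosine between consecutive states averaged over the last `tail` cycles, and the per-seed values are then summarized as mean and standard error. The function names and the toy trajectories are hypothetical.

```python
import numpy as np

def stroboscopic_similarity(traj, tail):
    """Mean cosine similarity between consecutive cycles over the final `tail` steps."""
    z_prev, z_next = traj[:-1], traj[1:]
    num = np.sum(z_prev * z_next, axis=1)
    den = np.linalg.norm(z_prev, axis=1) * np.linalg.norm(z_next, axis=1)
    cos = num / den
    return float(cos[-tail:].mean())

def mean_and_sem(per_seed):
    """Mean across seeds and standard error of the mean (sample std / sqrt(n))."""
    v = np.asarray(per_seed, dtype=float)
    sem = v.std(ddof=1) / np.sqrt(v.size) if v.size > 1 else 0.0
    return float(v.mean()), float(sem)

# Toy trajectories (ASSUMPTION: stand-ins for recorded latents).
fixed = np.tile(np.array([1.0, 2.0, 3.0]), (50, 1))   # absorbed: state never changes
hop = np.array([[1.0, 0.0], [0.0, 1.0]] * 25)         # alternates between orthogonal states

sim_fixed = stroboscopic_similarity(fixed, tail=10)
sim_hop = stroboscopic_similarity(hop, tail=10)
mean_sim, sem_sim = mean_and_sem([sim_fixed, sim_hop])
```

A value near 1 indicates an absorbed trajectory that repeats itself cycle after cycle, while lower values indicate ongoing rearrangement. The caption's choice of 10,000 cycles and a 1000-cycle averaging window would correspond to `tail=1000` here.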

Original abstract

We introduce cyclic denoising -- repeated forward and reverse diffusion at controlled noise amplitudes -- as an extraction attack for image diffusion models. Inspired by random organization in disordered solids, cyclic denoising exposes regions of the learned distribution that are largely inaccessible to standard sampling. The dynamics drive samples toward attractors with a broad stability spectrum. The deepest attractors are ultrastable: they regenerate after near-total corruption and persist through thousands of noising-denoising cycles. Many of these attractors correspond to memorized training images, including stock photographs, brand watermarks, and web-crawl artifacts. The attack requires only sampler-level control, with no gradients, weight inspection, prompts, captions, or prior knowledge of the training data. Unlike generate-and-filter attacks, which rely on large-scale prompted generation and post-hoc similarity or membership-inference filtering, our main protocol is fully unconditioned. We demonstrate the phenomenon in Stable Diffusion v1.4 and in a pixel-space DDPM, showing consistent behavior across latent- and pixel-space diffusion models. Across noise amplitudes, we observe a yielding-like transition: low-amplitude cycling produces trivial absorbing fixed points or limit cycles, while larger amplitudes induce rearrangements, basin hopping, and long-lived trapping in structured memorized attractor basins. We also observe hierarchical partial absorption, prompt-stabilized basins, and cross-initial-condition universality of the recovered attractor set. Our results therefore show that cyclic denoising is both a physics-inspired probe of generative landscapes and a practical tool for memorization auditing, with implications for privacy, copyright compliance, and model fingerprinting.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

Cyclic denoising pulls out ultrastable attractors that often match training images, but the memorization link needs explicit controls to hold up.


The main takeaway is that repeated forward-reverse diffusion cycles at fixed noise levels can trap samples in deep attractors that regenerate after heavy corruption and frequently align with training-set images such as stock photos and watermarks. The protocol runs with sampler access only and no prompts or gradients.

The paper does a few things cleanly. It borrows the random-organization idea from disordered solids and turns it into a concrete extraction method that works on both Stable Diffusion v1.4 and a pixel-space DDPM. The description of the yielding-like transition with noise amplitude, plus notes on hierarchical absorption and cross-initial-condition consistency, gives a usable map of the behavior. These observations are presented as direct outcomes of running the sampler rather than fitted parameters.

The soft spot is the verification step for memorization. The abstract states that many attractors correspond to training images, yet the stress-test concern is fair: without reported similarity thresholds, membership tests, or controls that measure how often the same procedure recovers near-duplicates of non-training images, the claim rests on visual inspection. If the full text supplies quantitative false-positive rates and exact-match criteria, that gap closes; otherwise the stability result stands on its own while the memorization interpretation stays provisional.

This is aimed at researchers who audit generative models for privacy or copyright issues and at people who want physics-style probes of learned distributions. A reader who needs a new, low-overhead attack or who studies basin structure in diffusion would get concrete value. The work shows honest engagement with the empirical landscape and prior attack literature, so it deserves a serious referee even if the verification details require tightening.

Referee Report

2 major / 1 minor

Summary. The paper introduces cyclic denoising—repeated forward and reverse diffusion at controlled noise amplitudes—as an extraction attack that drives diffusion models toward ultrastable attractors. These attractors are claimed to regenerate after near-total corruption, persist through thousands of cycles, and frequently correspond to memorized training images (stock photographs, watermarks, web artifacts). The method is fully unconditioned, requires only sampler control, and is demonstrated on Stable Diffusion v1.4 and a pixel-space DDPM, with observations of yielding-like transitions, hierarchical absorption, and cross-initial-condition universality.

Significance. If the claimed correspondence to memorized training images can be secured with explicit verification protocols, the work would provide a novel physics-inspired probe of generative landscapes and a practical, gradient-free tool for memorization auditing with implications for privacy and copyright compliance.

major comments (2)

[Abstract] The assertion that 'many of these attractors correspond to memorized training images, including stock photographs, brand watermarks, and web-crawl artifacts' is load-bearing for the central claim yet supplies no membership test, exact-match criterion, similarity threshold, or false-positive control (e.g., recovery rate on non-training images).
[Demonstration sections] (Stable Diffusion v1.4 and pixel-space DDPM experiments): no quantitative verification statistics, error bars, dataset sizes, or membership-inference results are reported to establish that recovered attractors are verifiably training-set images rather than de-novo model outputs.
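One way the similarity threshold and false-positive control requested above could look is sketched below. It is an assumption-laden illustration, not a protocol from the paper: images are represented by feature vectors from some unspecified embedding, a "match" is a maximum cosine similarity at or above a chosen threshold, and the control is a set of non-training images scored the same way. The names `max_cosine` and `match_rate`, the threshold of 0.95, and the synthetic data are all hypothetical.

```python
import numpy as np

def max_cosine(queries, reference):
    """For each query vector, the highest cosine similarity to any reference vector."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    r = reference / np.linalg.norm(reference, axis=1, keepdims=True)
    return (q @ r.T).max(axis=1)

def match_rate(queries, reference, threshold):
    """Fraction of queries whose best match meets the similarity threshold."""
    return float(np.mean(max_cosine(queries, reference) >= threshold))

# Synthetic stand-ins (ASSUMPTION: real use would embed actual images).
rng = np.random.default_rng(1)
train = rng.standard_normal((100, 128))     # embeddings of training images
control = rng.standard_normal((100, 128))   # embeddings of non-training images
# Pretend the recovered attractors are slightly perturbed copies of training items.
attractors = train[:10] + 0.05 * rng.standard_normal((10, 128))

hit_rate = match_rate(attractors, train, threshold=0.95)
false_positive_rate = match_rate(attractors, control, threshold=0.95)
```

Reporting both rates side by side is what separates "resembles a training image" from "resembles images in general". The threshold itself would need to be calibrated on known duplicates and known non-duplicates.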

minor comments (1)

[Abstract] The phrase 'yielding-like transition' invokes a physics analogy without a precise operational definition or reference to the disordered-solids literature.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive feedback. We address each major comment below, indicating planned revisions where the manuscript can be strengthened without misrepresenting our results.


Referee: [Abstract] The assertion that 'many of these attractors correspond to memorized training images, including stock photographs, brand watermarks, and web-crawl artifacts' is load-bearing for the central claim yet supplies no membership test, exact-match criterion, similarity threshold, or false-positive control (e.g., recovery rate on non-training images).

Authors: The current manuscript identifies memorized images through visual inspection of distinctive, recognizable features (e.g., brand watermarks and web artifacts) that standard sampling rarely produces. We agree this falls short of rigorous verification. We will revise the abstract to qualify the language and add a new subsection with quantitative metrics, including image similarity thresholds and controls on non-training images, for the experiments under our control.

Revision: yes

Referee: [Demonstration sections] (Stable Diffusion v1.4 and pixel-space DDPM experiments): no quantitative verification statistics, error bars, dataset sizes, or membership-inference results are reported to establish that recovered attractors are verifiably training-set images rather than de-novo model outputs.

Authors: We acknowledge the lack of quantitative statistics and error bars in the presented demonstrations. For the pixel-space DDPM (trained in-house), we will add dataset sizes, error bars, and basic verification statistics. For Stable Diffusion v1.4, we will explicitly discuss the limitations imposed by proprietary training data while retaining the qualitative observations.

Revision: partial

standing simulated objections not resolved

Full membership-inference testing for Stable Diffusion v1.4 cannot be performed because its training dataset is not publicly available.

Circularity Check

0 steps flagged

No circularity: empirical protocol with independent observations


The paper introduces cyclic denoising as a sampler-level protocol and reports direct empirical observations of attractor stability and correspondence to training data. No derivation chain, equations, or first-principles predictions are claimed. Results follow from applying the described forward-reverse cycling procedure to existing models without reduction to fitted parameters, self-definitions, or load-bearing self-citations. The method is presented as unconditioned and independent of training-data knowledge.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated.

pith-pipeline@v0.9.1-grok · 5815 in / 1132 out tokens · 19663 ms · 2026-06-26T08:26:49.629929+00:00 · methodology


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Flow Reasoning Models: Scaling Reasoning Through Iterative Self-Refinement
cs.AI 2026-06 conditional novelty 7.0

Flow models reach 99.2% Sudoku accuracy in 7 passes and 96.1% on out-of-distribution Sudoku-Extreme by selecting dynamically stable candidates and training with self-conditioning plus DPO to avoid failed outputs.

Reference graph

Works this paper leans on

16 extracted references · 5 canonical work pages · cited by 1 Pith paper

[1] Cort. Random organization in periodically driven systems. 2008. doi:10.1038/nphys891

[2] Self-Organization and Memory in a Disordered Solid Subject to Random Driving. Phys. Rev. Lett., 2025. doi:10.1103/PhysRevLett.134.178203

[3] Sharma, Rishabh and Karmakar, Smarajit. Active Hamiltonian. Nature Physics. doi:10.1038/s41567-024-02724-5

[4] Extracting Training Data from Diffusion Models. 2023, eprint.

[5] A Reproducible Extraction of Training Images from Diffusion Models. 2023, eprint.

[6] Antonio Sclocchi, Alessandro Favero, and Matthieu Wyart. A phase transition in diffusion models reveals the hierarchical nature of data. Proceedings of the National Academy of Sciences, 122(1):e2408799121, January 2025. doi:10.1073/pnas.2408799121

[7] Tony Bonnaire and Rapha. Why Diffusion Models Don. The Thirty-ninth Annual Conference on Neural Information Processing Systems.

[8] Navigating the Latent Space Dynamics of Neural Models. The Fourteenth International Conference on Learning Representations.

[9] Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models. 2022, eprint.

[10] Understanding and Mitigating Copying in Diffusion Models. 2023, eprint.

[11] SIDE: Surrogate Conditional Data Extraction from Diffusion Models. 2025, eprint.

[12] Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Björn. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

[13] Denoising Diffusion Probabilistic Models. 2020, eprint.

[14] Absorbing state dynamics of stochastic gradient descent. Phys. Rev. E, 2026. doi:10.1103/sbv7-syp7

[15] Memorization to Generalization: Emergence of Diffusion Models from Associative Memory. 2025, eprint.

[16] Werth, Andrew. Proceedings of Bridges 2015: Mathematics, Music, Art, Architecture, Culture, 2015.
