MGI: Member vs Generated Inference

Adam Dziedzic; Bihe Zhao; Franziska Boenisch; Juangui Xu; Michel Meintz

arxiv: 2606.23872 · v1 · pith:NP7IBOARnew · submitted 2026-06-22 · 💻 cs.LG · cs.AI

MGI: Member vs Generated Inference

Bihe Zhao , Michel Meintz , Juangui Xu , Franziska Boenisch , Adam Dziedzic This is my paper

Pith reviewed 2026-06-26 08:44 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords membership inferencegenerative modelsdata provenancediffusion modelsautoregressive modelsmodel derivativesimage generation

0 comments

The pith

A three-stage method combining autoencoder and latent generator signals distinguishes training data members from a model's own generated outputs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper formalizes Member vs Generated Inference as the task of determining, given a sample and a generative model, whether the sample came from the model's training set or was produced by the model itself. Existing membership inference attacks and attribution methods both fail at this task because they depend on likelihood signals that are high for both true training examples and the model's reproductions. The authors introduce Data Circuit Breaker, which extracts and fuses complementary signals from the model's autoencoder reconstruction and its latent generator to perform the distinction. Experiments across autoregressive and diffusion models show the method succeeds where prior approaches do not, including cases of near-duplicate reproduction and when new models are trained on the outputs of the original model.

Core claim

DCB is a three-stage procedure that uses reconstruction error and latent-space statistics from a generative model's autoencoder together with statistics from its latent generator to classify a sample as either a training member or a model-generated output; this combination remains reliable even when the model has memorized and can reproduce near-duplicates of its training data and continues to work in derivative-model settings where subsequent models are trained on the first model's outputs.

What carries the argument

Data Circuit Breaker (DCB): a three-stage pipeline that fuses reconstruction signals from the autoencoder with latent-generator signals to separate members from generated samples.

If this is right

DCB succeeds on both autoregressive and diffusion image models where membership inference and attribution baselines fail.
The method continues to separate members from generated samples even when the model reproduces near-duplicates of training data.
DCB generalizes to model-derivative settings in which new models are trained on the outputs of an earlier generative model.
The same three-stage signal combination addresses the systematic misclassification errors of likelihood-based methods.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If DCB works across model families, similar signal-fusion approaches could be explored for non-image generative tasks such as text or audio.
The distinction between member and generated data may affect how downstream systems decide whether to include model outputs in future training sets.
Practical deployment would require checking whether the three stages remain stable when the target model is fine-tuned or quantized after initial training.

Load-bearing premise

That signals from the autoencoder and latent generator remain sufficiently complementary and stable to separate members from generated samples across different model architectures and training regimes.

What would settle it

A controlled test set in which generated samples that are near-duplicates of training images are systematically labeled as members by DCB while true members are labeled as generated would falsify the method's claimed reliability.

Figures

Figures reproduced from arXiv: 2606.23872 by Adam Dziedzic, Bihe Zhao, Franziska Boenisch, Juangui Xu, Michel Meintz.

**Figure 1.** Figure 1: Overview of the new Member vs Generated Inference (MGI) task. The core challenge is separating genuine training membership from model generation, even across chains of models trained on generated data. Let N = NM ∪ NN denote a natural dataset, where NM ∩ NN = ∅. A generative model M1 is trained on the member set NM, while NN is held out as natural non-member data. After training, M1 produces a generated da… view at source ↗

**Figure 2.** Figure 2: Distributions of scores for membership inference attack and image attribution on IARs. In all cases, the differentiation between training (Train) and generated (Belonging) images is more difficult than between training (Train) and validation (Val) images. This indicates more difficult cases of the MGI (Member vs Generated Inference) than MI task. The evaluated model is VAR [24]. 0.30 0.25 0.20 0.15 0.10 0.… view at source ↗

**Figure 3.** Figure 3: Distributions of scores for memorized training samples vs re-generated cases. State-of-the-art membership inference (a) and attribution (b) methods fail to distinguish memorized training samples (Memorized Training) from their verbatim generated counterparts (Verbatim Re-generated), whereas our approach (c) clearly separates the two. The evaluated model is RAR-XXL [31]. IGM tends to be more self-consistent… view at source ↗

read the original abstract

As generative models increasingly produce samples that are indistinguishable from human-created content, it becomes difficult to determine whether a given data point was part of a model's natural training set or was generated by the model itself, especially when models memorize and reproduce training data. We formalize this challenge as Member vs Generated Inference (MGI): given a sample and a target generative model, infer whether the sample is a true training member or a generated output of that model. Focusing on image generation, we show that existing membership inference methods systematically misclassify generated samples as training members, while attribution-based methods often misclassify true members as generated. This failure arises because both approaches rely on likelihood-related signals that are similarly elevated for training examples and for the model's own outputs. To address MGI, we propose Data Circuit Breaker (DCB), a three-stage method that combines complementary signals from a generative model's autoencoder and latent generator to distinguish training members from generated samples. Across multiple generative models, including image autoregressive and diffusion models, DCB consistently addresses the shortcomings of membership inference and attribution methods, remains effective even when models reproduce near-duplicates of training samples, and generalizes to challenging model derivative settings in which new models are trained on generated data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper names a practical gap where membership inference and attribution both fail on generated samples but offers no data to show its DCB fix works.

read the letter

The main point is that the authors formalize Member vs Generated Inference as a distinct task from standard membership inference. They note that models which memorize training data produce outputs that existing likelihood-based methods treat the same as real members, and they propose using internal signals from the autoencoder and latent generator to separate the two cases.

What is new is the explicit framing of MGI and the observation that both membership inference and attribution methods run into the same signal problem. That distinction is useful for anyone thinking about auditing generative models or data provenance. The three-stage DCB approach is a reasonable attempt to combine complementary model-internal information rather than relying on one type of signal.

The paper does a decent job explaining the failure modes of prior methods on near-duplicates and on models trained on generated data. Those are realistic settings.

The soft spot is the total lack of evidence. The abstract claims consistent performance across autoregressive and diffusion models, effectiveness on near-duplicates, and generalization to derivative models, yet supplies no numbers, baselines, ablations, or error rates. Without those details it is impossible to judge whether the three-stage combination actually delivers or whether hidden assumptions break down.

This is for people working on ML privacy and generative model auditing who want to think about new attack surfaces. A reader focused on problem definitions might get value from the framing, but anyone needing validated methods should wait for the experiments.

I would send it to peer review if the full manuscript contains proper quantitative evaluation; the task definition is timely enough to justify referee time even if the current version is preliminary.

Referee Report

2 major / 0 minor

Summary. The paper formalizes Member vs Generated Inference (MGI) as the task of determining, given a sample and a generative model, whether the sample is a true training member or a model-generated output. It argues that membership inference methods misclassify generated samples as members and attribution methods misclassify members as generated, due to both relying on similarly elevated likelihood signals. To solve this, it introduces Data Circuit Breaker (DCB), a three-stage method that combines signals from a generative model's autoencoder and latent generator. The abstract claims DCB works consistently across image autoregressive and diffusion models, remains effective with near-duplicates of training samples, and generalizes to model-derivative settings where new models are trained on generated data.

Significance. If the empirical claims hold under rigorous testing, the work could contribute to data-provenance tools for generative models by providing a method that avoids the failure modes of standard membership and attribution approaches. The framing of MGI as a distinct problem and the use of internal model components (autoencoder + latent generator) for complementary signals are potentially useful, but the absence of any quantitative results, baselines, error bars, or methodology details in the provided text prevents assessment of whether the result is novel or practically impactful.

major comments (2)

[Abstract] Abstract: the central claim that 'DCB consistently addresses the shortcomings of membership inference and attribution methods' and 'remains effective even when models reproduce near-duplicates' is asserted without any reported quantitative results, baselines, ablation studies, or experimental methodology, making the soundness of the contribution impossible to evaluate from the manuscript text.
[Abstract] Abstract: the description of DCB as 'a three-stage method that combines complementary signals from a generative model's autoencoder and latent generator' provides no equations, pseudocode, or formal definition of the stages or combination rule, so it is not possible to verify whether the approach avoids the likelihood-signal circularity identified as the root cause of prior method failures.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their detailed review and for highlighting issues with the abstract's self-contained presentation. We agree that the abstract must better support its claims and will revise it (and cross-references) to improve evaluability while preserving its brevity. Below we respond point-by-point to the major comments.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that 'DCB consistently addresses the shortcomings of membership inference and attribution methods' and 'remains effective even when models reproduce near-duplicates' is asserted without any reported quantitative results, baselines, ablation studies, or experimental methodology, making the soundness of the contribution impossible to evaluate from the manuscript text.

Authors: The abstract is a concise summary; the full manuscript reports quantitative results, baselines (membership inference and attribution methods), ablations, error bars, and methodology details in Section 4 (Experiments) and the appendix. We will revise the abstract to include one or two key performance numbers (e.g., accuracy/F1 on autoregressive and diffusion models) and explicit pointers to Section 4, making the central claims directly evaluable from the abstract itself. revision: yes
Referee: [Abstract] Abstract: the description of DCB as 'a three-stage method that combines complementary signals from a generative model's autoencoder and latent generator' provides no equations, pseudocode, or formal definition of the stages or combination rule, so it is not possible to verify whether the approach avoids the likelihood-signal circularity identified as the root cause of prior method failures.

Authors: We agree the abstract description is high-level. Section 3 of the full manuscript defines the three stages formally, supplies the combination rule (a calibrated product of reconstruction error from the autoencoder and a latent-space consistency score from the generator), and includes pseudocode (Algorithm 1) together with an explicit argument why the two signals break the shared likelihood circularity. We will expand the abstract by one sentence that names the two signals and the combination rule, and we will add a parenthetical reference to Section 3. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The abstract and provided context frame DCB as a new three-stage combination of independent autoencoder and latent-generator signals to solve MGI. No equations, self-citations, or fitted parameters are shown that reduce the claimed distinction to a definition or input by construction. The derivation chain appears self-contained against external benchmarks with no load-bearing self-referential steps.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 2 invented entities

Ledger is sparse because only the abstract is available; no explicit free parameters, detailed axioms, or invented entities beyond the named task and method are described.

axioms (1)

domain assumption Likelihood-related signals are similarly elevated for both training members and model-generated outputs
Invoked in the abstract to explain why existing methods fail.

invented entities (2)

Member vs Generated Inference (MGI) no independent evidence
purpose: Formalized task of inferring whether a sample is a training member or generated output
Newly defined challenge in the abstract.
Data Circuit Breaker (DCB) no independent evidence
purpose: Three-stage method combining autoencoder and latent generator signals to solve MGI
Newly proposed method in the abstract.

pith-pipeline@v0.9.1-grok · 5754 in / 1322 out tokens · 35741 ms · 2026-06-26T08:44:30.511406+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

37 extracted references · 3 linked inside Pith

[1]

Self-consuming gener- ative models go MAD

Sina Alemohammad, Josue Casco-Rodriguez, Lorenzo Luzi, Ahmed Imtiaz Humayun, Hossein Babaei, Daniel LeJeune, Ali Siahkoohi, and Richard Baraniuk. Self-consuming gener- ative models go MAD. InThe Twelfth International Confer- ence on Learning Representations, 2024. 3

2024
[2]

Membership inference attacks from first principles

Nicholas Carlini, Steve Chien, Milad Nasr, Shuang Song, Andreas Terzis, and Florian Tram`er. Membership inference attacks from first principles. In2022 IEEE Symposium on Security and Privacy (SP), pages 1897–1914, 2022. 3, 8

1914
[3]

Extracting training data from diffusion models

Nicolas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito, and Eric Wallace. Extracting training data from diffusion models. In32nd USENIX Security Symposium (USENIX Security 23), pages 5253–5270, 2023. 1, 3

2023
[4]

Prada: Probability-ratio-based attribution and detec- tion of autoregressive-generated images.arXiv preprint arXiv:2511.20068, 2025

Simon Damm, Jonas Ricker, Henning Petzka, and Asja Fis- cher. Prada: Probability-ratio-based attribution and detec- tion of autoregressive-generated images.arXiv preprint arXiv:2511.20068, 2025. 2, 3, 4, 5, 7, 11, 12

Pith/arXiv arXiv 2025
[5]

Imagenet: A large-scale hierarchical image database

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009. 7

2009
[6]

CDI: Copyrighted Data Identification in Diffusion Models

Jan Dubi´nski, Antoni Kowalczuk, Franziska Boenisch, and Adam Dziedzic. CDI: Copyrighted Data Identification in Diffusion Models. InThe IEEE CVF Computer Vision and Pattern Recognition Conference (CVPR), 2025. 7

2025
[7]

Does learning require memorization? a short tale about a long tail

Vitaly Feldman. Does learning require memorization? a short tale about a long tail. InProceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pages 954–959, 2020. 3

2020
[8]

Classifier-free diffusion guidance.arXiv preprint arXiv:2207.12598, 2022

Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance.arXiv preprint arXiv:2207.12598, 2022. 4

Pith/arXiv arXiv 2022
[9]

Denoising Diffu- sion Probabilistic Models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising Diffu- sion Probabilistic Models. InConference on Neural Informa- tion Processing Systems (NeurIPS), pages 6840–6851, 2020. 2

2020
[10]

Watermarking autoregressive image generation.Advances in Neural Information Process- ing Systems, 38:71801–71848, 2026

Nikola Jovanovi ´c, Ismail Labiad, Tom ´aˇs Sou ˇcek, Martin Vechev, and Pierre Fernandez. Watermarking autoregressive image generation.Advances in Neural Information Process- ing Systems, 38:71801–71848, 2026. 16 9

2026
[11]

Privacy attacks on image autoregressive mod- els

Antoni Kowalczuk, Jan Dubi´nski, Franziska Boenisch, and Adam Dziedzic. Privacy attacks on image autoregressive mod- els. InForty-Second International Conference on Machine Learning (ICML), 2025. 1, 2, 3, 4, 5, 7, 11, 12

2025
[12]

Benchmarking empirical privacy protection for adaptations of large language models

Bartłomiej Marek, Lorenzo Rossi, Vincent Hanke, Xun Wang, Michael Backes, Franziska Boenisch, and Adam Dziedzic. Benchmarking empirical privacy protection for adaptations of large language models. InThe Fourteenth International Conference on Learning Representations, 2026. 3

2026
[13]

A self-supervised descriptor for image copy detection

Ed Pizzi, Sreya Dutta Roy, Sugosh Nagavara Ravindra, Priya Goyal, and Matthijs Douze. A self-supervised descriptor for image copy detection. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14532–14542, 2022. 4

2022
[14]

Learning transferable visual models from natural language supervision

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. Learning transferable visual models from natural language supervision. InInternational Conference on Machine Learning (ICML), pages 8748–8763,
[15]

High-resolution image synthesis with latent diffusion models

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj ¨orn Ommer. High-resolution image synthesis with latent diffusion models. InIEEE/CVF Con- ference on Computer Vision and Pattern Recognition, 2022. 7

2022
[16]

U- net: Convolutional networks for biomedical image segmenta- tion

Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U- net: Convolutional networks for biomedical image segmenta- tion. InMedical Image Computing and Computer-Assisted Intervention – MICCAI 2015, pages 234–241, Cham, 2015. Springer International Publishing. 7

2015
[17]

Natural identifiers for privacy and data audits in large language models

Lorenzo Rossi, Bartłomiej Marek, Franziska Boenisch, and Adam Dziedzic. Natural identifiers for privacy and data audits in large language models. InThe Fourteenth International Conference on Learning Representations, 2026. 3

2026
[18]

Ml-leaks: Model and data independent membership inference attacks and de- fenses on machine learning models

Ahmed Salem, Yang Zhang, Mathias Humbert, Pascal Berrang, Mario Fritz, and Michael Backes. Ml-leaks: Model and data independent membership inference attacks and de- fenses on machine learning models. InNetwork and Dis- tributed System Security Symposium (NDSS), 2019. 3

2019
[19]

Membership inference attacks against machine learning models

Reza Shokri, Marco Stronati, Congzheng Song, and Vitaly Shmatikov. Membership inference attacks against machine learning models. In2017 IEEE symposium on security and privacy (SP), pages 3–18. IEEE, 2017. 3

2017
[20]

Ai models collapse when trained on recursively generated data.Nature, 631 (8022):755–759, 2024

Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Nicolas Pa- pernot, Ross Anderson, and Yarin Gal. Ai models collapse when trained on recursively generated data.Nature, 631 (8022):755–759, 2024. 3

2024
[21]

Improved Techniques for Training Score-Based Generative Models

Yang Song and Stefano Ermon. Improved Techniques for Training Score-Based Generative Models. InConference on Neural Information Processing Systems (NeurIPS), pages 12438–12448, 2020. 2

2020
[22]

Autoregressive model beats diffusion: Llama for scalable image generation.arXiv preprint arXiv:2406.06525, 2024

Peize Sun, Yi Jiang, Shoufa Chen, Shilong Zhang, Bingyue Peng, Ping Luo, and Zehuan Yuan. Autoregressive model beats diffusion: Llama for scalable image generation.arXiv preprint arXiv:2406.06525, 2024. 3, 7

Pith/arXiv arXiv 2024
[23]

Visual autoregressive modeling: Scalable image generation via next-scale prediction.Advances in neural information processing systems, 37:84839–84865, 2024

Keyu Tian, Yi Jiang, Zehuan Yuan, Bingyue Peng, and Li- wei Wang. Visual autoregressive modeling: Scalable image generation via next-scale prediction.Advances in neural information processing systems, 37:84839–84865, 2024. 3

2024
[24]

Visual autoregressive modeling: Scalable image generation via next-scale prediction, 2024

Keyu Tian, Yi Jiang, Zehuan Yuan, Bingyue Peng, and Li- wei Wang. Visual autoregressive modeling: Scalable image generation via next-scale prediction, 2024. 5, 7, 12, 13, 14

2024
[25]

Gomez, Lukasz Kaiser, and Illia Polosukhin

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkor- eit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. InConference on Neural Information Processing Systems (NeurIPS), pages 5998–6008, 2017. 2

2017
[26]

Aedr: Training-free ai-generated image attri- bution via autoencoder double-reconstruction.arXiv preprint arXiv:2507.18988, 2025

Chao Wang, Kejiang Chen, Zijin Yang, Yaofei Wang, and Weiming Zhang. Aedr: Training-free ai-generated image attri- bution via autoencoder double-reconstruction.arXiv preprint arXiv:2507.18988, 2025. 5

arXiv 2025
[27]

Memorization in self-supervised learning improves down- stream generalization

Wenhao Wang, Muhammad Ahmad Kaleem, Adam Dziedzic, Michael Backes, Nicolas Papernot, and Franziska Boenisch. Memorization in self-supervised learning improves down- stream generalization. InThe Twelfth International Confer- ence on Learning Representations (ICLR), 2024. 3

2024
[28]

Reconstruction vs

Jingfeng Yao, Bin Yang, and Xinggang Wang. Reconstruction vs. generation: Taming optimization dilemma in latent diffu- sion models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025. 15

2025
[29]

Icas: Detecting training data from autoregressive image gen- erative models

Hongyao Yu, Yixiang Qiu, Yiheng Yang, Hao Fang, Tianqu Zhuang, Jiaxin Hong, Bin Chen, Hao Wu, and Shu-Tao Xia. Icas: Detecting training data from autoregressive image gen- erative models. InProceedings of the 33rd ACM International Conference on Multimedia, pages 11209–11217, 2025. 2, 3, 4, 5, 7, 8, 13

2025
[30]

Randomized autoregressive visual generation,

Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, and Liang- Chieh Chen. Randomized autoregressive visual generation,
[31]

Randomized autoregressive visual generation

Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, and Liang- Chieh Chen. Randomized autoregressive visual generation. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 18431–18441, 2025. 3, 5, 12, 13, 14

2025
[32]

Representa- tion alignment for generation: Training diffusion transform- ers is easier than you think

Sihyun Yu, Sangkyung Kwak, Huiwon Jang, Jongheon Jeong, Jonathan Huang, Jinwoo Shin, and Saining Xie. Representa- tion alignment for generation: Training diffusion transform- ers is easier than you think. InInternational Conference on Learning Representations, 2025. 15

2025
[33]

Low-cost high-power membership inference attacks

Sajjad Zarifzadeh, Philippe Liu, and Reza Shokri. Low-cost high-power membership inference attacks. InForty-first In- ternational Conference on Machine Learning, 2024. 3, 8

2024
[34]

Mem- bership inference on text-to-image diffusion models via con- ditional likelihood discrepancy

Shengfang Zhai, Huanran Chen, Yinpeng Dong, Jiajun Li, Qingni Shen, Yansong Gao, Hang Su, and Yang Liu. Mem- bership inference on text-to-image diffusion models via con- ditional likelihood discrepancy. InThe Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. 2, 3, 4, 7, 11, 13

2024
[35]

Data provenance for image auto-regressive generation

Bihe Zhao, Louis Kerner, Michel Meintz, Tameem Bakr, Franziska Boenisch, and Adam Dziedzic. Data provenance for image auto-regressive generation. InThe Fourteenth In- ternational Conference on Learning Representations, 2026. 3, 6

2026
[36]

Data provenance 10 for image auto-regressive generation

Bihe Zhao, Louis Kerner, Michel Meintz, Tameem Bakr, Franziska Boenisch, and Adam Dziedzic. Data provenance 10 for image auto-regressive generation. InThe Fourteenth In- ternational Conference on Learning Representations (ICLR),
[37]

Belonging

16 A. Further Implementation Details A.1. Data Pre-processing We follow the augmentations in the original training recipes of each model to ensure that our evaluation faithfully reflects the conditions under which membership and generation sig- nals arise. For V AR and LlamaGen, when computing the MIA scores on M1, we apply the same data augmentations use...

arXiv

[1] [1]

Self-consuming gener- ative models go MAD

Sina Alemohammad, Josue Casco-Rodriguez, Lorenzo Luzi, Ahmed Imtiaz Humayun, Hossein Babaei, Daniel LeJeune, Ali Siahkoohi, and Richard Baraniuk. Self-consuming gener- ative models go MAD. InThe Twelfth International Confer- ence on Learning Representations, 2024. 3

2024

[2] [2]

Membership inference attacks from first principles

Nicholas Carlini, Steve Chien, Milad Nasr, Shuang Song, Andreas Terzis, and Florian Tram`er. Membership inference attacks from first principles. In2022 IEEE Symposium on Security and Privacy (SP), pages 1897–1914, 2022. 3, 8

1914

[3] [3]

Extracting training data from diffusion models

Nicolas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito, and Eric Wallace. Extracting training data from diffusion models. In32nd USENIX Security Symposium (USENIX Security 23), pages 5253–5270, 2023. 1, 3

2023

[4] [4]

Prada: Probability-ratio-based attribution and detec- tion of autoregressive-generated images.arXiv preprint arXiv:2511.20068, 2025

Simon Damm, Jonas Ricker, Henning Petzka, and Asja Fis- cher. Prada: Probability-ratio-based attribution and detec- tion of autoregressive-generated images.arXiv preprint arXiv:2511.20068, 2025. 2, 3, 4, 5, 7, 11, 12

Pith/arXiv arXiv 2025

[5] [5]

Imagenet: A large-scale hierarchical image database

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009. 7

2009

[6] [6]

CDI: Copyrighted Data Identification in Diffusion Models

Jan Dubi´nski, Antoni Kowalczuk, Franziska Boenisch, and Adam Dziedzic. CDI: Copyrighted Data Identification in Diffusion Models. InThe IEEE CVF Computer Vision and Pattern Recognition Conference (CVPR), 2025. 7

2025

[7] [7]

Does learning require memorization? a short tale about a long tail

Vitaly Feldman. Does learning require memorization? a short tale about a long tail. InProceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pages 954–959, 2020. 3

2020

[8] [8]

Classifier-free diffusion guidance.arXiv preprint arXiv:2207.12598, 2022

Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance.arXiv preprint arXiv:2207.12598, 2022. 4

Pith/arXiv arXiv 2022

[9] [9]

Denoising Diffu- sion Probabilistic Models

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising Diffu- sion Probabilistic Models. InConference on Neural Informa- tion Processing Systems (NeurIPS), pages 6840–6851, 2020. 2

2020

[10] [10]

Watermarking autoregressive image generation.Advances in Neural Information Process- ing Systems, 38:71801–71848, 2026

Nikola Jovanovi ´c, Ismail Labiad, Tom ´aˇs Sou ˇcek, Martin Vechev, and Pierre Fernandez. Watermarking autoregressive image generation.Advances in Neural Information Process- ing Systems, 38:71801–71848, 2026. 16 9

2026

[11] [11]

Privacy attacks on image autoregressive mod- els

Antoni Kowalczuk, Jan Dubi´nski, Franziska Boenisch, and Adam Dziedzic. Privacy attacks on image autoregressive mod- els. InForty-Second International Conference on Machine Learning (ICML), 2025. 1, 2, 3, 4, 5, 7, 11, 12

2025

[12] [12]

Benchmarking empirical privacy protection for adaptations of large language models

Bartłomiej Marek, Lorenzo Rossi, Vincent Hanke, Xun Wang, Michael Backes, Franziska Boenisch, and Adam Dziedzic. Benchmarking empirical privacy protection for adaptations of large language models. InThe Fourteenth International Conference on Learning Representations, 2026. 3

2026

[13] [13]

A self-supervised descriptor for image copy detection

Ed Pizzi, Sreya Dutta Roy, Sugosh Nagavara Ravindra, Priya Goyal, and Matthijs Douze. A self-supervised descriptor for image copy detection. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14532–14542, 2022. 4

2022

[14] [14]

Learning transferable visual models from natural language supervision

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. Learning transferable visual models from natural language supervision. InInternational Conference on Machine Learning (ICML), pages 8748–8763,

[15] [15]

High-resolution image synthesis with latent diffusion models

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj ¨orn Ommer. High-resolution image synthesis with latent diffusion models. InIEEE/CVF Con- ference on Computer Vision and Pattern Recognition, 2022. 7

2022

[16] [16]

U- net: Convolutional networks for biomedical image segmenta- tion

Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U- net: Convolutional networks for biomedical image segmenta- tion. InMedical Image Computing and Computer-Assisted Intervention – MICCAI 2015, pages 234–241, Cham, 2015. Springer International Publishing. 7

2015

[17] [17]

Natural identifiers for privacy and data audits in large language models

Lorenzo Rossi, Bartłomiej Marek, Franziska Boenisch, and Adam Dziedzic. Natural identifiers for privacy and data audits in large language models. InThe Fourteenth International Conference on Learning Representations, 2026. 3

2026

[18] [18]

Ml-leaks: Model and data independent membership inference attacks and de- fenses on machine learning models

Ahmed Salem, Yang Zhang, Mathias Humbert, Pascal Berrang, Mario Fritz, and Michael Backes. Ml-leaks: Model and data independent membership inference attacks and de- fenses on machine learning models. InNetwork and Dis- tributed System Security Symposium (NDSS), 2019. 3

2019

[19] [19]

Membership inference attacks against machine learning models

Reza Shokri, Marco Stronati, Congzheng Song, and Vitaly Shmatikov. Membership inference attacks against machine learning models. In2017 IEEE symposium on security and privacy (SP), pages 3–18. IEEE, 2017. 3

2017

[20] [20]

Ai models collapse when trained on recursively generated data.Nature, 631 (8022):755–759, 2024

Ilia Shumailov, Zakhar Shumaylov, Yiren Zhao, Nicolas Pa- pernot, Ross Anderson, and Yarin Gal. Ai models collapse when trained on recursively generated data.Nature, 631 (8022):755–759, 2024. 3

2024

[21] [21]

Improved Techniques for Training Score-Based Generative Models

Yang Song and Stefano Ermon. Improved Techniques for Training Score-Based Generative Models. InConference on Neural Information Processing Systems (NeurIPS), pages 12438–12448, 2020. 2

2020

[22] [22]

Autoregressive model beats diffusion: Llama for scalable image generation.arXiv preprint arXiv:2406.06525, 2024

Peize Sun, Yi Jiang, Shoufa Chen, Shilong Zhang, Bingyue Peng, Ping Luo, and Zehuan Yuan. Autoregressive model beats diffusion: Llama for scalable image generation.arXiv preprint arXiv:2406.06525, 2024. 3, 7

Pith/arXiv arXiv 2024

[23] [23]

Visual autoregressive modeling: Scalable image generation via next-scale prediction.Advances in neural information processing systems, 37:84839–84865, 2024

Keyu Tian, Yi Jiang, Zehuan Yuan, Bingyue Peng, and Li- wei Wang. Visual autoregressive modeling: Scalable image generation via next-scale prediction.Advances in neural information processing systems, 37:84839–84865, 2024. 3

2024

[24] [24]

Visual autoregressive modeling: Scalable image generation via next-scale prediction, 2024

Keyu Tian, Yi Jiang, Zehuan Yuan, Bingyue Peng, and Li- wei Wang. Visual autoregressive modeling: Scalable image generation via next-scale prediction, 2024. 5, 7, 12, 13, 14

2024

[25] [25]

Gomez, Lukasz Kaiser, and Illia Polosukhin

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkor- eit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. InConference on Neural Information Processing Systems (NeurIPS), pages 5998–6008, 2017. 2

2017

[26] [26]

Aedr: Training-free ai-generated image attri- bution via autoencoder double-reconstruction.arXiv preprint arXiv:2507.18988, 2025

Chao Wang, Kejiang Chen, Zijin Yang, Yaofei Wang, and Weiming Zhang. Aedr: Training-free ai-generated image attri- bution via autoencoder double-reconstruction.arXiv preprint arXiv:2507.18988, 2025. 5

arXiv 2025

[27] [27]

Memorization in self-supervised learning improves down- stream generalization

Wenhao Wang, Muhammad Ahmad Kaleem, Adam Dziedzic, Michael Backes, Nicolas Papernot, and Franziska Boenisch. Memorization in self-supervised learning improves down- stream generalization. InThe Twelfth International Confer- ence on Learning Representations (ICLR), 2024. 3

2024

[28] [28]

Reconstruction vs

Jingfeng Yao, Bin Yang, and Xinggang Wang. Reconstruction vs. generation: Taming optimization dilemma in latent diffu- sion models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025. 15

2025

[29] [29]

Icas: Detecting training data from autoregressive image gen- erative models

Hongyao Yu, Yixiang Qiu, Yiheng Yang, Hao Fang, Tianqu Zhuang, Jiaxin Hong, Bin Chen, Hao Wu, and Shu-Tao Xia. Icas: Detecting training data from autoregressive image gen- erative models. InProceedings of the 33rd ACM International Conference on Multimedia, pages 11209–11217, 2025. 2, 3, 4, 5, 7, 8, 13

2025

[30] [30]

Randomized autoregressive visual generation,

Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, and Liang- Chieh Chen. Randomized autoregressive visual generation,

[31] [31]

Randomized autoregressive visual generation

Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, and Liang- Chieh Chen. Randomized autoregressive visual generation. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 18431–18441, 2025. 3, 5, 12, 13, 14

2025

[32] [32]

Representa- tion alignment for generation: Training diffusion transform- ers is easier than you think

Sihyun Yu, Sangkyung Kwak, Huiwon Jang, Jongheon Jeong, Jonathan Huang, Jinwoo Shin, and Saining Xie. Representa- tion alignment for generation: Training diffusion transform- ers is easier than you think. InInternational Conference on Learning Representations, 2025. 15

2025

[33] [33]

Low-cost high-power membership inference attacks

Sajjad Zarifzadeh, Philippe Liu, and Reza Shokri. Low-cost high-power membership inference attacks. InForty-first In- ternational Conference on Machine Learning, 2024. 3, 8

2024

[34] [34]

Mem- bership inference on text-to-image diffusion models via con- ditional likelihood discrepancy

Shengfang Zhai, Huanran Chen, Yinpeng Dong, Jiajun Li, Qingni Shen, Yansong Gao, Hang Su, and Yang Liu. Mem- bership inference on text-to-image diffusion models via con- ditional likelihood discrepancy. InThe Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024. 2, 3, 4, 7, 11, 13

2024

[35] [35]

Data provenance for image auto-regressive generation

Bihe Zhao, Louis Kerner, Michel Meintz, Tameem Bakr, Franziska Boenisch, and Adam Dziedzic. Data provenance for image auto-regressive generation. InThe Fourteenth In- ternational Conference on Learning Representations, 2026. 3, 6

2026

[36] [36]

Data provenance 10 for image auto-regressive generation

Bihe Zhao, Louis Kerner, Michel Meintz, Tameem Bakr, Franziska Boenisch, and Adam Dziedzic. Data provenance 10 for image auto-regressive generation. InThe Fourteenth In- ternational Conference on Learning Representations (ICLR),

[37] [37]

Belonging

16 A. Further Implementation Details A.1. Data Pre-processing We follow the augmentations in the original training recipes of each model to ensure that our evaluation faithfully reflects the conditions under which membership and generation sig- nals arise. For V AR and LlamaGen, when computing the MIA scores on M1, we apply the same data augmentations use...

arXiv