Empirical Evaluation of Structured Synthetic Data Privacy Metrics: Novel experimental framework

2025 · cs.CR · arXiv 2512.16284

1 Pith paper cites this work. Polarity classification is still indexing.



abstract

Synthetic data generation is gaining traction as a privacy-enhancing technology (PET). When properly generated, synthetic data preserve the analytic utility of real data while avoiding the retention of information that would allow the identification of specific individuals. However, the concept of data privacy remains elusive, making it challenging for practitioners to evaluate and benchmark the degree of privacy protection offered by synthetic data. In this paper, we propose a framework to empirically assess the efficacy of tabular synthetic data privacy quantification methods through controlled, deliberate risk insertion. To demonstrate this framework, we survey existing approaches to synthetic data privacy quantification and the related legal theory. We then apply the framework to the main privacy quantification methods with no-box threat models on publicly available datasets.
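The idea of controlled risk insertion can be illustrated with a minimal sketch. This is not the paper's procedure, which the abstract does not spell out. It assumes a toy numeric table, a hypothetical leak mechanism (replacing a chosen fraction of synthetic rows with verbatim copies of real rows), and a simple distance-to-closest-record (DCR) check as the no-box metric under test. All function names are illustrative.

```python
import random

def dcr(record, reference):
    """Distance to closest record: smallest Euclidean distance
    from `record` to any row of `reference`."""
    return min(
        sum((a - b) ** 2 for a, b in zip(record, row)) ** 0.5
        for row in reference
    )

def share_too_close(synthetic, real, threshold=1e-9):
    """Fraction of synthetic rows lying within `threshold` of some real row."""
    hits = sum(1 for row in synthetic if dcr(row, real) <= threshold)
    return hits / len(synthetic)

def insert_risk(synthetic, real, fraction, rng):
    """Assumed leak mechanism: overwrite `fraction` of the synthetic rows
    with verbatim copies of randomly chosen real rows."""
    leaked = list(synthetic)
    n_leak = round(fraction * len(leaked))
    for idx in rng.sample(range(len(leaked)), n_leak):
        leaked[idx] = rng.choice(real)
    return leaked

rng = random.Random(0)
# Toy stand-ins for a real table and an independently sampled synthetic table.
real = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
synthetic = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]

# Score the metric at several known, deliberately inserted risk levels.
scores = {
    fraction: share_too_close(insert_risk(synthetic, real, fraction, rng), real)
    for fraction in (0.0, 0.1, 0.5)
}
for fraction, score in scores.items():
    print(f"inserted leak fraction {fraction:.1f} -> metric {score:.3f}")
```

Under these assumptions, a metric that is sensitive to this kind of leak should rise as the inserted fraction rises, while a metric that stays flat would be flagged as insensitive to it.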

representative citing papers

ReMIA: a Powerful and Efficient Alternative to Membership Inference Attacks against Synthetic Data Generators

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

ReMIA offers a practical privacy metric for synthetic data by training two generators and using a classifier to detect source dataset membership, achieving sensitivity comparable to standard MIAs with far less computation.

citing papers explorer


ReMIA: a Powerful and Efficient Alternative to Membership Inference Attacks against Synthetic Data Generators

cs.LG · 2026-05-14 · unverdicted · none · ref 35 · internal anchor
