Adversarial perturbations reliably fabricate membership signals in vision-model MIAs, separated by a gradient-norm collapse trajectory that enables robust detection and inference.
On the importance of difficulty calibra- tion in membership inference attacks
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
method 1polarities
use method 1representative citing papers
Finetuning open LMs on ChatGPT outputs creates models that mimic style and fool human raters but fail to close the performance gap to proprietary systems on tasks not well-represented in the imitation data.
Advanced generalization techniques such as augmentation and early stopping can reduce membership inference attack success rates by up to 100 times, confirmed across more than 1,000 models.
citing papers explorer
-
A Unified Perspective on Adversarial Membership Manipulation in Vision Models
Adversarial perturbations reliably fabricate membership signals in vision-model MIAs, separated by a gradient-norm collapse trajectory that enables robust detection and inference.
-
The False Promise of Imitating Proprietary LLMs
Finetuning open LMs on ChatGPT outputs creates models that mimic style and fool human raters but fail to close the performance gap to proprietary systems on tasks not well-represented in the imitation data.
-
Generalization and Membership Inference Attack a Practical Perspective
Advanced generalization techniques such as augmentation and early stopping can reduce membership inference attack success rates by up to 100 times, confirmed across more than 1,000 models.