Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting

Samuel Yeom, Irene Giacomelli, Matt Fredrikson, Somesh Jha · 2018 · arXiv 2018.00027

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Fair Finetuning Mitigates Distribution Inference Attacks

cs.LG · 2026-06-01 · conditional · novelty 7.0

Fair fine-tuning under Equalized Odds yields a tight bound Adv(A, M_f) ≤ Δ_EO · W on adversarial advantage in distribution inference attacks, with empirical reductions below detection threshold across six datasets.

MRMMIA: Membership Inference Attacks on Memory in Chat Agents

cs.CR · 2026-05-27 · unverdicted · novelty 7.0

MRMMIA is a multi-recall-probe membership inference attack that extracts signals from chat agent memory and outperforms baselines in black-, gray-, and white-box settings.

Measuring the Depth of LLM Unlearning via Activation Patching

cs.CL · 2026-05-23 · unverdicted · novelty 7.0

Introduces Unlearning Depth Score (UDS) via activation patching to quantify LLM unlearning depth and claims it outperforms 20 other metrics in faithfulness and robustness on 150 models.

PACZero: PAC-Private Fine-Tuning of Language Models via Sign Quantization

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

PACZero achieves zero mutual information privacy in LLM fine-tuning via sign-quantized subset-aggregated ZO gradients, delivering near non-private accuracy on SST-2 at I=0.

Detecting Pretraining Data from Large Language Models

cs.CL · 2023-10-25 · conditional · novelty 7.0

Min-K% Prob detects pretraining data in LLMs by flagging outlier low-probability words in text, achieving 7.4% better performance than prior methods on the new WIKIMIA benchmark.

Evaluating Differential Privacy Against Membership Inference in Federated Learning: Insights from the NIST Genomics Red Team Challenge

cs.CR · 2026-04-14 · unverdicted · novelty 5.0

Stacking seven black-box estimators into a meta-classifier reveals persistent membership leakage in differentially private federated learning models at epsilon=200 on NIST genomics data, outperforming single-signal baselines.

Chernoff Information as a Privacy Constraint for Adversarial Classification and Membership Advantage

cs.IT · 2024-03-15 · unverdicted · novelty 5.0

Chernoff DP is sandwiched between KL DP and ε-DP, outperforms KL in numerical Laplace-mechanism tests, and yields a new upper bound on adversary membership advantage compared with (ε,δ)-DP bounds.

Towards the Anonymization of the Language Modeling

cs.CL · 2025-01-05 · unverdicted · novelty 4.0

Authors introduce MLM and CLM specialization methods that avoid memorizing identifiers in sensitive training data while aiming for a privacy-utility tradeoff on medical datasets.

Software Engineering for Self-Adaptive Robotics: A Research Agenda

cs.SE · 2025-05-26 · unverdicted · novelty 3.0

This paper proposes a research agenda for software engineering of self-adaptive robotic systems along lifecycle stages and enabling technologies, identifying challenges and a roadmap to 2030.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Fair Finetuning Mitigates Distribution Inference Attacks cs.LG · 2026-06-01 · conditional · none · ref 3
Fair fine-tuning under Equalized Odds yields a tight bound Adv(A, M_f) ≤ Δ_EO · W on adversarial advantage in distribution inference attacks, with empirical reductions below detection threshold across six datasets.
Detecting Pretraining Data from Large Language Models cs.CL · 2023-10-25 · conditional · none · ref 66
Min-K% Prob detects pretraining data in LLMs by flagging outlier low-probability words in text, achieving 7.4% better performance than prior methods on the new WIKIMIA benchmark.

Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer