Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting

· 2018 · arXiv 2018.00027

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Detecting Pretraining Data from Large Language Models

cs.CL · 2023-10-25 · conditional · novelty 7.0

Min-K% Prob detects pretraining data in LLMs by flagging outlier low-probability words in text, achieving 7.4% better performance than prior methods on the new WIKIMIA benchmark.

Evaluating Differential Privacy Against Membership Inference in Federated Learning: Insights from the NIST Genomics Red Team Challenge

cs.CR · 2026-04-14 · unverdicted · novelty 5.0

Stacking seven black-box estimators into a meta-classifier reveals persistent membership leakage in differentially private federated learning models at epsilon=200 on NIST genomics data, outperforming single-signal baselines.

Chernoff Information as a Privacy Constraint for Adversarial Classification and Membership Advantage

cs.IT · 2024-03-15 · unverdicted · novelty 5.0

Chernoff DP is sandwiched between KL DP and ε-DP, outperforms KL in numerical Laplace-mechanism tests, and yields a new upper bound on adversary membership advantage compared with (ε,δ)-DP bounds.

Towards the Anonymization of the Language Modeling

cs.CL · 2025-01-05 · unverdicted · novelty 4.0

Authors introduce MLM and CLM specialization methods that avoid memorizing identifiers in sensitive training data while aiming for a privacy-utility tradeoff on medical datasets.

Software Engineering for Self-Adaptive Robotics: A Research Agenda

cs.SE · 2025-05-26 · unverdicted · novelty 3.0

This paper proposes a research agenda for software engineering of self-adaptive robotic systems along lifecycle stages and enabling technologies, identifying challenges and a roadmap to 2030.

citing papers explorer

Showing 5 of 5 citing papers.

Detecting Pretraining Data from Large Language Models cs.CL · 2023-10-25 · conditional · none · ref 66
Min-K% Prob detects pretraining data in LLMs by flagging outlier low-probability words in text, achieving 7.4% better performance than prior methods on the new WIKIMIA benchmark.
Evaluating Differential Privacy Against Membership Inference in Federated Learning: Insights from the NIST Genomics Red Team Challenge cs.CR · 2026-04-14 · unverdicted · none · ref 9
Stacking seven black-box estimators into a meta-classifier reveals persistent membership leakage in differentially private federated learning models at epsilon=200 on NIST genomics data, outperforming single-signal baselines.
Chernoff Information as a Privacy Constraint for Adversarial Classification and Membership Advantage cs.IT · 2024-03-15 · unverdicted · none · ref 25
Chernoff DP is sandwiched between KL DP and ε-DP, outperforms KL in numerical Laplace-mechanism tests, and yields a new upper bound on adversary membership advantage compared with (ε,δ)-DP bounds.
Towards the Anonymization of the Language Modeling cs.CL · 2025-01-05 · unverdicted · none · ref 64
Authors introduce MLM and CLM specialization methods that avoid memorizing identifiers in sensitive training data while aiming for a privacy-utility tradeoff on medical datasets.
Software Engineering for Self-Adaptive Robotics: A Research Agenda cs.SE · 2025-05-26 · unverdicted · none · ref 32
This paper proposes a research agenda for software engineering of self-adaptive robotic systems along lifecycle stages and enabling technologies, identifying challenges and a roadmap to 2030.

Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer