Learning Differentially Private Recurrent Language Models

Daniel Ramage; H. Brendan McMahan; Kunal Talwar; Li Zhang

arxiv: 1710.06963 · v3 · pith:6KBFRPFVnew · submitted 2017-10-18 · 💻 cs.LG

Learning Differentially Private Recurrent Language Models

H. Brendan McMahan , Daniel Ramage , Kunal Talwar , Li Zhang This is my paper

classification 💻 cs.LG

keywords largemodelsprivacylanguageuser-levelworkcostdata

0 comments

read the original abstract

We demonstrate that it is possible to train large recurrent language models with user-level differential privacy guarantees with only a negligible cost in predictive accuracy. Our work builds on recent advances in the training of deep networks on user-partitioned data and privacy accounting for stochastic gradient descent. In particular, we add user-level privacy protection to the federated averaging algorithm, which makes "large step" updates from user-level data. Our work demonstrates that given a dataset with a sufficiently large number of users (a requirement easily met by even small internet-scale datasets), achieving differential privacy comes at the cost of increased computation, rather than in decreased utility as in most prior work. We find that our private LSTM language models are quantitatively and qualitatively similar to un-noised models when trained on a large dataset.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 11 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Act in Collusion: Distributed Multi-Target Backdoor Attacks in Federated Learning
cs.CV 2024-11 unverdicted novelty 7.0

DMBA maintains attack success rates above 80% for all backdoors in a distributed multi-target FL setting where baselines drop below 50%.
Statistical Limits and Efficient Algorithms for Differentially Private Federated Learning
stat.ML 2026-05 unverdicted novelty 6.0

Introduces FedHybrid and FedNewton for DP federated M-estimation, with finite-sample MSE bounds, minimax lower bound, and evaluations on vision datasets.
Response-Conditioned Parallel-to-Sequential Orchestration for Multi-Agent Systems
cs.CL 2026-05 unverdicted novelty 6.0

Nexa learns a response-conditioned policy that starts with parallel agent execution and adds at most one round of sequential message passing via a predicted sparse DAG, strictly subsuming pure parallel mode.
Practical Quantum Federated Learning for Privacy-Sensitive Healthcare: Communication Efficiency and Noise Resilience
quant-ph 2026-03 unverdicted novelty 6.0

Hybrid QFL cuts quantum transmissions from 3TNMP to {3t + 2(T-t)}NMP over T rounds while preserving near-centralized convergence and improving depolarizing-noise resilience via decentralized aggregation and Steane-code QEC.
Adaptive Federated Optimization
cs.LG 2020-02 unverdicted novelty 6.0

Proposes federated adaptive optimizers (FedAdagrad, FedAdam, FedYogi) with convergence analysis for non-convex objectives under data heterogeneity and reports empirical gains over FedAvg.
When Determinants Are Not Enough: Private Rare Switching
cs.LG 2026-05 unverdicted novelty 5.0

Replaces determinant growth with generalized Rayleigh quotient for rare switching in private linear bandits to control worst-direction volume despite non-monotonic design matrices from noise.
DP-LAC: Lightweight Adaptive Clipping for Differentially Private Federated Fine-tuning of Language Models
cs.LG 2026-05 unverdicted novelty 5.0

DP-LAC provides a new adaptive clipping technique for DP-SGD in federated LLM fine-tuning that improves accuracy by 6.6% on average without consuming additional privacy budget or requiring new hyperparameters.
Enhanced Privacy and Communication Efficiency in Non-IID Federated Learning with Adaptive Quantization and Differential Privacy
cs.CV 2026-04 unverdicted novelty 5.0

Adaptive bit-length schedulers plus Laplacian DP in non-IID FL reduce communicated data by up to 52.64% on MNIST and 45% on CIFAR-10 while keeping competitive accuracy and privacy.
Secure, Verifiable, and Scalable Multi-Client Data Sharing via Consensus-Based Privacy-Preserving Data Distribution
cs.CR 2026-01 unverdicted novelty 5.0

CPPDD is a new consensus-based protocol for privacy-preserving multi-client data sharing that achieves unanimous-release confidentiality, linear scalability, and high-probability malicious deviation detection.
The Value of Collaboration in Convex Machine Learning with Differential Privacy
cs.CR 2019-06 conditional novelty 4.0

The fitness difference between DP and non-private convex ML models is inversely proportional to training dataset size squared and privacy budget squared.
Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New Solutions
cs.LG 2024-06 unverdicted novelty 2.0

A survey organizing knowledge distillation techniques for addressing privacy, heterogeneity, communication, and personalization challenges in federated learning.