The Price of Differential Privacy For Online Learning

Karan Singh; Naman Agarwal

arxiv: 1701.07953 · v2 · pith:PKIXKDRPnew · submitted 2017-01-27 · 💻 cs.LG · stat.ML

The Price of Differential Privacy For Online Learning

Naman Agarwal , Karan Singh This is my paper

classification 💻 cs.LG stat.ML

keywords epsilonfracregrettildeleftrightsqrtbandit

0 comments

read the original abstract

We design differentially private algorithms for the problem of online linear optimization in the full information and bandit settings with optimal $\tilde{O}(\sqrt{T})$ regret bounds. In the full-information setting, our results demonstrate that $\epsilon$-differential privacy may be ensured for free -- in particular, the regret bounds scale as $O(\sqrt{T})+\tilde{O}\left(\frac{1}{\epsilon}\right)$. For bandit linear optimization, and as a special case, for non-stochastic multi-armed bandits, the proposed algorithm achieves a regret of $\tilde{O}\left(\frac{1}{\epsilon}\sqrt{T}\right)$, while the previously known best regret bound was $\tilde{O}\left(\frac{1}{\epsilon}T^{\frac{2}{3}}\right)$.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CHRONOS: Temporally-Aware Multi-Agent Coordination for Evolving Data Marketplaces
cs.DB 2026-05 unverdicted novelty 5.0

CHRONOS is a three-layer system for evolving data marketplaces that applies neural-ODE temporal decay, changepoint-aware Shapley valuation, and EXP3-IX private coordination to achieve 0.937 recall, 2.74 qps, 161 ms la...