Differentially Private Multi-Armed Bandits in the Shuffle Model

Haim Kaplan; Jay Tenenbaum; Uri Stemmer; Yishay Mansour

arxiv: 2106.02900 · v3 · pith:HFDHF3FSnew · submitted 2021-06-05 · 💻 cs.LG · cs.CR

Differentially Private Multi-Armed Bandits in the Shuffle Model

Jay Tenenbaum , Haim Kaplan , Yishay Mansour , Uri Stemmer This is my paper

classification 💻 cs.LG cs.CR

keywords deltafracmodelleftregretrightsqrtvarepsilon

0 comments

read the original abstract

We give an $(\varepsilon,\delta)$-differentially private algorithm for the multi-armed bandit (MAB) problem in the shuffle model with a distribution-dependent regret of $O\left(\left(\sum_{a\in [k]:\Delta_a>0}\frac{\log T}{\Delta_a}\right)+\frac{k\sqrt{\log\frac{1}{\delta}}\log T}{\varepsilon}\right)$, and a distribution-independent regret of $O\left(\sqrt{kT\log T}+\frac{k\sqrt{\log\frac{1}{\delta}}\log T}{\varepsilon}\right)$, where $T$ is the number of rounds, $\Delta_a$ is the suboptimality gap of the arm $a$, and $k$ is the total number of arms. Our upper bound almost matches the regret of the best known algorithms for the centralized model, and significantly outperforms the best known algorithm in the local model.

This paper has not been read by Pith yet.

Differentially Private Multi-Armed Bandits in the Shuffle Model

discussion (0)