Recognition: unknown
Differentially Private Policy Evaluation
classification
💻 cs.LG
stat.ML
keywords
algorithmsdifferentiallypolicyprivacyprivateachievinganalysisapply
read the original abstract
We present the first differentially private algorithms for reinforcement learning, which apply to the task of evaluating a fixed policy. We establish two approaches for achieving differential privacy, provide a theoretical analysis of the privacy and utility of the two algorithms, and show promising results on simple empirical examples.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Differential Privacy in the Extensive-Form Bandit Problem
An algorithm achieves Õ(√(A ln(S) T)/ε) regret for extensive-form bandits under ε-local differential privacy, claimed as the first such result.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.