Towards Differentially Private Reinforcement Learning with General Function Approximation

· 2026 · cs.LG · arXiv 2605.07049

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We present the first theoretical guarantees for differentially private online reinforcement learning (RL) with general function approximation, extending beyond prior work restricted to tabular and linear settings. Our approach combines a batched policy update scheme with the exponential mechanism, together with a novel regret analysis. We show that, even under general function approximation, the regret in the model-free setting under differential privacy matches the state of the art for the linear case, scaling as $\widetilde{O}(K^{3/5})$, where $K$ denotes the number of episodes. As an important by-product, we also establish the first regret bound for online RL with batch update that depends on the standard complexity measure of coverability, complementing existing results based on a newly introduced Eluder-Condition class. In addition, we uncover fundamental gaps in recent results for private RL with linear function approximation, thereby clarifying its landscape.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

When Determinants Are Not Enough: Private Rare Switching

cs.LG · 2026-05-22 · unverdicted · novelty 5.0

Replaces determinant growth with generalized Rayleigh quotient for rare switching in private linear bandits to control worst-direction volume despite non-monotonic design matrices from noise.

citing papers explorer

Showing 1 of 1 citing paper.

When Determinants Are Not Enough: Private Rare Switching cs.LG · 2026-05-22 · unverdicted · none · ref 2 · internal anchor
Replaces determinant growth with generalized Rayleigh quotient for rare switching in private linear bandits to control worst-direction volume despite non-monotonic design matrices from noise.

Towards Differentially Private Reinforcement Learning with General Function Approximation

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer