Privately aligning language models with reinforcement learning. arXiv preprint arXiv:2310.16960, 2023

Fan Wu, Huseyin A Inan, Arturs Backurs, Varun Chandrasekaran, Janardhan Kulkarni, Robert Sim · 2023 · arXiv 2310.16960

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

On the Sample Complexity of Differentially Private Policy Optimization

cs.LG · 2025-10-24 · unverdicted · novelty 7.0

Differential privacy in policy optimization adds sample complexity costs that often appear as lower-order terms rather than dominating the bounds.

Autonomy Reshapes How Personalization Affects Privacy Concerns and Trust in LLM Agents

cs.HC · 2025-10-06 · conditional · novelty 5.0

A 3x3 between-subjects experiment finds that risk-contingent autonomy in LLM agents attenuates personalization's negative effects on privacy concerns and trust via increased perceived control.

citing papers explorer

Showing 2 of 2 citing papers.

On the Sample Complexity of Differentially Private Policy Optimization

cs.LG · 2025-10-24 · unverdicted · none · ref 16

Differential privacy in policy optimization adds sample complexity costs that often appear as lower-order terms rather than dominating the bounds.

Autonomy Reshapes How Personalization Affects Privacy Concerns and Trust in LLM Agents

cs.HC · 2025-10-06 · conditional · none · ref 102

A 3x3 between-subjects experiment finds that risk-contingent autonomy in LLM agents attenuates personalization's negative effects on privacy concerns and trust via increased perceived control.
