Simple statistical gradient-following algorithms for connectionist reinforce- ment learning.Machine learning, 8:229–256

Ronald J Williams · 1992

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

On the Sample Complexity of Differentially Private Policy Optimization

cs.LG · 2025-10-24 · unverdicted · novelty 7.0

Differential privacy in policy optimization adds sample complexity costs that often appear as lower-order terms rather than dominating the bounds.

When Policy Entropy Constraint Fails: Preserving Diversity in Flow-based RLHF via Perceptual Entropy

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

Policy entropy remains constant in flow-matching models during RLHF due to fixed noise schedules while perceptual diversity collapses from mode-seeking policy gradients, so perceptual entropy constraints are introduced to preserve diversity and improve quality.

citing papers explorer

Showing 2 of 2 citing papers.

On the Sample Complexity of Differentially Private Policy Optimization cs.LG · 2025-10-24 · unverdicted · none · ref 1
Differential privacy in policy optimization adds sample complexity costs that often appear as lower-order terms rather than dominating the bounds.
When Policy Entropy Constraint Fails: Preserving Diversity in Flow-based RLHF via Perceptual Entropy cs.CV · 2026-05-12 · unverdicted · none · ref 67
Policy entropy remains constant in flow-matching models during RLHF due to fixed noise schedules while perceptual diversity collapses from mode-seeking policy gradients, so perceptual entropy constraints are introduced to preserve diversity and improve quality.

Simple statistical gradient-following algorithms for connectionist reinforce- ment learning.Machine learning, 8:229–256

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer