In policy gradient RL, careful variance control and simple estimator switching frequently outperform explicit discontinuity detection even when using differentiable simulators.
Analytical derivatives of rigid body dynamics algorithms
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Does "Do Differentiable Simulators Give Better Policy Gradients?'' Give Better Policy Gradients?
In policy gradient RL, careful variance control and simple estimator switching frequently outperform explicit discontinuity detection even when using differentiable simulators.