Learn- ing quadrupedal locomotion via differentiable simulation.arXiv preprint arXiv:2404.02887

· 2025 · arXiv 2404.02887

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

cs.RO · 2026-05-26 · unverdicted · novelty 4.0

SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.

Does "Do Differentiable Simulators Give Better Policy Gradients?'' Give Better Policy Gradients?

cs.LG · 2026-04-20 · unverdicted · novelty 4.0

In policy gradient RL, careful variance control and simple estimator switching frequently outperform explicit discontinuity detection even when using differentiable simulators.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient cs.RO · 2026-05-26 · unverdicted · none · ref 18
SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.
Does "Do Differentiable Simulators Give Better Policy Gradients?'' Give Better Policy Gradients? cs.LG · 2026-04-20 · unverdicted · none · ref 5
In policy gradient RL, careful variance control and simple estimator switching frequently outperform explicit discontinuity detection even when using differentiable simulators.

Learn- ing quadrupedal locomotion via differentiable simulation.arXiv preprint arXiv:2404.02887

fields

years

verdicts

representative citing papers

citing papers explorer