Proximal policy optimization algorithms

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Tune to Learn: How Controller Gains Shape Robot Policy Learning

cs.RO · 2026-04-02 · conditional · novelty 7.0

Controller gains affect learnability differently for behavior cloning, RL from scratch, and sim-to-real transfer, so optimal gains depend on the learning paradigm rather than desired task behavior.

citing papers explorer

Showing 1 of 1 citing paper.

Tune to Learn: How Controller Gains Shape Robot Policy Learning cs.RO · 2026-04-02 · conditional · none · ref 23
Controller gains affect learnability differently for behavior cloning, RL from scratch, and sim-to-real transfer, so optimal gains depend on the learning paradigm rather than desired task behavior.

Proximal policy optimization algorithms

fields

years

verdicts

representative citing papers

citing papers explorer