pith. sign in

You may not need ratio clipping in PPO.arXiv preprint arXiv:2202.00079, 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

KLip-PPO: A per-sample KL perspective on PPO-Clip

cs.LG · 2026-06-22 · unverdicted · novelty 7.0

PPO-Clip gradient equals a per-sample KL surrogate with closed-form coefficient on importance ratio and advantage, yielding identical curves on five MuJoCo tasks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • KLip-PPO: A per-sample KL perspective on PPO-Clip cs.LG · 2026-06-22 · unverdicted · none · ref 17

    PPO-Clip gradient equals a per-sample KL surrogate with closed-form coefficient on importance ratio and advantage, yielding identical curves on five MuJoCo tasks.