Review history

arxiv: 2603.19470 · 2 revisions

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

2026-05-21 UNVERDICTED LOW v0.9.0 novelty 5.0

28337 ms 5803 in 1190 out 2026-05-21T10:27:50.027693+00:00
2026-05-15 UNVERDICTED LOW v0.9.0 novelty 6.0

53252 ms 5572 in 1260 out 2026-05-15T08:01:49.847475+00:00