pith. machine review for the scientific record. sign in

← back to paper

Review history

arxiv: 2605.09923 · 2 revisions

expo: Exploration-prioritized policy optimization via adaptive kl regulation and gaussian curriculum sampling

  1. 2026-05-14 UNVERDICTED LOW v0.9.0 novelty 4.0
    32560 ms 5595 in 1223 out 2026-05-14T22:07:29.679461+00:00
  2. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 4.0
    62418 ms 5623 in 1527 out 2026-05-12T04:15:45.088843+00:00