pith. machine review for the scientific record.

Review history

arXiv: 2605.06785 · 2 revisions

Distributional Process Reward Models: Calibrated Prediction of Future Rewards via Conditional Optimal Transport

  1. 2026-05-13 · UNVERDICTED · LOW · v0.9.0 · novelty 6.0
    22265 ms · 5496 in · 1173 out · 2026-05-13T06:21:26.571631+00:00
  2. 2026-05-11 · UNVERDICTED · LOW · v0.9.0 · novelty 7.0
    51552 ms · 5496 in · 1337 out · 2026-05-11T01:13:15.016208+00:00