Review history

arxiv: 2512.00920 · 2 revisions

Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios

  1. 2026-05-21 UNVERDICTED LOW v0.9.0 novelty 6.0
    27949 ms 5728 in 1187 out 2026-05-21T18:22:06.615100+00:00
  2. 2026-05-17 UNVERDICTED LOW v0.9.0 novelty 5.0
    35127 ms 5497 in 1397 out 2026-05-17T02:53:18.202770+00:00