pith. machine review for the scientific record.

Review history

arxiv: 2604.10547 · 2 revisions

Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?

  1. 2026-05-14 UNVERDICTED LOW v0.9.0 novelty 7.0
    37963 ms 5643 in 1318 out 2026-05-14T21:26:00.830667+00:00
  2. 2026-05-10 UNVERDICTED LOW v0.9.0 novelty 8.0
    56841 ms 5624 in 1404 out 2026-05-10T16:18:34.749813+00:00