Review history

arxiv: 2604.10547 · 2 revisions

Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?

2026-05-14 UNVERDICTED LOW v0.9.0 novelty 7.0

37963 ms 5643 in 1318 out 2026-05-14T21:26:00.830667+00:00
2026-05-10 UNVERDICTED LOW v0.9.0 novelty 8.0

56841 ms 5624 in 1404 out 2026-05-10T16:18:34.749813+00:00