pith. sign in

← back to paper

Review history

arxiv: 2605.06638 · 3 revisions

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

  1. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 6.0
    38525 ms 5870 in 1357 out 2026-05-20T22:32:39.924890+00:00
  2. 2026-05-12 UNVERDICTED LOW v0.9.0 novelty 7.0
    32909 ms 5636 in 1343 out 2026-05-12T03:05:35.344614+00:00
  3. 2026-05-08 UNVERDICTED LOW v0.9.0 novelty 7.0
    34375 ms 5579 in 1484 out 2026-05-08T09:31:01.296057+00:00