pith. machine review for the scientific record.

Review history

arxiv: 2605.05812 · 2 revisions

Long-Horizon Q-Learning: Accurate Value Learning via n-Step Inequalities

  1. 2026-05-12 · UNVERDICTED · LOW · v0.9.0 · novelty 6.0
    54150 ms · 5514 in · 1225 out · 2026-05-12T05:00:20.063346+00:00
  2. 2026-05-08 · UNVERDICTED · LOW · v0.9.0 · novelty 6.0
    34611 ms · 5514 in · 1181 out · 2026-05-08T11:17:24.214196+00:00