pith. sign in

← back to paper

Review history

arxiv: 2604.04539 · 2 revisions

FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control

  1. 2026-05-19 UNVERDICTED LOW v0.9.0 novelty 6.0
    51101 ms 5805 in 1419 out 2026-05-19T17:04:49.619467+00:00
  2. 2026-05-10 UNVERDICTED LOW v0.9.0 novelty 6.0
    51175 ms 5574 in 1296 out 2026-05-10T19:59:18.599888+00:00