pith. sign in

← back to paper

Review history

arxiv: 2604.28005 · 2 revisions

Kernelized Advantage Estimation: From Nonparametric Statistics to LLM Reasoning

  1. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 6.0
    58211 ms 5758 in 1221 out 2026-05-20T23:44:40.873348+00:00
  2. 2026-05-07 UNVERDICTED LOW v0.9.0 novelty 6.0
    62808 ms 5555 in 1376 out 2026-05-07T06:54:54.394441+00:00