pith. sign in

← back to paper

Review history

arxiv: 2604.07035 · 2 revisions

Unified Deployment-Aware Evaluation of Open Reasoning Language Models

  1. 2026-05-21 UNVERDICTED LOW v0.9.0 novelty 4.0
    51135 ms 5845 in 1149 out 2026-05-21T09:38:30.153040+00:00
  2. 2026-05-10 ACCEPT MODERATE v0.9.0 novelty 4.0
    61592 ms 5705 in 1227 out 2026-05-10T17:44:59.556352+00:00