Review history

arxiv: 2605.11599 · 2 revisions

Targeted Tests for LLM Reasoning: An Audit-Constrained Protocol

  1. 2026-05-20 UNVERDICTED LOW v0.9.0 novelty 6.0
    60315 ms 5722 in 1201 out 2026-05-20T21:57:51.035214+00:00
  2. 2026-05-13 UNVERDICTED LOW v0.9.0 novelty 6.0
    52480 ms 5491 in 1376 out 2026-05-13T01:31:06.740966+00:00