Review history

arxiv: 2605.11599 · 2 revisions

Targeted Tests for LLM Reasoning: An Audit-Constrained Protocol

2026-05-20 UNVERDICTED LOW v0.9.0 novelty 6.0
60315 ms 5722 in 1201 out 2026-05-20T21:57:51.035214+00:00

2026-05-13 UNVERDICTED LOW v0.9.0 novelty 6.0
52480 ms 5491 in 1376 out 2026-05-13T01:31:06.740966+00:00