← back to paper
arxiv: 2605.11599 · 2 revisions
Targeted Tests for LLM Reasoning: An Audit-Constrained Protocol