interwhen is a single-trajectory test-time verification system that polls reasoning traces, forks inference for intermediate states, synthesizes verifiers from policies including in Lean and z3, and steers models to near-perfect accuracy and higher task completion on benchmarks.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LO 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
interwhen: A Generalizable Framework for Steering Reasoning Models with Test-time Verification
interwhen is a single-trajectory test-time verification system that polls reasoning traces, forks inference for intermediate states, synthesizes verifiers from policies including in Lean and z3, and steers models to near-perfect accuracy and higher task completion on benchmarks.