← back to paper
arxiv: 2604.27644 · 2 revisions
ANCORA: Learning to Question via Manifold-Anchored Self-Play for Verifiable Reasoning