arXiv preprint arXiv:2510.06857 , year=

Guo, Q · 2025 · arXiv 2510.06857

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Beyond the Library: An Agentic Framework for Autoformalizing Research Mathematics

cs.AI · 2026-06-30 · conditional · novelty 7.0

Agentic LLM framework autoformalizes 32 Putnam problems and main theorems plus proofs from five STOC papers into Lean 4, with two proofs using only kernel axioms.

The Signal-Coverage Matrix: Stratifying Type and Semantic Errors in Statement Autoformalization

cs.CL · 2026-06-26 · unverdicted · novelty 6.0

The signal-coverage matrix stratifies autoformalization outputs into true success, type-only, semantic-only, and both-fail cells, showing type-correctness gains are mostly type-stratum recovery with semantic errors largely unchanged.

OProver: A Unified Framework for Agentic Formal Theorem Proving

cs.CL · 2026-05-17 · unverdicted · novelty 6.0

OProver-32B achieves top Pass@32 scores on MiniF2F, ProverBench, and PutnamBench by combining continued pretraining with iterative agentic proving, retrieval, SFT on repairs, and RL on unresolved cases using a 6.86M-proof dataset.

citing papers explorer

Showing 3 of 3 citing papers.

Beyond the Library: An Agentic Framework for Autoformalizing Research Mathematics cs.AI · 2026-06-30 · conditional · none · ref 15
Agentic LLM framework autoformalizes 32 Putnam problems and main theorems plus proofs from five STOC papers into Lean 4, with two proofs using only kernel axioms.
The Signal-Coverage Matrix: Stratifying Type and Semantic Errors in Statement Autoformalization cs.CL · 2026-06-26 · unverdicted · none · ref 22
The signal-coverage matrix stratifies autoformalization outputs into true success, type-only, semantic-only, and both-fail cells, showing type-correctness gains are mostly type-stratum recovery with semantic errors largely unchanged.
OProver: A Unified Framework for Agentic Formal Theorem Proving cs.CL · 2026-05-17 · unverdicted · none · ref 164
OProver-32B achieves top Pass@32 scores on MiniF2F, ProverBench, and PutnamBench by combining continued pretraining with iterative agentic proving, retrieval, SFT on repairs, and RL on unresolved cases using a 6.86M-proof dataset.

arXiv preprint arXiv:2510.06857 , year=

fields

years

verdicts

representative citing papers

citing papers explorer