arXiv preprint arXiv:2407.10040

Lean-STaR: Learning to Interleave Thinking and Proving · 2024 · arXiv 2407.10040

6 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Event-B Agent: Towards LLM Agent for Formal Model Synthesis and Repair

cs.SE · 2026-05-17 · unverdicted · novelty 7.0

Event-B Agent is an LLM agent that synthesizes, refines, and repairs Event-B formal models from natural language requirements via iterative verification feedback loops.

CAM-Bench: A Benchmark for Computational and Applied Mathematics in Lean

cs.AI · 2026-05-17 · accept · novelty 7.0

CAM-Bench is a new Lean 4 theorem-proving benchmark of 1,000 problems in computational and applied mathematics, built from textbook exercises using a dependency-recovery pipeline to reconstruct local context.

OProver: A Unified Framework for Agentic Formal Theorem Proving

cs.CL · 2026-05-17 · unverdicted · novelty 6.0

OProver-32B achieves top Pass@32 scores on MiniF2F, ProverBench, and PutnamBench by combining continued pretraining with iterative agentic proving, retrieval, SFT on repairs, and RL on unresolved cases using a 6.86M-proof dataset.

Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

Segment-level supervision extracts coherent proof segments to train policy models that achieve 61-66% success on miniF2F, outperforming step-level and whole-proof methods while also improving existing provers.

Intent-aligned Formal Specification Synthesis via Traceable Refinement

cs.LG · 2026-04-12 · unverdicted · novelty 6.0

VeriSpecGen uses traceable refinement to synthesize intent-aligned Lean specifications from natural language, reaching 86.6% on the VERINA SpecGen task and producing 343K training trajectories that improve downstream models by 62-106%.

Aristotle: IMO-level Automated Theorem Proving

cs.AI · 2025-10-01 · unverdicted · novelty 6.0

Aristotle reaches gold-medal-equivalent performance on 2025 IMO problems via integrated Lean proof search, informal lemma formalization, and a dedicated geometry solver.

citing papers explorer


Event-B Agent: Towards LLM Agent for Formal Model Synthesis and Repair · cs.SE · 2026-05-17 · unverdicted · none · ref 30
CAM-Bench: A Benchmark for Computational and Applied Mathematics in Lean · cs.AI · 2026-05-17 · accept · none · ref 19
OProver: A Unified Framework for Agentic Formal Theorem Proving · cs.CL · 2026-05-17 · unverdicted · none · ref 158
Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving · cs.AI · 2026-05-12 · unverdicted · none · ref 18
Intent-aligned Formal Specification Synthesis via Traceable Refinement · cs.LG · 2026-04-12 · unverdicted · none · ref 2
Aristotle: IMO-level Automated Theorem Proving · cs.AI · 2025-10-01 · unverdicted · none · ref 24
