ImProver: Agent-Based Automated Proof Optimization

Jeremy Avigad; Prasad Tetali; Riyaz Ahuja; Sean Welleck

arxiv: 2410.04753 · v2 · pith:AZAG5ORXnew · submitted 2024-10-07 · 💻 cs.AI · cs.CL· cs.LG· cs.LO

ImProver: Agent-Based Automated Proof Optimization

Riyaz Ahuja , Jeremy Avigad , Prasad Tetali , Sean Welleck This is my paper

classification 💻 cs.AI cs.CLcs.LGcs.LO

keywords proofproofsimproveroptimizationautomatedleanrewritingarbitrary

0 comments

read the original abstract

Large language models (LLMs) have been used to generate formal proofs of mathematical theorems in proofs assistants such as Lean. However, we often want to optimize a formal proof with respect to various criteria, depending on its downstream use. For example, we may want a proof to adhere to a certain style, or to be readable, concise, or modularly structured. Having suitably optimized proofs is also important for learning tasks, especially since human-written proofs may not optimal for that purpose. To this end, we study a new problem of automated proof optimization: rewriting a proof so that it is correct and optimizes for an arbitrary criterion, such as length or readability. As a first method for automated proof optimization, we present ImProver, a large-language-model agent that rewrites proofs to optimize arbitrary user-defined metrics in Lean. We find that naively applying LLMs to proof optimization falls short, and we incorporate various improvements into ImProver, such as the use of symbolic Lean context in a novel Chain-of-States technique, as well as error-correction and retrieval. We test ImProver on rewriting real-world undergraduate, competition, and research-level mathematics theorems, finding that ImProver is capable of rewriting proofs so that they are substantially shorter, more modular, and more readable.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search
cs.LO 2026-05 unverdicted novelty 6.0

Lean Refactor uses retrieval from a curated multi-objective strategy database to guide frozen LLMs in refactoring Lean proofs, reporting over 70% token compression on benchmarks and improved version transfer.