Introduces LCS-Bench, a theory-scale benchmark covering 327 textbook items and 4,076 Lean declarations, with evaluations showing state-of-the-art models reach only 20.1% on auto-formalization tasks.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Theory-Scale Auto-Formalization of Logics for Computer Science
Introduces LCS-Bench, a theory-scale benchmark covering 327 textbook items and 4,076 Lean declarations, with evaluations showing state-of-the-art models reach only 20.1% on auto-formalization tasks.