Autoformalization with Large Language Models

Yuhuai Wu; Albert Qiaochu Jiang; Wenda Li; Markus Rabe; Charles Staats; Mateja Jamnik; Christian Szegedy

2022 NIPS NeurIPS 2022

Autoformalization with Large Language Models

Abstract

Autoformalization is the process of automatically translating from natural language mathematics to formal specifications and proofs. A successful autoformalization system could advance the fields of formal verification, program synthesis, and artificial intelligence.While the long-term goal of autoformalization seemed elusive for a long time, we show large language models provide new prospects towards this goal. We make the surprising observation that LLMs can correctly translate a significant portion ($25.3\%$) of mathematical competition problems perfectly to formal specifications in Isabelle/HOL. We demonstrate the usefulness of this process by improving a previously introduced neural theorem prover via training on these autoformalized theorems. Our methodology results in a new state-of-the-art result on the MiniF2F theorem proving benchmark, improving the proof rate from~$29.6\%$ to~$35.2\%$.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — proof synthesis

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yuhuai Wu , Albert Qiaochu Jiang , Wenda Li , Markus Rabe , Charles Staats , Mateja Jamnik , Christian Szegedy

Topics

Artificial Intelligence > Core AI > Foundation Models Natural Language Processing > Applications > Machine Reading Comprehension Natural Language Processing > Resources & Methods > Large Language Models Deep Learning > Models > Large Language Models

Keywords

theorem proving formal verification large language model neural theorem prover proof assistant proof synthesis

Download PDF

Related papers

Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching 2022

A Theoretical View on Sparsely Activated Networks 2022

Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks 2022

Matryoshka Representation Learning 2022

Off-Policy Evaluation with Deficient Support Using Side Information 2022