Formal Theorem Generation via MCTS with LLM-Guided Process Optimization

Lei Wang, Zhengfeng Yang*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

While large language models (LLMs) have shown great potential in formal theorem proving tasks, their performance is limited by the scarcity of formal theorem data. To address this issue, we propose a formal theorem generation method aimed at providing high-quality fine-tuning training data in Lean4 for LLMs. We propose a Training-Free Policy-Value framework for Monte Carlo Tree Search (MCTS-TFPV). We introduce LLM-generated policy probabilities combined with dynamic temperature annealing to optimize the search process, along with a rule-based value function to dynamically evaluate the quality of generated theorems. Our method replaces traditional policy and value networks with a rule-based design, optimizing the theorem generation process to significantly reduce computational costs while enhancing the quality of generated theorems. Experiments are conducted on Llama3-8B and Qwen2.5-7B, with comprehensive comparisons against traditional methods. The results show that our method significantly improves performance in formal theorem generation tasks, offering an efficient and scalable solution for LLMs in formal mathematical tasks.

Original languageEnglish
Title of host publicationAdvanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings
EditorsDe-Shuang Huang, Yijie Pan, Wei Chen, Haiming Chen
PublisherSpringer Science and Business Media Deutschland GmbH
Pages55-65
Number of pages11
ISBN (Print)9789819698486
DOIs
StatePublished - 2025
Event21st International Conference on Intelligent Computing, ICIC 2025 - Ningbo, China
Duration: 26 Jul 202529 Jul 2025

Publication series

NameLecture Notes in Computer Science
Volume15851 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Intelligent Computing, ICIC 2025
Country/TerritoryChina
CityNingbo
Period26/07/2529/07/25

Keywords

  • Formal theorem generation
  • Large language models
  • MCTS

Fingerprint

Dive into the research topics of 'Formal Theorem Generation via MCTS with LLM-Guided Process Optimization'. Together they form a unique fingerprint.

Cite this