TY - GEN
T1 - Formal Theorem Generation via MCTS with LLM-Guided Process Optimization
AU - Wang, Lei
AU - Yang, Zhengfeng
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - While large language models (LLMs) have shown great potential in formal theorem proving tasks, their performance is limited by the scarcity of formal theorem data. To address this issue, we propose a formal theorem generation method aimed at providing high-quality fine-tuning training data in Lean4 for LLMs. We propose a Training-Free Policy-Value framework for Monte Carlo Tree Search (MCTS-TFPV). We introduce LLM-generated policy probabilities combined with dynamic temperature annealing to optimize the search process, along with a rule-based value function to dynamically evaluate the quality of generated theorems. Our method replaces traditional policy and value networks with a rule-based design, optimizing the theorem generation process to significantly reduce computational costs while enhancing the quality of generated theorems. Experiments are conducted on Llama3-8B and Qwen2.5-7B, with comprehensive comparisons against traditional methods. The results show that our method significantly improves performance in formal theorem generation tasks, offering an efficient and scalable solution for LLMs in formal mathematical tasks.
AB - While large language models (LLMs) have shown great potential in formal theorem proving tasks, their performance is limited by the scarcity of formal theorem data. To address this issue, we propose a formal theorem generation method aimed at providing high-quality fine-tuning training data in Lean4 for LLMs. We propose a Training-Free Policy-Value framework for Monte Carlo Tree Search (MCTS-TFPV). We introduce LLM-generated policy probabilities combined with dynamic temperature annealing to optimize the search process, along with a rule-based value function to dynamically evaluate the quality of generated theorems. Our method replaces traditional policy and value networks with a rule-based design, optimizing the theorem generation process to significantly reduce computational costs while enhancing the quality of generated theorems. Experiments are conducted on Llama3-8B and Qwen2.5-7B, with comprehensive comparisons against traditional methods. The results show that our method significantly improves performance in formal theorem generation tasks, offering an efficient and scalable solution for LLMs in formal mathematical tasks.
KW - Formal theorem generation
KW - Large language models
KW - MCTS
UR - https://www.scopus.com/pages/publications/105012095881
U2 - 10.1007/978-981-96-9849-3_5
DO - 10.1007/978-981-96-9849-3_5
M3 - 会议稿件
AN - SCOPUS:105012095881
SN - 9789819698486
T3 - Lecture Notes in Computer Science
SP - 55
EP - 65
BT - Advanced Intelligent Computing Technology and Applications - 21st International Conference, ICIC 2025, Proceedings
A2 - Huang, De-Shuang
A2 - Pan, Yijie
A2 - Chen, Wei
A2 - Chen, Haiming
PB - Springer Science and Business Media Deutschland GmbH
T2 - 21st International Conference on Intelligent Computing, ICIC 2025
Y2 - 26 July 2025 through 29 July 2025
ER -