TY - GEN
T1 - SolSearch
T2 - 47th IEEE/ACM International Conference on Software Engineering: New Ideas and Emerging Results, ICSE-NIER 2025
AU - Sheng, Junjie
AU - Lin, Yanqiu
AU - Wu, Jiehao
AU - Huang, Yanhong
AU - Shi, Jianqi
AU - Zhang, Min
AU - Wang, Xiangfeng
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - The Satisfiability (SAT) problem is a core challenge with significant applications in software engineering, including automated testing, configuration management, and program verification. This paper presents SolSearch, a novel framework that harnesses large language models (LLMs) to discover and optimize SAT-solving strategies automatically. Leveraging a curriculum-based, trial-and-error process, SolSearch enables the LLM to iteratively modify and generate SAT solver code, thereby improving solving efficiency and performance. This automated SAT-solving paradigm has the advantage of being plug-and-play, allowing integration with any SAT solver and accelerating the development or design process of new SAT solvers (new methods). Our preliminary experimental results are encouraging by demonstrating that the LLM-powered paradigm improves state-of-the-art SAT solvers on general SAT benchmarks and significantly enhances the performance of the widely used Z3 solver (11% on PAR-2 score). These results highlight the potential for using LLM-driven methods to advance solver adaptability and effectiveness in real-world software engineering challenges. Future research directions are discussed to further refine and validate this approach, offering a promising avenue for integrating AI with traditional software engineering tasks.
AB - The Satisfiability (SAT) problem is a core challenge with significant applications in software engineering, including automated testing, configuration management, and program verification. This paper presents SolSearch, a novel framework that harnesses large language models (LLMs) to discover and optimize SAT-solving strategies automatically. Leveraging a curriculum-based, trial-and-error process, SolSearch enables the LLM to iteratively modify and generate SAT solver code, thereby improving solving efficiency and performance. This automated SAT-solving paradigm has the advantage of being plug-and-play, allowing integration with any SAT solver and accelerating the development or design process of new SAT solvers (new methods). Our preliminary experimental results are encouraging by demonstrating that the LLM-powered paradigm improves state-of-the-art SAT solvers on general SAT benchmarks and significantly enhances the performance of the widely used Z3 solver (11% on PAR-2 score). These results highlight the potential for using LLM-driven methods to advance solver adaptability and effectiveness in real-world software engineering challenges. Future research directions are discussed to further refine and validate this approach, offering a promising avenue for integrating AI with traditional software engineering tasks.
KW - Code Generation
KW - Heuristic Method
KW - Large Language Models (LLM)
KW - SAT Solver
UR - https://www.scopus.com/pages/publications/105008498097
U2 - 10.1109/ICSE-NIER66352.2025.00007
DO - 10.1109/ICSE-NIER66352.2025.00007
M3 - 会议稿件
AN - SCOPUS:105008498097
T3 - Proceedings - International Conference on Software Engineering
SP - 6
EP - 10
BT - Proceedings - 2025 IEEE/ACM 47th International Conference on Software Engineering
PB - IEEE Computer Society
Y2 - 27 April 2025 through 3 May 2025
ER -