|国家预印本平台
首页|Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving

Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving

Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving

来源:Arxiv_logoArxiv
英文摘要

Large Language Models (LLMs) have demonstrated formidable capabilities in solving mathematical problems, yet they may still commit logical reasoning and computational errors during the problem-solving process. Thus, this paper proposes a framework, MATH-VF, which includes a Formalizer and a Critic, for formally verifying the correctness of the solutions generated by large language models. Our framework first utilizes a Formalizer which employs an LLM to translate a natural language solution into a formal context. Afterward, our Critic (which integrates various external tools such as a Computer Algebra System and an SMT solver) evaluates the correctness of each statement within the formal context, and when a statement is incorrect, our Critic provides corrective feedback. We empirically investigate the effectiveness of MATH-VF in two scenarios: 1) Verification: MATH-VF is utilized to determine the correctness of a solution to a given problem. 2) Refinement: When MATH-VF identifies errors in the solution generated by an LLM-based solution generator for a given problem, it submits the corrective suggestions proposed by the Critic to the solution generator to regenerate the solution. We evaluate our framework on widely used mathematical benchmarks: MATH500 and ProcessBench, demonstrating the superiority of our approach over existing approaches.

Kuo Zhou、Lu Zhang

数学

Kuo Zhou,Lu Zhang.Step-Wise Formal Verification for LLM-Based Mathematical Problem Solving[EB/OL].(2025-05-27)[2025-07-02].https://arxiv.org/abs/2505.20869.点此复制

评论