Quick answer

Paper2025-07-08•Source ↗•10 attns0 checkouts

Claim

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Authors

Discuss with Grok

Zhongyuan Peng·

Yifan Yao·

Kaijing Ma·

Shuyue Guo·

Yizhe Li·

Yichi Zhang·

Chenchen Zhang·

Yifan Zhang·

Zhouliang Yu·

Luming Li·

Minghao Liu·

Yihang Xia·

Jiawei Shen·

Yuchen Wu·

Yixin Cao·

Zhaoxiang Zhang·

Wenhao Huang·

Jiaheng Liu·

Ge Zhang

ABSTRACT

Translating natural language mathematical statements into formal, executable code is a fundamental challenge in automated theorem proving. While prior work has focused on generation and compilation success, little attention has been paid to the critic phase-the evaluation of whether generated formalizations truly capture the semantic intent of the original problem. In this paper, we introduce CriticLean, a novel critic-guided reinforcement learning framework that elevates the role of the critic from a passive validator to an active learning component. Specifically, first, we propose the CriticLeanGPT, trained via supervised fine-tuning and reinforcement learning, to rigorously assess the semantic fidelity of Lean 4 formalizations. Then, we introduce CriticLeanBench, a benchmark designed to measure models' ability to distinguish semantically correct from incorrect formalizations, and demonstrate that our trained CriticLeanGPT models can significantly outperform strong open- and closed-source baselines. Building on the CriticLean framework, we construct FineLeanCorpus, a dataset comprising over 285K problems that exhibits rich domain diversity, broad difficulty coverage, and high correctness based on human evaluation. Overall, our findings highlight that optimizing the critic phase is essential for producing reliable formalizations, and we hope our CriticLean will provide valuable insights for future advances in formal mathematical reasoning.

#computer-version/year/2025 #deep-learning/month/202507 #llm/paper/year/2025 #llm/month/202507 #computer-version #multimodal-model #computer-version/month/202507 #llm/paper/month/202507 #llm/paper #deep-learning/from/bytedance-research #deep-learning/year/2025 #llm/year/2025 #world-model #deep-learning #llm ByteDance Research

Review Snapshot

Explore ratings

0.0

★★★★★

0 ratings

5 star

4 star

3 star

2 star

1 star

Recommendation

recommend this content.

Review this content

Share your opinion to help other learners triage faster.

Write a review

Invite a reviewer

Invite someone by email to share an invited review for CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization.

Author Inquiries

Public questions about this content. Attendemia will route your question to the author. Vote on the most important ones. No guarantee of response.

Post an inquiry

Sort by: Most helpful