Theorem prover for formal mathematics using Lean 4
Top 60.4% on sourcepulse
DeepSeek-Prover-V1.5 is an open-source language model for theorem proving in Lean 4, designed to improve upon its predecessor by optimizing training and inference. It targets researchers and practitioners in formal mathematics and AI, offering state-of-the-art performance on benchmarks like miniF2F and ProofNet.
How It Works
The model is pre-trained on formal mathematical languages and fine-tuned with supervised learning and reinforcement learning from proof assistant feedback (RLPAF). A key innovation is RMaxTS, a Monte-Carlo tree search variant employing intrinsic-reward-driven exploration for diverse proof path generation, moving beyond single-pass proof generation.
Quick Start & Requirements
pip install -r requirements.txt
), set up Lean 4, build Mathlib4.python -m prover.launch --config=configs/RMaxTS.py --log_dir=logs/RMaxTS_results
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The model is specifically designed for Lean 4 theorem proving and may not generalize to other formal languages or domains without further adaptation. Performance claims are based on specific benchmark evaluations.
11 months ago
Inactive