ReProver by lean-dojo

Theorem prover for Lean, augmented by retrieval

Created 2 years ago

315 stars

Top 85.8% on SourcePulse

Project Summary

ReProver provides retrieval-augmented language models for theorem proving in Lean 4. To generate correct and relevant tactics, it augments each proof state with premises found by a retrieval system, with the aim of more efficient and accurate automated theorem proving. The project targets researchers and developers working on formal verification and automated reasoning systems.

How It Works

ReProver employs a two-stage approach. First, a premise retriever, based on a ByT5 encoder, embeds proof states and a corpus of relevant mathematical premises into vector representations. Cosine similarity is used to find the most relevant premises for a given proof state. Second, a tactic generator, a ByT5 encoder-decoder model, takes the proof state concatenated with the retrieved premises as input to generate a sequence of Lean tactics. This retrieval-augmented strategy aims to improve the quality and relevance of generated tactics by providing contextual information.
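The retrieval stage described above can be illustrated with a minimal sketch. It is not ReProver's implementation: the three-dimensional vectors are toy stand-ins for encoder embeddings, the helper names (`cosine_similarity`, `retrieve_premises`, `augment_state`) are hypothetical, and the layout of the augmented input (premises placed before the proof state) is an assumption.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as unrelated to everything
    return dot / (norm_u * norm_v)


def retrieve_premises(state_vec, premise_vecs, k=2):
    """Return the names of the k premises most similar to the proof state."""
    ranked = sorted(
        premise_vecs,
        key=lambda name: cosine_similarity(state_vec, premise_vecs[name]),
        reverse=True,
    )
    return ranked[:k]


def augment_state(state, premises):
    """Concatenate retrieved premises with the proof state (assumed layout)."""
    return "\n".join(premises) + "\n" + state


# Toy embeddings; a real system would obtain these from the retriever's encoder.
premise_vecs = {
    "Nat.add_comm": [0.9, 0.1, 0.0],
    "Nat.mul_comm": [0.1, 0.9, 0.0],
    "List.length_append": [0.0, 0.1, 0.9],
}
state_vec = [1.0, 0.2, 0.0]

top = retrieve_premises(state_vec, premise_vecs, k=2)
prompt = augment_state("a b : Nat ⊢ a + b = b + a", top)
```

In this sketch, `prompt` is the text a tactic generator would receive: the two highest-scoring premise names followed by the proof state.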

Quick Start & Requirements

Installation: Create a conda environment (conda create --yes --name ReProver python=3.11 ipython), activate it (conda activate ReProver), and install dependencies (pip install torch ...). Prepend the repo's root to PYTHONPATH.
Data: Download LeanDojo Benchmark 4 (python scripts/download_data.py), trace repos (python scripts/trace_repos.py), and log in to Weights & Biases (wandb login).
Prerequisites: Python 3.11, PyTorch (with CUDA support), transformers, deepspeed, pytorch-lightning, wandb, openai, rank_bm25, lean-dojo, vllm.
Resources: Training is set up for an NVIDIA A100 GPU with 80 GB of memory; on smaller GPUs, the batch size and gradient accumulation may need to be adjusted.
Documentation: LeanDojo Website, Hugging Face Models
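The trade-off between batch size and gradient accumulation mentioned under Resources is simple arithmetic, sketched below. The function name and the numbers are hypothetical and are not taken from ReProver's configuration files; the only assumption is that the effective batch size is the per-device batch size times the number of devices times the number of accumulation steps.

```python
def accumulation_steps(target_batch, per_device_batch, num_devices=1):
    """Accumulation steps needed so one optimizer update sees target_batch examples."""
    per_step = per_device_batch * num_devices
    if per_step <= 0:
        raise ValueError("per_device_batch and num_devices must be positive")
    # Ceiling division: round up so the effective batch is never below the target.
    return -(-target_batch // per_step)


# Hypothetical case: a batch of 8 fit on the large GPU, but only 2 fit on a smaller one.
steps = accumulation_steps(target_batch=8, per_device_batch=2)
effective_batch = 2 * 1 * steps
```

With PyTorch Lightning, the resulting value would typically be passed to the trainer's `accumulate_grad_batches` setting.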

Highlighted Details

Offers pre-trained ByT5 models for tactic generation and premise retrieval on Hugging Face.
Supports training custom premise retrievers and tactic generators using Lightning CLI.
Includes scripts for evaluating premise retrieval (R@1, R@10, MRR) and theorem proving performance.
Provides utilities to convert PyTorch Lightning checkpoints to Hugging Face format for easier integration.
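For readers unfamiliar with the retrieval metrics named above, the sketch below shows one common way to compute recall at k (R@k) and mean reciprocal rank (MRR). It is an illustration under standard textbook definitions, not the project's evaluation script; the function names and the toy queries are hypothetical.

```python
def recall_at_k(ranked, relevant, k):
    """Fraction of the relevant premises that appear in the top k retrieved."""
    if not relevant:
        return 0.0
    hits = sum(1 for premise in ranked[:k] if premise in relevant)
    return hits / len(relevant)


def reciprocal_rank(ranked, relevant):
    """1 / rank of the first relevant premise, or 0.0 if none was retrieved."""
    for rank, premise in enumerate(ranked, start=1):
        if premise in relevant:
            return 1.0 / rank
    return 0.0


def evaluate(queries, k_values=(1, 10)):
    """Average R@k and MRR over (ranked_list, relevant_set) pairs."""
    n = len(queries)
    metrics = {
        f"R@{k}": sum(recall_at_k(r, g, k) for r, g in queries) / n
        for k in k_values
    }
    metrics["MRR"] = sum(reciprocal_rank(r, g) for r, g in queries) / n
    return metrics


# Two toy queries: retrieved ranking and the set of ground-truth premises.
queries = [
    (["p1", "p2", "p3"], {"p1", "p3"}),
    (["p4", "p5", "p6"], {"p5"}),
]
metrics = evaluate(queries)
```

Here the first query finds one of its two ground-truth premises at rank 1, and the second finds its only premise at rank 2, which is what drives the difference between R@1 and MRR.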

Maintenance & Community

The project is part of the LeanDojo initiative, whose paper appeared at NeurIPS 2023.
Questions and discussions are handled via GitHub Discussions. Bug reports should be filed as GitHub Issues.

Licensing & Compatibility

The README does not explicitly state a license. The associated LeanDojo project uses an Apache 2.0 license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The main branch supports Lean 4; Lean 3 support is available on a legacy branch.
Training and evaluation procedures are complex, requiring significant setup and computational resources.
The README notes that generated tactics may contain <a> and </a> markers, which must be removed before the tactics are used.
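Removing the markers mentioned in the last caveat can be done with a small post-processing step. The sketch below assumes the markers appear literally as `<a>` and `</a>` inside the generated tactic string; the function name `strip_premise_markers` and the sample tactic are hypothetical.

```python
import re

# Matches both the opening <a> and the closing </a> marker.
_MARKER = re.compile(r"</?a>")


def strip_premise_markers(tactic):
    """Remove <a> and </a> markers, leaving the enclosed text in place."""
    return _MARKER.sub("", tactic)


cleaned = strip_premise_markers("rw [<a>Nat.add_comm</a>]")
```

Only the markers themselves are deleted, so the text between them stays in the tactic.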

Health Check

Last Commit: 11 months ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0
Star History: 6 stars in the last 30 days

Explore Similar Projects

Starred by Victor Taelin (Author of Bend, Kind, HVM) and Magnus Müller (Cofounder of Browser Use).

Kimina-Prover-Preview by MoonshotAI

Research paper for formal reasoning model in Lean 4

Created 9 months ago

Updated 6 months ago

Raspberry by daveshap

Open-source dataset for finetuning LLMs with reasoning

Created 1 year ago

Updated 1 year ago

Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA).

marc by ekinakyurek

Research paper implementation for abstract reasoning via test-time training

Created 1 year ago

Updated 2 months ago

chain-of-draft by sileix

Research paper code for efficient LLM reasoning

Created 10 months ago

Updated 10 months ago

Starred by Vincent Weisser (Cofounder of Prime Intellect) and Jeff Hammerbacher (Cofounder of Cloudera).

discovering_latent_knowledge by collin-burns

Research paper code for unsupervised discovery of latent knowledge in LLMs

Created 3 years ago

Updated 1 year ago

Starred by Casper Hansen (Author of AutoAWQ).

Plan-and-Solve-Prompting by AGI-Edgerunners

Research paper code for improved zero-shot chain-of-thought reasoning

Created 2 years ago

Updated 2 years ago

Starred by Wing Lian (Founder of Axolotl AI).

Logic-LLM by teacherpeterpan

Logic-LM: Framework for improved logical reasoning via LLMs and symbolic solvers

Created 2 years ago

Updated 1 year ago

Starred by Benjamin Bolte (Cofounder of K-Scale Labs).

DeepSeek-Prover-V2 by deepseek-ai

LLM for formal theorem proving in Lean 4, initialized with DeepSeek-V3 data

Created 8 months ago

Updated 5 months ago

Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), Eric Zhu (Coauthor of AutoGen; Research Scientist at Microsoft Research), and 7 more.

reasoning-gym by open-thought

Procedural dataset generator for reasoning models

Created 11 months ago

Updated 3 weeks ago

Starred by Dan Abramov (Core Contributor to React; Coauthor of Redux, Create React App) and Edward Sun (Research Scientist at Meta Superintelligence Lab).

LeanDojo by lean-dojo

Machine learning for theorem proving in Lean

Created 2 years ago

Updated 1 week ago

train-deepseek-r1 by FareedKhan-dev

Replicate DeepSeek R1 LLM training from scratch

Created 11 months ago

Updated 9 months ago

DeepSeek-Prover-V1.5 by deepseek-ai

Theorem prover for formal mathematics using Lean 4

Created 1 year ago

Updated 1 year ago
