LeanDojo  by lean-dojo

Machine learning for theorem proving in Lean

Created 2 years ago
750 stars

Top 46.4% on SourcePulse

GitHubView on GitHub
Project Summary

LeanDojo is a Python library designed for machine learning-based theorem proving in the Lean 4 proof assistant. It enables researchers and developers to extract structured data from Lean projects and interact with the Lean environment programmatically, facilitating the development of AI-powered theorem proving tools.

How It Works

LeanDojo leverages a robust data extraction pipeline to parse Lean code, capturing intricate details like proof states, tactics, and premises. This structured data is then used to train and interact with language models, allowing for programmatic control and automation of theorem proving tasks. The library's design focuses on providing a comprehensive interface for machine learning models to understand and manipulate Lean proofs.

Quick Start & Requirements

  • Installation: pip install lean-dojo or pip install . from the Git repo.
  • Prerequisites: Git >= 2.25, Python 3.9 to < 3.13, wget, elan. A GitHub personal access token is required and should be set as the GITHUB_ACCESS_TOKEN environment variable.
  • Documentation: Getting Started, Demo, Full documentation.

Highlighted Details

  • Supports data extraction and programmatic interaction with Lean 4.
  • Provides datasets for theorem proving research (LeanDojo Benchmark, LeanDojo Benchmark 4).
  • Includes related projects like ReProver and a LeanDojo ChatGPT Plugin.
  • Cited in NeurIPS 2023 for its work on retrieval-augmented language models for theorem proving.

Maintenance & Community

  • Active development with contributions from multiple researchers.
  • Discussions and bug reporting are managed via GitHub Issues and Discussions.

Licensing & Compatibility

  • The project's license is not explicitly stated in the provided README. Compatibility for commercial use or closed-source linking is therefore undetermined.

Limitations & Caveats

  • The README mentions compatibility with Lean 4 v4.3.0-rc2 or later, with a legacy branch available for Lean 3. Ensure the correct Lean version is used.
Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
20 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Junyang Lin Junyang Lin(Core Maintainer at Alibaba Qwen).

InternLM-Math by InternLM

0.4%
532
Math LLM for bilingual reasoning tasks
Created 2 years ago
Updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), and
3 more.

DeepSeek-Math-V2 by deepseek-ai

0.8%
2k
LLM for self-verifiable mathematical reasoning
Created 1 month ago
Updated 1 month ago
Starred by Dan Abramov Dan Abramov(Core Contributor to React; Coauthor of Redux, Create React App), Thomas Wolf Thomas Wolf(Cofounder of Hugging Face), and
1 more.

LeanCopilot by lean-dojo

0.4%
1k
Copilot for theorem proving in Lean
Created 2 years ago
Updated 1 day ago
Feedback? Help us improve.