llm-verified-with-monte-carlo-tree-search by namin

LLM code synthesis via Monte Carlo Tree Search

Created 2 years ago

284 stars

Top 92.1% on SourcePulse

Project Summary

This project addresses the challenge of synthesizing verified code by integrating Large Language Models (LLMs) with Monte Carlo Tree Search (MCTS) and formal verification. It enables weaker LLMs to compete with stronger ones by guiding generation with a verifier, targeting researchers and developers in program synthesis and formal methods.

How It Works

The core approach uses MCTS to explore the vast space of potential program generations. At each step of the search, a formal verifier (Dafny, Coq, Lean, Scala, or Rust) is invoked to check the partial program's correctness. This feedback loop prunes incorrect branches early, ensuring that only valid or promising code paths are pursued, leading to provably correct program synthesis.

Quick Start & Requirements

Install: Clone with --recurse-submodules, create a conda environment (conda create --name llm-verified python=3.10), activate it, and run pip install -r requirements.txt. Hugging Face authentication is required.
Prerequisites: GPU access (tested on NVIDIA A100s), Python 3.10, Conda, Hugging Face CLI, and potentially specific verifier installations (Dafny, Coq, Lean). Docker/Singularity images are available.
Links: Dockerfile, Docker Hub

Highlighted Details

Leverages MCTS for guided program synthesis.
Supports multiple formal verification backends: Dafny, Coq, Lean, Scala, Rust.
Achieved an Outstanding Paper Award at the Math-AI workshop co-located with NeurIPS 2024.
Includes trainers for PPO and DPO reinforcement learning methods.

Maintenance & Community

The project is associated with authors from institutions like Google DeepMind and the University of Cambridge. Further community engagement details are not explicitly provided in the README.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README text.

Limitations & Caveats

The project is described as a prototype. Setup requires significant technical expertise, including managing LLM dependencies, Hugging Face authentication, and potentially complex installations for formal verifiers like Coq or Lean. GPU resources are mandatory.

llm-verified-with-monte-carlo-tree-search by namin

Explore Similar Projects

supercharger by catid

LLM4SE by gai4se

llm-debugger-vscode-extension by mohsen1

Seed-Coder by ByteDance-Seed

LLMDebugger by FloridSleeves

cwm by facebookresearch

CodeTF by salesforce

LiveCodeBench by LiveCodeBench

OpenCodeInterpreter by OpenCodeInterpreter

Awesome-Code-LLM by codefuse-ai

AlphaCodium by Codium-ai

openevolve by algorithmicsuperintelligence