dgm  by jennyzzt

Self-improving agent system

created 2 months ago
1,540 stars

Top 27.5% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

The Darwin Gödel Machine (DGM) project enables open-ended evolution of self-improving AI agents capable of iteratively modifying and validating their own code. It targets researchers and developers in AI, particularly those focused on artificial general intelligence and self-evolving systems, offering a framework for agents that enhance their own problem-solving abilities through empirical testing.

How It Works

DGM employs a meta-learning approach where an agent iteratively refines its codebase. This process involves generating code modifications, executing them within a sandboxed environment, and evaluating their performance against coding benchmarks like SWE-bench and Polyglot. The system is designed to foster emergent self-improvement by rewarding agents that successfully enhance their own capabilities.

Quick Start & Requirements

  • Install: Clone the repository, set up a Python virtual environment (python3 -m venv venv, source venv/bin/activate), and install dependencies (pip install -r requirements.txt).
  • Prerequisites: Docker must be installed and configured. API keys for OpenAI and Anthropic are required and should be exported as environment variables. For development analysis, graphviz and graphviz-dev are needed.
  • Setup: Cloning the repo and installing dependencies is estimated to take 5-15 minutes.
  • Links: SWE-bench, Polyglot

Highlighted Details

  • Implements an open-ended evolution framework for self-improving agents.
  • Utilizes coding benchmarks (SWE-bench, Polyglot) for empirical validation of code changes.
  • Leverages foundation models (via API keys) for code generation and self-modification.
  • Includes a detailed file structure for organizing agent code, benchmarks, and logs.

Maintenance & Community

The project is associated with researchers from Princeton University. Specific community channels or active maintenance signals are not detailed in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the provided README. Compatibility for commercial use or closed-source linking would require clarification of the licensing terms.

Limitations & Caveats

This project involves executing untrusted, model-generated code, posing inherent safety risks. While designed with safety in mind, generated code may exhibit unintended or destructive behavior due to model limitations. The project is experimental and may require significant computational resources.

Health Check
Last commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
1,560 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.