dgm by jennyzzt

Self-improving agent system

Created 7 months ago

1,788 stars

Top 23.9% on SourcePulse

View on GitHub

4 Experts Love This Project

Vincent Weisser

Cofounder of Prime Intellect

Coauthor of SWE-bench, SWE-agent

Project Summary

The Darwin Gödel Machine (DGM) project enables open-ended evolution of self-improving AI agents capable of iteratively modifying and validating their own code. It targets researchers and developers in AI, particularly those focused on artificial general intelligence and self-evolving systems, offering a framework for agents that enhance their own problem-solving abilities through empirical testing.

How It Works

DGM employs a meta-learning approach where an agent iteratively refines its codebase. This process involves generating code modifications, executing them within a sandboxed environment, and evaluating their performance against coding benchmarks like SWE-bench and Polyglot. The system is designed to foster emergent self-improvement by rewarding agents that successfully enhance their own capabilities.

Quick Start & Requirements

Install: Clone the repository, set up a Python virtual environment (python3 -m venv venv, source venv/bin/activate), and install dependencies (pip install -r requirements.txt).
Prerequisites: Docker must be installed and configured. API keys for OpenAI and Anthropic are required and should be exported as environment variables. For development analysis, graphviz and graphviz-dev are needed.
Setup: Cloning the repo and installing dependencies is estimated to take 5-15 minutes.
Links: SWE-bench, Polyglot

Highlighted Details

Implements an open-ended evolution framework for self-improving agents.
Utilizes coding benchmarks (SWE-bench, Polyglot) for empirical validation of code changes.
Leverages foundation models (via API keys) for code generation and self-modification.
Includes a detailed file structure for organizing agent code, benchmarks, and logs.

Maintenance & Community

The project is associated with researchers from Princeton University. Specific community channels or active maintenance signals are not detailed in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the provided README. Compatibility for commercial use or closed-source linking would require clarification of the licensing terms.

Limitations & Caveats

This project involves executing untrusted, model-generated code, posing inherent safety risks. While designed with safety in mind, generated code may exhibit unintended or destructive behavior due to model limitations. The project is experimental and may require significant computational resources.

Health Check

Last Commit

5 months ago

Responsiveness

1 week

Pull Requests (30d)

Issues (30d)

Star History

25 stars in the last 30 days