Auto-GPT-Benchmarks by Significant-Gravitas

Deprecated repo for agent performance benchmarking

Created 3 years ago

276 stars

Top 93.6% on SourcePulse

View on GitHub

3 Experts Love This Project

Gabriel Almeida

Cofounder of Langflow

Nir Gazit

Cofounder of Traceloop

Tomas Valenta

Cofounder of E2B

Project Summary

This repository is deprecated and has been superseded by the main Auto-GPT repository's benchmark folder. It was designed to objectively benchmark the performance of AI agents across categories like code generation, information retrieval, memory management, and safety, aiming to save users time and money through automation.

How It Works

The project aimed to provide automated, objective performance metrics for AI agents. It allowed users to compare different agent setups and implementations based on quantifiable results in key operational areas.

Quick Start & Requirements

This repository is deprecated. For updated benchmarks, refer to the benchmark folder within the main Auto-GPT repository: https://github.com/Significant-Gravitas/AutoGPT.

Highlighted Details

Provides objective performance metrics for AI agents.
Covers categories such as code generation, retrieval, memory, and safety.
Aims to automate benchmarking for time and cost savings.
Includes rankings and detailed results for agents like Beebot, mini-agi, and Auto-GPT.

Maintenance & Community

This repository is marked as deprecated. Further development and community engagement are likely focused on the main Auto-GPT repository.

Licensing & Compatibility

The license is not specified in the provided README excerpt. Compatibility for commercial use or closed-source linking would require checking the license of the main Auto-GPT repository.

Limitations & Caveats

The repository is explicitly deprecated, meaning it is no longer actively maintained or updated. Users should consult the successor repository for current benchmarking information and tools.

Health Check

Last Commit

2 years ago

Responsiveness

1+ week

Pull Requests (30d)

Issues (30d)

Star History

0 stars in the last 30 days