SWE-agent  by SWE-agent

Agent for automated software engineering (NeurIPS 2024)

Created 1 year ago
17,387 stars

Top 2.7% on SourcePulse

GitHubView on GitHub
Project Summary

SWE-agent automates software engineering tasks by enabling large language models (LLMs) to interact with isolated computer environments. It targets developers and cybersecurity professionals, offering solutions for bug fixing, web tasks, and offensive cybersecurity challenges, with state-of-the-art performance on benchmarks like SWE-bench.

How It Works

SWE-agent utilizes configurable Agent-Computer Interfaces (ACIs) to provide LLMs with tools and access to isolated environments. This architecture allows models like GPT-4o or Claude 3.7 to execute commands, interact with web services, and debug code, facilitating autonomous task completion. The approach is advantageous for its flexibility in integrating various LLMs and its robust environment isolation for safe execution.

Quick Start & Requirements

  • Install: pip install swe-agent
  • Prerequisites: Python 3.8+, an LLM API key (e.g., OpenAI, Anthropic).
  • Resources: Requires access to LLM APIs. Setup is generally quick, depending on LLM access.
  • Docs: Documentation, Installation, Hello World

Highlighted Details

  • State-of-the-art (SoTA) performance on SWE-bench (full and light) with Claude 3.7.
  • Dedicated mode (EnIGMA) for offensive cybersecurity CTF challenges, achieving SoTA on multiple benchmarks.
  • Includes features like a debugger, server connection tools, and output summarizer.
  • Supports custom tasks beyond software engineering and cybersecurity.

Maintenance & Community

  • Developed by researchers from Princeton University and Stanford University.
  • Active development with recent releases (v1.0).
  • Community support via Discord.

Licensing & Compatibility

  • License: MIT.
  • Compatibility: Permissive MIT license allows for commercial use and integration into closed-source projects.

Limitations & Caveats

  • The EnIGMA mode is currently being updated for SWE-agent 1.0; use version 0.7 for EnIGMA capabilities.
  • Performance is dependent on the chosen LLM and its API access.
Health Check
Last Commit

2 days ago

Responsiveness

1 day

Pull Requests (30d)
8
Issues (30d)
14
Star History
360 stars in the last 30 days

Explore Similar Projects

Starred by Dan Guido Dan Guido(Cofounder of Trail of Bits), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
1 more.

cai by aliasrobotics

3.6%
4k
Cybersecurity AI (CAI) is an open framework for building AI-driven cybersecurity tools
Created 5 months ago
Updated 1 day ago
Starred by Luis Capelo Luis Capelo(Cofounder of Lightning AI), Addy Osmani Addy Osmani(Head of Chrome Developer Experience at Google), and
23 more.

goose by block

0.8%
20k
Open-source AI agent for automating complex engineering tasks
Created 1 year ago
Updated 20 hours ago
Feedback? Help us improve.