mini-swe-agent  by SWE-agent

AI agent for solving GitHub issues and command-line tasks

Created 8 months ago
2,988 stars

Top 15.7% on SourcePulse

GitHubView on GitHub
Project Summary

This project provides a minimalist, 100-line AI agent designed for researchers and developers to benchmark, fine-tune, or deploy AI coding assistants with minimal overhead. It achieves 65% accuracy on the SWE-bench benchmark using Claude Sonnet 4, offering a simple, hackable alternative to more complex agent frameworks.

How It Works

mini-SWE-agent leverages a radically simple design, eschewing complex tools and stateful shell sessions. It relies solely on bash commands executed via subprocess.run, allowing any language model to interact with the environment. This approach simplifies sandboxing, debugging, and fine-tuning by maintaining a linear execution history and ensuring each action is independent.

Quick Start & Requirements

  • Install: pip install uv && uvx mini-swe-agent or pip install pipx && pipx ensurepath && pipx run mini-swe-agent
  • Prerequisites: Python. No specific model or API keys are mandated by the agent itself, but a compatible LLM is required for operation.
  • Docs: Quick start

Highlighted Details

  • Scores 65% on the verified SWE-bench benchmark.
  • Minimalist design: ~100 lines of Python code.
  • No tools beyond bash; relies on LLM's ability to use the shell.
  • Supports local execution and containerized environments (Docker, Podman).

Maintenance & Community

Developed by the team behind SWE-bench and SWE-agent from Princeton and Stanford. Further details on community and contribution can be found in the project's documentation.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README. Users should verify licensing for commercial or closed-source use.

Limitations & Caveats

The agent's functionality is limited to what can be achieved through bash commands, meaning complex operations requiring specific tools or APIs must be handled by the LLM's ability to generate appropriate shell commands.

Health Check
Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
41
Issues (30d)
19
Star History
384 stars in the last 30 days

Explore Similar Projects

Starred by Dan Abramov Dan Abramov(Core Contributor to React; Coauthor of Redux, Create React App), Gabriel Almeida Gabriel Almeida(Cofounder of Langflow), and
9 more.

terminal-bench by laude-institute

1.4%
2k
Benchmark for LLM agents in real terminal environments
Created 1 year ago
Updated 1 month ago
Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind), and
27 more.

goose by block

1.6%
31k
Open-source AI agent for automating complex engineering tasks
Created 1 year ago
Updated 17 hours ago
Feedback? Help us improve.