Discover and explore top open-source AI tools and projects—updated daily.
R2E-GymScaling open-weight SWE agents with procedural environments and hybrid verifiers
Top 99.3% on SourcePulse
R2E-Gym addresses the challenge of scaling open-weight Software Engineering (SWE) agents by providing a large-scale, procedurally generated environment and novel verification strategies. It targets researchers and developers aiming to enhance open-weight SWE agent performance, enabling state-of-the-art results competitive with proprietary models.
How It Works
The framework introduces SWE-GEN, a synthetic data curation recipe generating executable training environments from commits, enhancing scalability. Hybrid Test-time Scaling combines execution-based and execution-free verifiers to optimize inference-time compute for superior performance.
Quick Start & Requirements
uv (install via curl -LsSf https://astral.sh/uv/install.sh | sh), Python virtual environment setup (uv venv, source .venv/bin/activate), and dependency installation (uv sync && uv pip install -e .).claude-3-5-sonnet-20241022, gpt-4o).Highlighted Details
Maintenance & Community
Associated with UC Berkeley and ANU researchers (Naman Jain, Jaskirat Singh, Manish Shetty, Liang Zheng, Koushik Sen, Ion Stoica). No specific community channels (Discord, Slack), roadmap links, or active maintenance signals beyond the research publication are detailed.
Licensing & Compatibility
The provided README content does not specify a software license. Clarification is needed regarding terms of use, distribution, and compatibility for commercial or closed-source applications.
Limitations & Caveats
Executable gym instances require substantial disk space (300MB-500MB each). Setup involves multiple command-line steps. Explicit details on alpha/beta status, unsupported platforms, or known bugs are absent. Access to specific LLMs or API keys is implicitly required for agent functionality.
8 months ago
Inactive
allenai
SWE-Gym
hkust-nlp
harbor-framework