Code generation model surpassing GPT-4 Turbo
Top 42.8% on sourcepulse
AutoCoder is a suite of large language models designed for code generation tasks, offering superior performance on benchmarks like HumanEval compared to leading proprietary models. It targets developers and researchers seeking advanced code generation capabilities, with a key benefit being its integrated, intelligent code interpreter that can automatically manage package installations and code execution.
How It Works
AutoCoder models are fine-tuned on extensive code datasets, with specific versions based on CodeQwen1.5 and DeepSeek Coder. A distinguishing feature is its code interpreter, which, unlike some competitors, can automatically install necessary Python packages and execute code iteratively until it functions correctly. This approach enhances the model's utility for tasks requiring external libraries or complex execution environments.
Quick Start & Requirements
conda create -n AutoCoder python=3.11
), activate it (conda activate AutoCoder
), and install dependencies (pip install -r requirements.txt
).Evaluation
directory and run python test_humaneval.py
for HumanEval benchmarks, python postprocess_mbpp.py
for MBPP, and python test_ds1000.py
for DS-1000.pip install gradio==3.48.0
) and run python chatbot.py
from the /Web_demo
directory.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project's licensing is not clearly defined, which may impact commercial adoption. While performance on benchmarks is highlighted, real-world usability and robustness of the automatic code execution feature require further evaluation.
1 year ago
1 day