DeepSeek-Coder by deepseek-ai

Code LLM for code completion and generation

created 1 year ago
21,953 stars

Top 1.9% on sourcepulse

Project Summary

DeepSeek Coder is a suite of open-source code language models trained from scratch on 2 trillion tokens, of which 87% is code and 13% is natural language. The models range from 1B to 33B parameters and are designed for project-level code completion and infilling with a 16K context window. The project reports state-of-the-art results among open-source code models on benchmarks such as HumanEval, MBPP, and DS-1000, which makes the models a fit for developers and researchers who need strong code generation.

How It Works

The models are pre-trained on a large corpus of code and natural language organized at the project level. A 16K context window and an additional fill-in-the-blank training task let the models complete and infill code using context from across a project, not just the current file. Instruction-tuned variants are also available for conversational coding assistance.
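
The fill-in-the-blank task can be illustrated with a small prompt builder. This is a minimal sketch, not the project's own code: the helper name `build_fim_prompt` is hypothetical, and the three sentinel strings are assumed to match the special tokens shown in the upstream README, so verify them against the tokenizer of the model you load.

```python
# Sentinel tokens for fill-in-the-middle prompts.
# Assumption: these match the special tokens of the DeepSeek Coder base
# tokenizer; check the tokenizer before relying on them.
FIM_BEGIN = "<｜fim▁begin｜>"
FIM_HOLE = "<｜fim▁hole｜>"
FIM_END = "<｜fim▁end｜>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Wrap the code before and after a gap so the model generates the middle."""
    return f"{FIM_BEGIN}{prefix}{FIM_HOLE}{suffix}{FIM_END}"


# The model is expected to produce the function body that belongs in the gap.
prompt = build_fim_prompt(
    prefix="def add(a, b):\n    ",
    suffix="\n\nprint(add(1, 2))\n",
)
```

The resulting string would be tokenized and passed to a base model; the text generated after the prompt is the candidate infill.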

Quick Start & Requirements

  • Install dependencies: pip install -r requirements.txt
  • Requires PyTorch and Hugging Face Transformers.
  • Demo available on Hugging Face Spaces.
  • Official models can be downloaded from Hugging Face Hub.
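
Once the dependencies are installed, loading a model from the Hugging Face Hub might look like the sketch below. The model ID and the helper name `complete` are assumptions chosen for illustration; pick the checkpoint size that fits your hardware. `trust_remote_code=True` mirrors the upstream README and may be unnecessary with recent Transformers releases.

```python
# Assumed Hub ID for the smallest base checkpoint; other sizes follow the
# same naming pattern.
MODEL_ID = "deepseek-ai/deepseek-coder-1.3b-base"


def complete(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 64) -> str:
    """Load the model and return the prompt plus its generated continuation."""
    # Imported lazily so this module can be imported without the heavy
    # dependencies being loaded up front.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("# write a quick sort algorithm\n")` downloads the weights on first use and runs on CPU unless the model and inputs are moved to a GPU.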

Highlighted Details

  • Outperforms existing open-source code LLMs on HumanEval, MBPP, and DS-1000 benchmarks.
  • 7B model achieves performance comparable to CodeLlama-34B.
  • 33B instruct model rivals GPT-3.5-turbo on HumanEval.
  • Supports over 100 programming languages.

Maintenance & Community

  • Active development and community support.
  • Links to Discord and WeChat for community interaction.
  • Resources available via awesome-deepseek-coder.

Licensing & Compatibility

  • Code repository licensed under MIT.
  • Model usage subject to a separate Model License.
  • Supports commercial use.

Limitations & Caveats

  • Quantizing to GGUF (llama.cpp) or GPTQ (exllamav2) requires extra setup steps and may depend on pull requests that are not yet merged upstream.
  • Instruct models can be used for plain code completion, but only after adjusting eos_token_id.
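
The eos_token_id adjustment can be sketched as a set of generation arguments. This is an illustration under stated assumptions: the helper name `completion_kwargs` is hypothetical, and the two token IDs are the values reported in the upstream README (32014 in place of the instruct default 32021), so confirm both against the configuration of the checkpoint you load.

```python
# Assumption: token IDs as reported in the upstream README; confirm them
# against the tokenizer and config of the instruct checkpoint in use.
COMPLETION_EOS_ID = 32014  # end-of-sentence token used for plain completion
INSTRUCT_EOS_ID = 32021    # default EOS in the instruct configuration


def completion_kwargs(max_new_tokens: int = 128,
                      eos_token_id: int = COMPLETION_EOS_ID) -> dict:
    """Build keyword arguments for `model.generate` with an overridden EOS."""
    return {
        "max_new_tokens": max_new_tokens,
        "do_sample": False,
        "eos_token_id": eos_token_id,
    }


kwargs = completion_kwargs()
# With a loaded instruct model: model.generate(**inputs, **kwargs)
```

Passing the ID at generation time leaves the saved model configuration untouched, so the same checkpoint can still be used for chat with its default settings.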

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 week
  • Pull requests (30d): 0
  • Issues (30d): 3

Star History

  • 711 stars in the last 90 days
