Code LLM for code generation, completion, and reasoning tasks
Seed-Coder is a family of 8B-parameter code LLMs (base, instruct, and reasoning variants) from ByteDance Seed. It aims to improve coding capability by using LLMs to curate the models' own training data, which minimizes human effort. It targets developers and researchers who want powerful, lightweight, open-source code intelligence.
How It Works
Seed-Coder employs a "model-centric" data pipeline, leveraging LLMs for data filtering and curation from sources like GitHub, commits, and web data. This approach aims to reduce manual effort in pretraining data construction while achieving state-of-the-art performance for its size.
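The filtering idea can be illustrated with a minimal Python sketch. It is not Seed-Coder's actual pipeline: the `CodeSample` type, the `filter_samples` helper, the threshold, and the `toy_scorer` heuristic are all hypothetical, and the scorer is only a stand-in for the LLM-based quality judge the project describes.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class CodeSample:
    source: str  # e.g. "github", "commit", "web" (labels are illustrative)
    text: str


# Hypothetical scorer interface: maps a sample to a quality score in [0, 1].
# In a model-centric pipeline this role would be played by an LLM judge.
Scorer = Callable[[CodeSample], float]


def filter_samples(
    samples: Iterable[CodeSample], scorer: Scorer, threshold: float = 0.5
) -> List[CodeSample]:
    """Keep only the samples whose score reaches the threshold."""
    return [s for s in samples if scorer(s) >= threshold]


def toy_scorer(sample: CodeSample) -> float:
    """Stand-in heuristic, NOT a real quality model.

    Gives 0.6 for containing a function or class definition and
    0.4 more for containing at least one comment line.
    """
    lines = [ln.lstrip() for ln in sample.text.splitlines() if ln.strip()]
    score = 0.0
    if any(ln.startswith(("def ", "class ")) for ln in lines):
        score += 0.6
    if any(ln.startswith("#") for ln in lines):
        score += 0.4
    return score


if __name__ == "__main__":
    corpus = [
        CodeSample("github", "# add two numbers\ndef add(a, b):\n    return a + b\n"),
        CodeSample("web", "aaaa\nbbbb\n"),
        CodeSample("commit", ""),
    ]
    for sample in filter_samples(corpus, toy_scorer):
        print(sample.source)
```

Passing the scorer in as a plain callable keeps the filtering loop unchanged when the heuristic is swapped for a real model call.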
Quick Start & Requirements
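A minimal sketch of the vLLM path, assuming vLLM is installed and GPUs are available. The model ID below is an assumption and should be checked against the project's own documentation; the helper names `vllm_engine_kwargs` and `generate` are hypothetical. The engine arguments reflect the bfloat16 recommendation, tensor parallelism, and the 32K-token context mentioned in this section.

```python
# Assumed Hugging Face model ID; verify against the project's documentation.
MODEL_ID = "ByteDance-Seed/Seed-Coder-8B-Instruct"


def vllm_engine_kwargs(num_gpus: int = 1, max_context: int = 32768) -> dict:
    """Build keyword arguments for vllm.LLM (hypothetical helper)."""
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return {
        "model": MODEL_ID,
        "dtype": "bfloat16",               # recommended precision
        "tensor_parallel_size": num_gpus,  # shard the model across GPUs
        "max_model_len": max_context,      # up to 32K tokens
    }


def generate(prompt: str, num_gpus: int = 1, max_tokens: int = 256) -> str:
    """Run one greedy completion; vLLM is imported lazily so the helper
    above stays usable on machines without it."""
    from vllm import LLM, SamplingParams

    llm = LLM(**vllm_engine_kwargs(num_gpus))
    params = SamplingParams(temperature=0.0, max_tokens=max_tokens)
    outputs = llm.generate([prompt], params)
    return outputs[0].outputs[0].text


if __name__ == "__main__":
    print(generate("# Write a function that reverses a string\n", num_gpus=2))
```

The sketch sends a raw prompt; for the instruct variant, applying the model's chat template first is likely to give better results.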
Inference runs through transformers or vLLM.
Dependencies: torch (bfloat16 recommended) and transformers, plus vLLM for advanced deployment.
vLLM supports multi-GPU serving with tensor parallelism for long contexts (up to 32K tokens).
Highlighted Details
Works with vLLM for efficient inference and distributed serving.
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats