Discover and explore top open-source AI tools and projects—updated daily.
ByteDance-SeedCode LLM for code generation, completion, and reasoning tasks
Top 55.9% on SourcePulse
Seed-Coder is a family of 8B parameter code LLMs (base, instruct, reasoning) from ByteDance Seed, designed to enhance coding capabilities by using LLMs to curate their own training data, minimizing human effort. It targets developers and researchers seeking powerful, lightweight, open-source code intelligence solutions.
How It Works
Seed-Coder employs a "model-centric" data pipeline, leveraging LLMs for data filtering and curation from sources like GitHub, commits, and web data. This approach aims to reduce manual effort in pretraining data construction while achieving state-of-the-art performance for its size.
Quick Start & Requirements
transformers or vLLM.torch (bfloat16 recommended), transformers, vLLM for advanced deployment.vLLM supports multi-GPU and tensor parallelism for long contexts (up to 32K tokens).Highlighted Details
vLLM for efficient inference and distributed serving.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
5 months ago
1 day
bigcode-project
SafeAILab
QwenLM
meta-llama
deepseek-ai