Discover and explore top open-source AI tools and projects—updated daily.
Large language models for long context and reasoning
Top 44.2% on SourcePulse
Seed-OSS is a series of open-source large language models developed by ByteDance's Seed Team, offering powerful long-context, reasoning, agent, and general capabilities. Optimized for international use cases, it provides flexible control over the "thinking budget" for dynamic reasoning length adjustment, enhanced reasoning and agentic intelligence, and native support for up to 512K context length. The models are released under the Apache-2.0 license, making them suitable for both research and development.
How It Works
Seed-OSS models utilize a causal language model architecture incorporating RoPE, GQA attention, RMSNorm, and SwiGLU activation. A key feature is the "thinking budget," allowing users to control the model's reasoning depth. The model can dynamically adjust its chain-of-thought (CoT) process, reflecting on token usage and remaining budget, which can improve efficiency and performance on complex tasks.
Quick Start & Requirements
pip install -r requirements.txt
and pip install git+ssh://git@github.com/Fazziekey/transformers.git@seed-oss
.generate.py
script or vLLM for faster inference. Quantization options (4-bit, 8-bit) are available to reduce memory usage.MODEL_CARD
and inference scripts are available in the repository.Highlighted Details
Maintenance & Community
The project is developed by the ByteDance Seed Team, founded in 2023 with a focus on advanced AI foundation models. Further community engagement details (e.g., Discord/Slack) are not explicitly mentioned in the README.
Licensing & Compatibility
Limitations & Caveats
The README notes that for optimal performance with the "thinking budget" feature, users are advised to use values that are integer multiples of 512. Some benchmark results are presented as reproduced or reported, with specific details on evaluation configurations provided for certain tasks.
3 days ago
Inactive