Large language model series by Qwen team, Alibaba Cloud
Top 1.7% on sourcepulse
Qwen3 is a series of large language models from Alibaba Cloud, offering dense and Mixture-of-Experts (MoE) architectures with enhanced reasoning, human alignment, and multilingual capabilities. It targets developers and researchers seeking advanced LLM performance for chat, coding, and agentic tasks.
How It Works
Qwen3 models feature a novel "thinking mode" that can be seamlessly switched with a general-purpose "non-thinking mode" via chat templates or specific instructions. This dual-mode approach optimizes performance for complex reasoning tasks (math, code) and efficient general chat, surpassing previous Qwen versions and other open-source models in benchmarks.
Quick Start & Requirements
pip install transformers>=4.51.0
llama.cpp>=b5092
ollama serve
(v0.6.6+)vllm>=0.8.5
sglang>=0.4.6.post1
mlx-lm>=0.24.0
(for Apple Silicon)Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
4 days ago
Inactive