Qwen3  by QwenLM

Large language model series by Qwen team, Alibaba Cloud

Created 1 year ago
24,598 stars

Top 1.6% on SourcePulse

GitHubView on GitHub
Project Summary

Qwen3 is a series of large language models from Alibaba Cloud, offering dense and Mixture-of-Experts (MoE) architectures with enhanced reasoning, human alignment, and multilingual capabilities. It targets developers and researchers seeking advanced LLM performance for chat, coding, and agentic tasks.

How It Works

Qwen3 models feature a novel "thinking mode" that can be seamlessly switched with a general-purpose "non-thinking mode" via chat templates or specific instructions. This dual-mode approach optimizes performance for complex reasoning tasks (math, code) and efficient general chat, surpassing previous Qwen versions and other open-source models in benchmarks.

Quick Start & Requirements

  • Transformers: pip install transformers>=4.51.0
  • llama.cpp: llama.cpp>=b5092
  • Ollama: ollama serve (v0.6.6+)
  • vLLM: vllm>=0.8.5
  • SGLang: sglang>=0.4.6.post1
  • MLX LM: mlx-lm>=0.24.0 (for Apple Silicon)
  • Documentation: EN|ZH
  • Models: Hugging Face or ModelScope

Highlighted Details

  • Supports dense and MoE models across various sizes (0.6B to 235B).
  • Excels in reasoning, human preference alignment, and multilingual tasks (100+ languages).
  • Strong agent capabilities with precise tool integration.
  • Offers OpenAI-compatible APIs via vLLM, SGLang, and Ollama.

Maintenance & Community

  • Active development by Alibaba Cloud's Qwen team.
  • Community channels: Discord, WeChat.

Licensing & Compatibility

  • Licensed under Apache 2.0, permitting commercial use and closed-source linking.

Limitations & Caveats

  • llama.cpp's chat template engine may encounter errors with Qwen3's specific template features; a fix is in progress.
  • Ollama's default context management settings may require adjustment for optimal Qwen3 performance.
Health Check
Last Commit

2 weeks ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
56
Star History
529 stars in the last 30 days

Explore Similar Projects

Starred by Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), Binyuan Hui Binyuan Hui(Research Scientist at Alibaba Qwen), and
3 more.

Lemur by OpenLemur

0.2%
556
Open language model for language agents
Created 2 years ago
Updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Simon Willison Simon Willison(Coauthor of Django), and
10 more.

Yi by 01-ai

0%
8k
Open-source bilingual LLMs trained from scratch
Created 1 year ago
Updated 9 months ago
Feedback? Help us improve.