QwQ by QwenLM

Reasoning model for complex problem-solving, based on Qwen2.5

created 4 months ago
511 stars

Top 62.0% on sourcepulse

View on GitHub
Project Summary

QwQ is a reasoning-specialized large language model series from Alibaba Cloud's Qwen team, designed for complex problem-solving tasks. It aims to outperform traditional instruction-tuned models by leveraging advanced reasoning and critical thinking, making it suitable for researchers and developers tackling challenging NLP applications.

How It Works

QwQ is built on the Qwen2.5 architecture and optimized specifically for reasoning. The model emits its chain of thought before the final answer, beginning its output with "<think>\n" to separate the reasoning steps from the response. The README recommends specific sampling parameters (Temperature=0.6, TopP=0.95, TopK=40) and advises against greedy decoding, which can lead to endless repetition. For long contexts, it supports YaRN scaling, configurable via rope_scaling in config.json.
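As a sketch, the rope_scaling entry mentioned above might look like the following in config.json. The factor and original_max_position_embeddings values follow the QwQ-32B model card, but treat them as assumptions to verify against the official README:

```json
{
  "rope_scaling": {
    "factor": 4.0,
    "original_max_position_embeddings": 32768,
    "type": "yarn"
  }
}
```

Note that frameworks with static YaRN (such as vLLM) apply this scaling to all inputs regardless of length, which is why the README flags a possible performance impact on shorter texts.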

Quick Start & Requirements

  • Hugging Face Transformers: Install with pip install transformers. Requires transformers>=4.37.0.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_name = "Qwen/QwQ-32B"
    model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")
    tokenizer = AutoTokenizer.from_pretrained(model_name)

    # Build a chat prompt and generate with the recommended sampling settings
    # (standard Transformers chat API; adjust max_new_tokens to your needs).
    messages = [{"role": "user", "content": "How many r's are in the word \"strawberry\"?"}]
    text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    inputs = tokenizer([text], return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, do_sample=True, temperature=0.6, top_p=0.95, top_k=40, max_new_tokens=2048)
    print(tokenizer.decode(output_ids[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))

  • Ollama: ollama run hf.co/Qwen/QwQ-32B-GGUF:Q4_K_M
  • Llama.cpp: Requires GGUF model files.
    ./llama-cli --model QwQ-32B-GGUF/qwq-32b-q4_k_m.gguf --threads 32 --ctx-size 32768 --temp 0.6 --top-p 0.95 --prompt "<|im_start|>user\nHow many r's are in the word \"strawberry\"<|im_end|>\n<|im_start|>assistant\n<think>\n"
    
  • API: Alibaba Cloud Model Studio API.
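Whichever backend is used, the output contains the model's reasoning trace before the final answer, closed by a literal "</think>" marker. A minimal sketch of separating the two, assuming that marker appears verbatim in the decoded text:

```python
def split_reasoning(output: str) -> tuple[str, str]:
    """Split model output into (reasoning, final_answer).

    Assumes the reasoning block ends with a literal "</think>" marker,
    as QwQ emits; if the marker is absent, treat everything as answer.
    """
    marker = "</think>"
    if marker in output:
        reasoning, _, answer = output.partition(marker)
        return reasoning.strip(), answer.strip()
    return "", output.strip()

reasoning, answer = split_reasoning("Let me count the r's...</think>\nThere are 3 r's.")
print(answer)  # the final answer, with the reasoning trace removed
```

Parsing on the marker rather than on token IDs keeps the helper backend-agnostic (Transformers, Ollama, and llama.cpp all return plain text).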

Highlighted Details

  • QwQ-32B competes with top-tier reasoning models like DeepSeek-R1 and o1-mini.
  • Supports YaRN for long context handling (e.g., 8192+ tokens) with specific configuration.
  • Provides detailed usage guidelines for optimal performance, including prompt standardization for math and multiple-choice questions.
  • Offers GGUF versions for local inference via Ollama and Llama.cpp.
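The prompt-standardization guideline above can be sketched as a small helper. The instruction strings below mirror the README's recommendations for math problems (boxed final answers) and multiple-choice questions (a JSON "answer" field), but verify the exact wording against the official usage guidelines:

```python
def standardize_prompt(question: str, kind: str = "math") -> str:
    """Append the recommended instruction for math or multiple-choice prompts.

    Instruction wording follows the QwQ usage guidelines; treat it as an
    assumption to check against the official README.
    """
    if kind == "math":
        suffix = "Please reason step by step, and put your final answer within \\boxed{}."
    elif kind == "multiple-choice":
        suffix = ('Please show your choice in the answer field with only '
                  'the choice letter, e.g., "answer": "C".')
    else:
        raise ValueError(f"unknown prompt kind: {kind}")
    return f"{question}\n\n{suffix}"

print(standardize_prompt("What is 2 + 2?"))
```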

Maintenance & Community

  • Developed by the Qwen team at Alibaba Cloud.
  • Community links: Hugging Face, ModelScope, Blog, Demo, WeChat, Discord.
  • API service available via Alibaba Cloud Model Studio.

Licensing & Compatibility

  • License details are not explicitly stated in the README, but usage is governed by Usage Guidelines.
  • Compatibility for commercial use is not specified.

Limitations & Caveats

  • Users encountering performance issues or endless repetitions should consult the Usage Guidelines.
  • vLLM's static YaRN implementation may impact performance on shorter texts.
  • The README mentions a potential KeyError: 'qwen2' with transformers<4.37.0.
Health Check

  • Last commit: 4 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 2
  • Star History: 32 stars in the last 90 days

Explore Similar Projects

Starred by Patrick von Platen (Core Contributor to Hugging Face Transformers and Diffusers), Julien Chaumond (Cofounder of Hugging Face), and 1 more.

question_generation by patil-suraj — Question generation study using transformers (1k stars, created 5 years ago, updated 1 year ago)