Qwen3Guard  by QwenLM

Safety guardrails for LLM interactions

Created 3 weeks ago

New!

257 stars

Top 98.4% on SourcePulse

GitHubView on GitHub
Project Summary

Qwen3Guard is a series of multilingual safety moderation models designed to protect against harmful content in LLM applications. Targeting developers and researchers, it offers robust prompt and response analysis, with specialized variants for real-time, token-level monitoring during text generation, providing comprehensive, multi-language safety solutions.

How It Works

Qwen3Guard is built upon Qwen3 and trained on a large safety-labeled dataset. It comprises two main types: Qwen3Guard-Gen for static classification of prompts and responses, and Qwen3Guard-Stream for real-time, token-level safety assessment during incremental generation. The models support 119 languages and classify content into three severity levels (safe, controversial, unsafe) across nine defined safety categories, enabling adaptable risk management.

Quick Start & Requirements

Highlighted Details

  • Offers both prompt/response classification (-Gen) and real-time token-level streaming moderation (-Stream).
  • Supports 119 languages, providing broad multilingual safety coverage.
  • Classifies content into Safe, Controversial, and Unsafe severity levels across categories like Violent, PII, and Unethical Acts.
  • Claims state-of-the-art performance on safety benchmarks, with visual performance data provided.

Maintenance & Community

Licensing & Compatibility

  • The README does not explicitly state a software license. This absence poses a significant adoption blocker, requiring clarification for commercial use or integration into proprietary systems.

Limitations & Caveats

  • The Qwen3Guard-Stream model requires using the same tokenizer as Qwen3 for optimal performance; integration with different tokenizers necessitates re-tokenization.
  • Support for Qwen3Guard-Stream in vLLM and SGLang is listed as "coming soon."
  • The technical report and specific benchmark results are not directly embedded, requiring users to consult external documents.
Health Check
Last Commit

2 weeks ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
9
Star History
258 stars in the last 22 days

Explore Similar Projects

Starred by Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
18 more.

guardrails by guardrails-ai

0.7%
6k
Python framework for adding guardrails to LLMs
Created 2 years ago
Updated 5 days ago
Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), and
41 more.

guidance by guidance-ai

0.1%
21k
Guidance is a programming paradigm for steering LLMs
Created 2 years ago
Updated 1 day ago
Feedback? Help us improve.