Qwen3-Embedding by QwenLM

Text embedding and reranking model

created 1 month ago · 1,138 stars · Top 34.4% on sourcepulse

View on GitHub
Project Summary

The Qwen3 Embedding series is a suite of open-weight text embedding and reranking models designed for diverse NLP tasks such as retrieval, classification, and clustering. Targeting developers and researchers, it delivers state-of-the-art performance across multiple benchmarks, building on the multilingual and long-text understanding capabilities of the Qwen3 foundation models.

How It Works

This series builds on the dense Qwen3 foundation models, offering embedding and reranking checkpoints from 0.6B to 8B parameters. A key feature is support for Matryoshka Representation Learning (MRL), which lets users truncate output vectors to a custom dimension. The models are also "instruction aware": prepending a task-specific instruction to the input can boost performance, and English instructions are recommended even for multilingual use.
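As a rough illustration of how MRL support can be used downstream, the sketch below truncates pre-computed embeddings to a smaller dimension and re-normalizes them; the 256-dimension cut-off and the plain-NumPy post-processing are illustrative assumptions, not the project's official API.

```python
import numpy as np

def truncate_embeddings(embeddings: np.ndarray, dim: int = 256) -> np.ndarray:
    """Keep the first `dim` components of MRL-trained embeddings and re-normalize.

    MRL training concentrates the most useful information in the leading
    dimensions, so truncation trades a small quality loss for smaller vectors.
    """
    truncated = embeddings[:, :dim]
    norms = np.linalg.norm(truncated, axis=1, keepdims=True)
    return truncated / np.clip(norms, 1e-12, None)

# Example: shrink hypothetical 1024-dim vectors to 256 dims before indexing.
full = np.random.randn(4, 1024).astype(np.float32)
small = truncate_embeddings(full, dim=256)
print(small.shape)  # (4, 256)
```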

Quick Start & Requirements

  • Transformers: pip install "transformers>=4.51.0"
  • Sentence Transformers: pip install "sentence-transformers>=2.7.0"
  • vLLM: pip install "vllm>=0.8.5"
  • Dependencies: PyTorch; CUDA recommended for acceleration.
  • Usage: examples are provided for the Transformers, vLLM, and Sentence Transformers libraries (see the sketch after this list).
  • Docs: Hugging Face, ModelScope, Blog, arXiv
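A minimal retrieval sketch with Sentence Transformers is shown below. The Qwen/Qwen3-Embedding-0.6B model ID and the built-in "query" prompt follow the project's published examples, but treat them as assumptions and confirm against the Hugging Face model card.

```python
from sentence_transformers import SentenceTransformer, util

# Smallest embedding checkpoint; the 4B/8B variants trade speed for quality.
model = SentenceTransformer("Qwen/Qwen3-Embedding-0.6B")

queries = ["What is the capital of China?"]
documents = [
    "The capital of China is Beijing.",
    "Gravity is a force that attracts two bodies towards each other.",
]

# Queries use the model's built-in retrieval instruction; documents are encoded as-is.
query_embeddings = model.encode(queries, prompt_name="query")
document_embeddings = model.encode(documents)

# Cosine similarity between every query and every document.
print(util.cos_sim(query_embeddings, document_embeddings))
```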

Highlighted Details

  • The 8B embedding model ranked #1 on the MTEB multilingual leaderboard with a score of 70.58 (as of June 5, 2025).
  • Supports over 100 languages, including programming languages, for robust multilingual, cross-lingual, and code retrieval.
  • Embedding and reranking models come in 0.6B, 4B, and 8B parameter sizes, balancing efficiency and effectiveness.
  • The instruction-aware design allows customization for specific tasks, languages, or scenarios, typically improving performance by 1-5% (see the instruction-formatting sketch after this list).
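To make the instruction-aware usage concrete, the sketch below passes a custom task instruction at query time. The "Instruct: ... / Query: ..." template mirrors the formatting used in the project's examples, but the exact wording and the prompt argument shown here should be treated as assumptions.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("Qwen/Qwen3-Embedding-0.6B")

# A short English task description tends to work best, even for non-English corpora.
task = "Given a web search query, retrieve relevant passages that answer the query"
prompt = f"Instruct: {task}\nQuery: "

queries = ["How do I speed up attention on long inputs?"]
documents = ["Flash Attention 2 reduces memory traffic for long sequences."]

# Only queries carry the instruction; documents are encoded as plain text.
query_embeddings = model.encode(queries, prompt=prompt)
document_embeddings = model.encode(documents)

print(util.cos_sim(query_embeddings, document_embeddings))
```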

Maintenance & Community

  • Developed by Alibaba Cloud.
  • Community support available via Discord.

Licensing & Compatibility

  • The weights are distributed openly via Hugging Face and ModelScope; the license is stated on the model cards rather than restated here, so commercial use should be verified against those terms.

Limitations & Caveats

  • Requires transformers>=4.51.0; older versions raise a KeyError because the Qwen3 architecture is not registered.
  • Flash Attention 2 is recommended for better performance but requires compatible hardware and the flash-attn package (see the loading sketch after this list).
  • License terms are not restated in this summary; verify the model-card license before commercial use.
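For completeness, here is a hedged sketch of loading the embedding model with Transformers and Flash Attention 2 enabled. The dtype, device, and attn_implementation settings are standard Transformers options rather than project-mandated values, and they require a supported GPU plus the flash-attn package.

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "Qwen/Qwen3-Embedding-0.6B"  # assumed Hugging Face model ID

tokenizer = AutoTokenizer.from_pretrained(model_id, padding_side="left")
model = AutoModel.from_pretrained(
    model_id,
    torch_dtype=torch.float16,                # FA2 needs fp16/bf16
    attn_implementation="flash_attention_2",  # drop this kwarg on unsupported hardware
).cuda()

inputs = tokenizer(
    ["The capital of China is Beijing."],
    padding=True,
    truncation=True,
    return_tensors="pt",
).to("cuda")

with torch.no_grad():
    outputs = model(**inputs)

# Token-level hidden states; pooling (e.g. last-token) is applied on top of these.
print(outputs.last_hidden_state.shape)
```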

Health Check

  • Last commit: 2 weeks ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 1
  • Issues (30d): 39

Star History

  • 1,159 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (author of AI Engineering and Designing Machine Learning Systems), Didier Lopes (founder of OpenBB), and 11 more.

sentence-transformers by UKPLab

Framework for text embeddings, retrieval, and reranking
17k stars · Top 0.2%
created 6 years ago · updated 3 days ago