BCEmbedding by netease-youdao

Open-source embedding/reranker models for RAG

Created 2 years ago

1,859 stars

Top 23.2% on SourcePulse

View on GitHub

1 Expert Loves This Project

Chip Huyen

Author of "AI Engineering", "Designing Machine Learning Systems"

Project Summary

BCEmbedding provides open-source bilingual and crosslingual embedding and reranker models specifically designed for Retrieval Augmented Generation (RAG) applications. Developed by Netease Youdao, it targets developers and researchers building RAG systems that require robust performance across Chinese and English languages, offering a two-stage retrieval solution.

How It Works

BCEmbedding utilizes a two-stage retrieval process. The EmbeddingModel acts as a dual-encoder for efficient first-stage retrieval, generating semantic vectors. The RerankerModel then employs a cross-encoder for a second-stage refinement, re-ranking retrieved passages for enhanced precision and relevance. This approach leverages Youdao's translation engine for strong bilingual and crosslingual capabilities, aiming for out-of-the-box usability without fine-tuning.

Quick Start & Requirements

Installation: pip install BCEmbedding or install from source.
Prerequisites: PyTorch (ensure CUDA compatibility), transformers, sentence-transformers, langchain, llama-index (for integrations). GPU recommended for optimal performance.
Resources: Models are available on Hugging Face. Specific resource requirements depend on model size and usage.
Docs: Manual, Quick Start, Langchain Integration, LlamaIndex Integration.

Highlighted Details

Achieves State-of-the-Art (SOTA) performance on MTEB and LlamaIndex RAG evaluations, outperforming comparable open-source models in bilingual and crosslingual scenarios.
RerankerModel supports long passages (up to 32k tokens) and provides meaningful relevance scores.
Instruction-free design for EmbeddingModel, simplifying integration.
Models are proven in production within Youdao's products.

Maintenance & Community

Actively maintained by Netease Youdao.
Community engagement via WeChat group.
Related Links include QAnything, FlagEmbedding, MTEB, and LlamaIndex.

Licensing & Compatibility

Licensed under Apache 2.0 License.
Permissive license suitable for commercial use and integration into closed-source projects.

Limitations & Caveats

While supporting Chinese and English, broader language support for EmbeddingModel is noted as "coming soon."
RAG evaluation scripts recommend at least two GPUs for optimal execution.

Health Check

Last Commit

4 months ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

10 stars in the last 30 days