Discover and explore top open-source AI tools and projects—updated daily.
qdrantFast embedding SDK for text and images
Top 18.8% on SourcePulse
FastEmbed is a lightweight, fast Python library for generating text, image, and multimodal embeddings using state-of-the-art models. It targets developers and researchers needing efficient embedding generation for applications like retrieval-augmented generation (RAG), semantic search, and recommendation systems, offering a faster and more memory-efficient alternative to larger libraries.
How It Works
FastEmbed leverages the ONNX Runtime for accelerated inference, enabling faster execution compared to PyTorch. It supports various embedding types, including dense text, sparse text (SPLADE++), late interaction (ColBERT), image, and multimodal embeddings. The library allows for easy model switching and custom model integration, with options for CPU and GPU acceleration.
Quick Start & Requirements
pip install fastembed or pip install fastembed-gpu for GPU support.Highlighted Details
fastembed-gpu package.qdrant-client[fastembed]).add_custom_model.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The library is primarily focused on ONNX-compatible models; other model formats may require conversion. While it aims for broad compatibility, specific model performance can vary.
4 days ago
1 week
kohjingyu
dleemiller
unum-cloud
lucidrains
jina-ai