Fast embedding SDK for text and images
Top 20.5% on sourcepulse
FastEmbed is a lightweight, fast Python library for generating text, image, and multimodal embeddings using state-of-the-art models. It targets developers and researchers needing efficient embedding generation for applications like retrieval-augmented generation (RAG), semantic search, and recommendation systems, offering a faster and more memory-efficient alternative to larger libraries.
How It Works
FastEmbed leverages the ONNX Runtime for accelerated inference, enabling faster execution compared to PyTorch. It supports various embedding types, including dense text, sparse text (SPLADE++), late interaction (ColBERT), image, and multimodal embeddings. The library allows for easy model switching and custom model integration, with options for CPU and GPU acceleration.
Quick Start & Requirements
pip install fastembed
or pip install fastembed-gpu
for GPU support.Highlighted Details
fastembed-gpu
package.qdrant-client[fastembed]
).add_custom_model
.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The library is primarily focused on ONNX-compatible models; other model formats may require conversion. While it aims for broad compatibility, specific model performance can vary.
2 days ago
1 week