fastembed by qdrant

Fast embedding SDK for text and images

Created 2 years ago
2,467 stars

Top 18.8% on SourcePulse

Project Summary

FastEmbed is a lightweight, fast Python library for generating text, image, and multimodal embeddings using state-of-the-art models. It targets developers and researchers who need efficient embedding generation for applications such as retrieval-augmented generation (RAG), semantic search, and recommendation systems, and it offers a faster, more memory-efficient alternative to larger libraries such as Transformers and Sentence-Transformers.

How It Works

FastEmbed runs inference on ONNX Runtime rather than PyTorch, which speeds up execution and keeps dependencies small. It supports several embedding types: dense text, sparse text (SPLADE++), late interaction (ColBERT), image, and multimodal. Models can be switched easily, custom models can be added, and inference can run on CPU or GPU.
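
The text embedding types listed here differ mainly in the shape of their output and in how a query is scored against a document. The following is a minimal sketch of the usual scoring rule for each type, using hand-written toy vectors rather than FastEmbed output. The function names are illustrative and are not part of the library.

```python
import math


def cosine_similarity(a, b):
    """Dense: one fixed-length vector per text, compared by cosine.

    Assumes both vectors are non-zero.
    """
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def sparse_dot(a, b):
    """Sparse: {token_index: weight} maps; only shared indices contribute."""
    if len(a) > len(b):
        a, b = b, a  # iterate over the smaller map
    return sum(weight * b[index] for index, weight in a.items() if index in b)


def maxsim(query_tokens, doc_tokens):
    """Late interaction: one vector per token.

    Each query token keeps its best dot product among the document tokens,
    and those per-token maxima are summed.
    """
    return sum(
        max(sum(q * d for q, d in zip(q_vec, d_vec)) for d_vec in doc_tokens)
        for q_vec in query_tokens
    )


# Toy inputs, chosen by hand for illustration only.
dense_score = cosine_similarity([1.0, 0.0], [1.0, 1.0])
sparse_score = sparse_dot({3: 0.5, 7: 2.0}, {7: 1.5, 9: 1.0})
late_score = maxsim([[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [1.0, 0.0]])
```

Dense vectors are compact and suit approximate nearest-neighbor indexes, sparse maps keep term-level matching, and late interaction trades extra storage (one vector per token) for finer-grained matching.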

Quick Start & Requirements
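
A minimal sketch of first use, under some assumptions not stated on this page: the package is installed with `pip install fastembed`, the model name `BAAI/bge-small-en-v1.5` is available, and the first call is allowed to download model weights. `embed_texts` and `top_k` are hypothetical helpers written for this example.

```python
def embed_texts(texts, model_name="BAAI/bge-small-en-v1.5"):
    """Embed texts with FastEmbed (requires `pip install fastembed`).

    The import lives inside the function so the ranking helper below can be
    used and tested without the package installed.
    """
    from fastembed import TextEmbedding

    model = TextEmbedding(model_name=model_name)
    # embed() yields one numpy vector per input text.
    return [vector.tolist() for vector in model.embed(texts)]


def top_k(query_vec, doc_vecs, k=3):
    """Return (index, score) pairs for the k highest dot products.

    For unit-length vectors the dot product equals cosine similarity.
    """
    scores = [
        (i, sum(q * d for q, d in zip(query_vec, vec)))
        for i, vec in enumerate(doc_vecs)
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:k]


# Example usage (downloads the model on first run):
# docs = ["FastEmbed runs on ONNX Runtime.", "Qdrant is a vector database."]
# doc_vecs = embed_texts(docs)
# query_vec = embed_texts(["Which runtime does FastEmbed use?"])[0]
# print(top_k(query_vec, doc_vecs, k=1))
```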

Highlighted Details

  • Supports dense, sparse, late interaction, image, and multimodal embeddings.
  • Claims better embedding accuracy than OpenAI Ada-002, and is faster and lighter than Transformers and Sentence-Transformers.
  • ONNX Runtime backend for speed and reduced dependencies.
  • GPU acceleration via fastembed-gpu package.
  • Integrates with the Qdrant client through the qdrant-client[fastembed] extra.
  • Allows adding custom models via add_custom_model.
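
As a sketch of the GPU option in the list above: ONNX Runtime selects hardware through an ordered list of execution providers. The example assumes that `fastembed-gpu` is installed in place of `fastembed`, that `TextEmbedding` accepts a `providers` keyword, and that a CUDA-capable device is present. `choose_providers` and `build_embedder` are hypothetical helper names.

```python
def choose_providers(use_gpu):
    """Return ONNX Runtime execution providers in priority order.

    CPU is always kept as the last entry so inference can fall back to it.
    """
    if use_gpu:
        return ["CUDAExecutionProvider", "CPUExecutionProvider"]
    return ["CPUExecutionProvider"]


def build_embedder(use_gpu=False, model_name="BAAI/bge-small-en-v1.5"):
    """Create a text embedding model on the requested hardware.

    Assumption: GPU use needs `pip install fastembed-gpu` and a working
    CUDA setup. The import is deferred so this sketch loads without it.
    """
    from fastembed import TextEmbedding

    return TextEmbedding(
        model_name=model_name,
        providers=choose_providers(use_gpu),
    )
```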

Maintenance & Community

  • Supported and maintained by Qdrant.
  • Active development indicated by frequent updates and model additions.

Licensing & Compatibility

  • Apache 2.0 License.
  • Permissive, allowing commercial use and integration with closed-source applications.

Limitations & Caveats

The library focuses on ONNX-compatible models; models in other formats may need to be converted first. Although it aims for broad compatibility, performance varies from model to model.

Health Check

  • Last commit: 4 days ago
  • Responsiveness: 1 week
  • Pull requests (30d): 6
  • Issues (30d): 5
  • Star history: 52 stars in the last 30 days
