Python vector database for semantic similarity search
Top 53.5% on sourcepulse
A Pythonic vector database designed for simplicity and scalability, offering core CRUD operations and flexible deployment options from local to cloud environments. It targets developers needing a lean yet powerful solution for managing and searching vector embeddings, leveraging DocArray for search logic and Jina for scalable index serving.
How It Works
vectordb utilizes DocArray as its core engine for vector search logic, enabling efficient Approximate Nearest Neighbor (ANN) and Exact Nearest Neighbor (ENN) searches. Jina provides the underlying infrastructure for scalable index serving, supporting sharding and replication for high availability and throughput. This architecture allows vectordb to function as a standalone library or be served as a scalable service via gRPC, HTTP, or WebSockets.
Quick Start & Requirements
pip install vectordb
BaseDoc
schema with DocArray and initializing a database class (e.g., InMemoryExactNNVectorDB
, HNSWVectorDB
).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
Currently, Jina AI Cloud deployments are limited to 1 replica; support for N replicas in the cloud is under development. The roadmap indicates plans for more ANN algorithms and enhanced filtering capabilities.
1 year ago
1 day