Discover and explore top open-source AI tools and projects—updated daily.
pgvectorPython SDK for pgvector, enabling vector storage in Postgres
Top 29.6% on SourcePulse
This library provides Python bindings for the pgvector PostgreSQL extension, enabling efficient storage and querying of vector embeddings directly within a PostgreSQL database. It targets Python developers working with machine learning, AI, and data science applications who need to integrate vector search capabilities into their existing PostgreSQL-based data pipelines. The primary benefit is leveraging PostgreSQL's robustness and familiarity for vector similarity search.
How It Works
The library translates Python objects and ORM calls into SQL queries that interact with the pgvector extension's custom data types (vector, halfvec, bit, sparsevec) and functions. It supports various distance metrics (L2, cosine, inner product) and indexing methods (HNSW, IVFFlat) for optimized nearest neighbor searches. This approach allows developers to manage vector data alongside relational data within a single database, simplifying architecture and data management.
Quick Start & Requirements
pip install pgvectorpgvector extension installed.Highlighted Details
vector, halfvec, bit, and sparsevec data types.Maintenance & Community
The project is actively maintained by the pgvector team. Contributions are welcomed, with clear guidelines for reporting bugs, fixing issues, and adding features. Development setup instructions and testing commands are provided.
Licensing & Compatibility
The library is released under the MIT License, allowing for broad use, including commercial applications.
Limitations & Caveats
While the library offers extensive ORM and driver support, users must ensure the pgvector extension is correctly installed and enabled in their PostgreSQL instance. Performance for very large datasets will depend heavily on proper indexing and PostgreSQL configuration.
3 weeks ago
1 day
asg017
tensorchord
marqo-ai
lancedb
pgvector