Postgres extension for building AI workloads
Top 44.8% on sourcepulse
pg_vectorize is a PostgreSQL extension that simplifies building AI applications by automating text-to-embedding transformations and integrating with LLMs. It targets developers and researchers who want to leverage their existing PostgreSQL data for vector search and Retrieval-Augmented Generation (RAG) without complex infrastructure setup. The extension offers a high-level API for seamless embedding management and LLM interaction directly within SQL.
How It Works
pg_vectorize leverages several key PostgreSQL extensions: pgvector
for efficient vector similarity search, pgmq
for background job orchestration, and pgrx
for Rust-based extension development. It automates the creation of embeddings using specified models (like Hugging Face's Sentence-Transformers or OpenAI's embeddings) and stores them in new columns. For RAG, it integrates with LLMs like OpenAI's GPT models or local Ollama services, enabling conversational AI directly on PostgreSQL data. The extension supports real-time or cron-based embedding updates via triggers or scheduled jobs.
Quick Start & Requirements
docker compose up -d
. Connect to PostgreSQL using docker compose exec -it postgres psql
and enable the extension with CREATE EXTENSION vectorize CASCADE;
.pgrx
), pg_cron
(^1.5), pgmq
(^1), and pgvector
(^0.5.0). Configuration requires setting shared_preload_libraries
and vectorize.embedding_service_url
.Highlighted Details
realtime
triggers or cron schedules.vectorize.generate
) and embedding encoding (vectorize.encode
).Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
updated_at_col
or schedule
parameters.3 weeks ago
Inactive