Discover and explore top open-source AI tools and projects—updated daily.
ChuckHendPostgres extension for building AI workloads
Top 43.6% on SourcePulse
pg_vectorize is a PostgreSQL extension that simplifies building AI applications by automating text-to-embedding transformations and integrating with LLMs. It targets developers and researchers who want to leverage their existing PostgreSQL data for vector search and Retrieval-Augmented Generation (RAG) without complex infrastructure setup. The extension offers a high-level API for seamless embedding management and LLM interaction directly within SQL.
How It Works
pg_vectorize leverages several key PostgreSQL extensions: pgvector for efficient vector similarity search, pgmq for background job orchestration, and pgrx for Rust-based extension development. It automates the creation of embeddings using specified models (like Hugging Face's Sentence-Transformers or OpenAI's embeddings) and stores them in new columns. For RAG, it integrates with LLMs like OpenAI's GPT models or local Ollama services, enabling conversational AI directly on PostgreSQL data. The extension supports real-time or cron-based embedding updates via triggers or scheduled jobs.
Quick Start & Requirements
docker compose up -d. Connect to PostgreSQL using docker compose exec -it postgres psql and enable the extension with CREATE EXTENSION vectorize CASCADE;.pgrx), pg_cron (^1.5), pgmq (^1), and pgvector (^0.5.0). Configuration requires setting shared_preload_libraries and vectorize.embedding_service_url.Highlighted Details
realtime triggers or cron schedules.vectorize.generate) and embedding encoding (vectorize.encode).Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
updated_at_col or schedule parameters.1 day ago
1 day