Tool for querying GitHub repositories using embeddings
Top 53.9% on sourcepulse
Repo Chat enables users to query GitHub repositories using natural language. It leverages OpenAI embeddings and Supabase with pgvector for efficient semantic search, making it suitable for developers and researchers seeking to understand codebases without manual inspection.
How It Works
The project utilizes OpenAI's embedding models to convert repository code into vector representations. These vectors are stored in Supabase, a PostgreSQL database enhanced with the pgvector
extension. When a user asks a question, it's also embedded, and the system retrieves the most semantically similar code chunks from the vector database to generate an answer.
Quick Start & Requirements
pip install -r requirements.txt
python3 load.py
python3 embed.py
python3 main.py
pgvector
extension enabled, GitHub repository URL and branch.Highlighted Details
pgvector
for efficient similarity search.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project relies on external services (OpenAI, Supabase) which may incur costs. The README does not specify a license, raising concerns about usage rights and commercial compatibility.
1 year ago
Inactive