CLI tool for chatbot creation using Streamlit, OpenAI, and Deep Lake
Top 34.2% on sourcepulse
This project provides Python scripts to build a Git repository-aware chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloop's Deep Lake. It's designed for developers and researchers who want to quickly query and understand the content of any Git repository through a conversational interface.
How It Works
The solution comprises two core Python scripts. process.py
clones a specified Git repository, extracts text content from specified file types, generates embeddings using OpenAIEmbeddings, and stores these embeddings in an Activeloop Deep Lake dataset. chat.py
then builds a Streamlit web application that queries this Deep Lake dataset based on user input and leverages OpenAI GPT-3.5-turbo to generate contextually relevant answers. This approach allows for efficient semantic search over repository content.
Quick Start & Requirements
pip install -r requirements.txt
OPENAI_API_KEY
, ACTIVELOOP_TOKEN
, ACTIVELOOP_USERNAME
(copy from .env.example
).python src/main.py process --repo-url <github_repo_url>
python src/main.py chat --activeloop-dataset-name <dataset_name>
Highlighted Details
Maintenance & Community
No specific information on contributors, sponsorships (beyond a general mention of "Exploding Insights"), or community channels is provided in the README.
Licensing & Compatibility
Limitations & Caveats
The project relies on external API keys for OpenAI and Activeloop, which incur costs. The effectiveness of the chatbot is dependent on the quality of embeddings generated by OpenAI and the structure of the repository's text content.
1 year ago
Inactive