rag-chatbot by umbertogriffo

A RAG chatbot that answers questions using context from Markdown files

Created 2 years ago · 324 stars · Top 83.8% on SourcePulse

Project Summary

This project provides a conversational RAG chatbot that answers questions based on a collection of Markdown files. It's designed for users who want to leverage local, open-source LLMs for document-based Q&A, offering features like conversation memory and multiple response synthesis strategies.

How It Works

The chatbot processes Markdown files by splitting them into chunks, generating embeddings with the all-MiniLM-L6-v2 sentence-transformer, and storing them in a Chroma vector database. When a user asks a question, an LLM first rewrites the query for better retrieval. The most relevant document chunks are then fetched from Chroma and used as context to generate an answer with a local LLM via llama-cpp-python. The chatbot keeps conversation memory and offers three response synthesis strategies: Create and Refine, Hierarchical Summarization, and Async Hierarchical Summarization.
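
A minimal sketch of this ingest-retrieve-generate flow, assuming the chromadb, sentence-transformers, and llama-cpp-python packages; the model path, fixed-width chunking, and prompt wording below are illustrative stand-ins, not the project's actual code:

    # Illustrative RAG pipeline sketch: ingest Markdown, retrieve, generate.
    from pathlib import Path

    import chromadb
    from llama_cpp import Llama
    from sentence_transformers import SentenceTransformer

    embedder = SentenceTransformer("all-MiniLM-L6-v2")
    llm = Llama(model_path="models/model.gguf", n_ctx=4096)  # hypothetical path
    collection = chromadb.Client().create_collection("docs")

    # 1. Ingest: split each Markdown file into chunks and index their embeddings.
    for doc_id, md in enumerate(Path("docs").glob("*.md")):
        text = md.read_text()
        chunks = [text[i:i + 1000] for i in range(0, len(text), 1000)]  # naive split
        collection.add(
            ids=[f"{doc_id}-{i}" for i in range(len(chunks))],
            documents=chunks,
            embeddings=embedder.encode(chunks).tolist(),
        )

    def answer(question: str, k: int = 2) -> str:
        # 2. Rewrite the query with the LLM to improve retrieval.
        rewritten = llm(f"Rewrite this question for document search: {question}\n",
                        max_tokens=64)["choices"][0]["text"].strip()
        # 3. Retrieve the k most similar chunks from Chroma.
        hits = collection.query(
            query_embeddings=embedder.encode([rewritten]).tolist(), n_results=k
        )
        context = "\n\n".join(hits["documents"][0])
        # 4. Generate the final answer grounded in the retrieved context.
        prompt = f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
        return llm(prompt, max_tokens=256)["choices"][0]["text"].strip()

The project itself chunks text with its refactored RecursiveCharacterTextSplitter rather than the fixed-width split shown here.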

Quick Start & Requirements

  • Install: Use make setup_cuda (for NVIDIA) or make setup_metal (for macOS Metal).
  • Prerequisites: Python 3.10+, Poetry 1.7.0, GPU with CUDA 12.1+ (for setup_cuda).
  • Run Chatbot: streamlit run chatbot/chatbot_app.py -- --model <model_name>
  • Run RAG Chatbot: streamlit run chatbot/rag_chatbot_app.py -- --model <model_name> --k <num_chunks> --synthesis-strategy <strategy>
  • Docs: llama-cpp-python GitHub Issues

Highlighted Details

  • Leverages llama-cpp-python for efficient local LLM execution with quantization (4-bit precision).
  • Supports various open-source LLMs including Llama 3.1, OpenChat, Starling, Phi-3.5, and StableLM.
  • Implements conversation-aware memory and three context synthesis strategies for handling long contexts; a sketch of Create and Refine follows this list.
  • Includes a refactored version of LangChain's RecursiveCharacterTextSplitter to avoid adding LangChain as a dependency.
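
As the name suggests, Create and Refine drafts an answer from the first retrieved chunk and then refines it with each remaining chunk in turn. A hedged sketch of that idea; the prompt wording and the llm call convention (matching the llama-cpp-python sketch above) are assumptions, not the project's code:

    # Illustrative "Create and Refine" synthesis over retrieved chunks.
    def create_and_refine(llm, question: str, chunks: list[str]) -> str:
        # Draft an initial answer from the first chunk.
        answer = llm(f"Context:\n{chunks[0]}\n\nQuestion: {question}\nAnswer:",
                     max_tokens=256)["choices"][0]["text"].strip()
        # Refine the draft with each remaining chunk.
        for chunk in chunks[1:]:
            prompt = (f"Existing answer:\n{answer}\n\n"
                      f"New context:\n{chunk}\n\n"
                      f"Refine the answer to: {question}\nAnswer:")
            answer = llm(prompt, max_tokens=256)["choices"][0]["text"].strip()
        return answer

Hierarchical Summarization, by contrast, produces per-chunk answers and then combines them, and the async variant presumably issues those per-chunk calls concurrently.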

Maintenance & Community

  • The README mentions no contributors, sponsorships, or community links (Discord/Slack).

Licensing & Compatibility

  • The project does not explicitly state a license in the README.

Limitations & Caveats

  • The README warns that LLMs may generate hallucinations or false information.
  • GPU acceleration on M1 Macs requires using an ARM version of Python; x86 Python will not use the GPU.
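
A quick check for the ARM-Python caveat above, using the standard-library platform.machine() call, which reports the interpreter's architecture:

    # On Apple Silicon, an ARM build of Python reports "arm64";
    # an x86 build (e.g. running under Rosetta) reports "x86_64" and
    # will not use the Metal GPU.
    import platform
    print(platform.machine())  # expect "arm64" for GPU acceleration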

Health Check

  • Last Commit: 1 month ago
  • Responsiveness: 1 week
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 17 stars in the last 30 days
