ThinkRAG by wzdavid

Local LLM RAG system for laptop deployment, enabling local knowledge Q&A

created 1 year ago
257 stars

Top 98.8% on sourcepulse

Project Summary

ThinkRAG is a locally deployable Retrieval-Augmented Generation (RAG) system designed for efficient Q&A over private knowledge bases on a laptop. It targets professionals, researchers, and students seeking an offline, privacy-preserving AI assistant, offering optimized handling of Chinese language data and flexible model integration.

How It Works

Built on LlamaIndex and Streamlit, ThinkRAG employs a modular architecture. It supports various LLMs via OpenAI-compatible APIs and local deployments through Ollama. For data processing, it utilizes SpacyTextSplitter for enhanced Chinese text segmentation and BAAI embedding/reranking models for improved relevance. The system offers a development mode with local file storage and an optional production mode leveraging Redis and LanceDB for persistent storage and vector indexing.
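The retrieve-then-rerank flow described above can be sketched in plain Python. This is a toy illustration, not ThinkRAG's actual code: a simple keyword-overlap score stands in for the BAAI embedding and reranking models, and the naive splitter stands in for SpacyTextSplitter.

```python
# Toy sketch of a retrieve-then-rerank RAG step. In ThinkRAG the scoring is
# done by BAAI embedding/reranking models via LlamaIndex; a keyword-overlap
# score stands in here so the flow is visible end to end.

def split_text(text, chunk_size=40):
    """Naive word-count splitter; ThinkRAG uses SpacyTextSplitter for Chinese text."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]

def keyword_score(query, chunk):
    """Placeholder relevance score: fraction of query words found in the chunk."""
    q = set(query.lower().split())
    c = set(chunk.lower().split())
    return len(q & c) / max(len(q), 1)

def retrieve(query, chunks, top_k=2):
    """Return the top_k chunks by score, as a reranker would order them."""
    ranked = sorted(chunks, key=lambda ch: keyword_score(query, ch), reverse=True)
    return ranked[:top_k]

docs = ("ThinkRAG stores indexes locally in development mode. "
        "Production mode uses Redis and LanceDB for persistence.")
chunks = split_text(docs, chunk_size=8)
context = retrieve("Which mode uses Redis?", chunks)
# The retrieved context would then be packed into a (Chinese or English)
# prompt template and sent to the LLM via Ollama or an OpenAI-compatible API.
```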

Quick Start & Requirements

  • Install dependencies: pip3 install -r requirements.txt
  • For offline use, download Ollama and LLMs (e.g., DeepSeek, Qwen, Gemma) via Ollama commands.
  • Download embedding (e.g., BAAI/bge-large-zh-v1.5) and reranking models to the localmodels directory.
  • Configure API keys via environment variables (e.g., OPENAI_API_KEY, DEEPSEEK_API_KEY) or the application interface.
  • Run the system: streamlit run app.py
  • Requires Python 3.x.
  • Known Issue: Windows users may encounter issues; Linux or macOS is recommended.
  • Dependency Note: Requires Ollama version 0.3.3 due to compatibility issues with newer versions.
  • See docs/HowToDownloadModels.md for detailed model download instructions.
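The API-key configuration step above (environment variables or the application interface) implies a simple resolution order. The sketch below is hypothetical, assuming the UI-entered key takes precedence over the environment variable; the function name and settings dict are illustrative, not ThinkRAG's actual API.

```python
import os

def resolve_api_key(provider_env_var, ui_settings=None):
    """Return the key entered in the app interface if present,
    else fall back to the environment variable (e.g. OPENAI_API_KEY,
    DEEPSEEK_API_KEY). Hypothetical helper, not ThinkRAG's real code."""
    ui_settings = ui_settings or {}
    return ui_settings.get(provider_env_var) or os.environ.get(provider_env_var)

os.environ["DEEPSEEK_API_KEY"] = "sk-env-example"   # e.g. exported in the shell
key = resolve_api_key("DEEPSEEK_API_KEY")
override = resolve_api_key("DEEPSEEK_API_KEY", {"DEEPSEEK_API_KEY": "sk-ui"})
```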

Highlighted Details

  • Optimized for Chinese language processing with Spacy text splitter, Chinese prompt templates, and bilingual embedding models.
  • Supports a wide range of LLMs, including popular Chinese providers like DeepSeek, Moonshot, and ZhiPu, alongside OpenAI and Ollama-compatible models.
  • Offers both development (local file storage) and production (Redis, LanceDB) modes for flexible deployment.
  • Enables local file uploads (PDF, DOCX, PPTX) and URL ingestion for knowledge base creation.
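Ingestion of both local files and URLs, as in the last bullet, requires routing each source to the right loader. A minimal sketch of that routing, assuming the listed extensions (PDF, DOCX, PPTX); the function and the extension set are illustrative, and ThinkRAG's actual readers (from LlamaIndex) are not shown.

```python
from pathlib import Path
from urllib.parse import urlparse

# Illustrative extension set based on the formats listed above;
# not necessarily ThinkRAG's full supported list.
SUPPORTED_EXTENSIONS = {".pdf", ".docx", ".pptx"}

def classify_source(source):
    """Classify a knowledge-base source string as 'url', 'file', or 'unsupported'."""
    if urlparse(source).scheme in ("http", "https"):
        return "url"
    if Path(source).suffix.lower() in SUPPORTED_EXTENSIONS:
        return "file"
    return "unsupported"
```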

Maintenance & Community

The project is open-source and welcomes contributions. Links to community channels or roadmaps are not explicitly provided in the README.

Licensing & Compatibility

  • License: MIT License.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The system is not recommended for Windows users due to unresolved issues. A specific, older version of Ollama (0.3.3) is required for compatibility.

Health Check

  • Last commit: 1 week ago
  • Responsiveness: 1 week
  • Pull Requests (30d): 3
  • Issues (30d): 2
  • Star History: 37 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (author of AI Engineering and Designing Machine Learning Systems) and Elie Bursztein (Cybersecurity Lead at Google DeepMind).

LightRAG by HKUDS

RAG framework for fast, simple retrieval-augmented generation

  • Top 1.0%, 19k stars
  • Created 10 months ago; updated 1 day ago