Discover and explore top open-source AI tools and projects—updated daily.
NanGePlusRAG framework for multi-LLM private knowledge base construction and retrieval
Top 97.5% on SourcePulse
This project offers a unified Retrieval Augmented Generation (RAG) framework for building and querying private knowledge bases, supporting multiple LLMs (OpenAI, Qwen) via a single codebase. It targets developers needing to integrate LLMs with proprietary data, simplifying domain-specific AI application development.
How It Works
The RAG pipeline includes offline (document loading, chunking, vectorization, Chroma DB ingestion) and online (query vectorization, retrieval, prompt templating, LLM generation) phases. It leverages LangChain for orchestration and LCEL for chain composition, with optional LangSmith integration. A key component is OneAPI, an API gateway abstracting LLM provider specifics for seamless model switching.
Quick Start & Requirements
pip install -r requirements.txt.input).vectorSaveTest.py, main.py, apiTest.py).Highlighted Details
bge-reranker-large) for refined search results.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
langchain_openai/embeddings/base.py to fix a BadRequestError.1 year ago
Inactive
microsoft
HKUDS