RagLangChainTest by NanGePlus

RAG framework for multi-LLM private knowledge base construction and retrieval

Created 1 year ago

309 stars

Top 86.8% on SourcePulse

Project Summary

This project offers a unified Retrieval Augmented Generation (RAG) framework for building and querying private knowledge bases, supporting multiple LLMs (OpenAI, Qwen) via a single codebase. It targets developers needing to integrate LLMs with proprietary data, simplifying domain-specific AI application development.

How It Works

The RAG pipeline includes offline (document loading, chunking, vectorization, Chroma DB ingestion) and online (query vectorization, retrieval, prompt templating, LLM generation) phases. It leverages LangChain for orchestration and LCEL for chain composition, with optional LangSmith integration. A key component is OneAPI, an API gateway abstracting LLM provider specifics for seamless model switching.

Quick Start & Requirements

Installation: Clone repo, set up Python environment (Anaconda/PyCharm), pip install -r requirements.txt.
Prerequisites: Anaconda/PyCharm, OneAPI (deployment, LLM API keys), potential OpenAI proxy, LangSmith API key (optional), input documents (e.g., PDFs in input).
Configuration: Adjust API endpoints, keys, and models in scripts (vectorSaveTest.py, main.py, apiTest.py).
Resources: No specific hardware (GPU/CUDA) or OS requirements detailed.
Demos: Numerous Bilibili video links provided for setup and advanced features.

Highlighted Details

Multi-LLM Support: Unified RAG via OneAPI gateway for OpenAI, Qwen, etc.
Advanced Retrieval: Integrates a re-ranker (bge-reranker-large) for refined search results.
PDF Table Handling: Solutions for processing tables in PDFs (image-to-text analysis, text extraction/summarization).
Conversation Memory: Retains and utilizes historical dialogue context.
LangChain Expression Language (LCEL): Enables declarative chain composition.

Maintenance & Community

Hosted on GitHub and Gitee.
Extensive Bilibili video series offer detailed setup and feature walkthroughs.
No explicit community channels or roadmap mentioned.

Licensing & Compatibility

The README does not specify a software license, potentially impacting commercial use or integration.

Limitations & Caveats

Requires a code modification in langchain_openai/embeddings/base.py to fix a BadRequestError.
OneAPI setup and API key management add deployment complexity.
Lacks explicit performance benchmarks, hardware acceleration requirements (GPU/CUDA), or OS compatibility details.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

18 stars in the last 30 days

Explore Similar Projects

Knowledge-Base-Self-Hosting-Kit by 2dogsandanerd

RAG system for private code and document querying

Created 7 months ago

Updated 3 months ago

conversational-agent-langchain by mfmezger

FastAPI backend for conversational RAG agents

Created 3 years ago

Updated 2 weeks ago

rag-all-in-one by lehoanglong95

Mastering Retrieval-Augmented Generation (RAG) applications

Created 1 year ago

Updated 1 month ago

mcp-local-rag by shinpr

Local RAG for private code and document search

Created 8 months ago

Updated 22 hours ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera) and

Philipp Schmid

Philipp Schmid(DevRel at Google DeepMind).

llm-search by snexus

Advanced RAG system for local document interaction

Created 3 years ago

Updated 5 months ago

Chat_with_Datawhale_langchain by logan-zou

RAG for personal knowledge base Q&A

Created 2 years ago

Updated 2 years ago

ChatPDF by shibing624

RAG for local LLM, enables chat with PDF/docs

Created 3 years ago

Updated 1 year ago

Chinese-LangChain by yanqiangmiffy

Gradio SDK for local knowledge base QA using ChatGLM-6B + LangChain

Created 3 years ago

Updated 3 years ago

rag-web-ui by rag-web-ui

RAG system for building intelligent Q&A over a knowledge base

Created 1 year ago

Updated 3 months ago

Starred by

Eric Zhu

Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research) and

Andre Zayarni

Andre Zayarni(Cofounder of Qdrant).

kernel-memory by microsoft

RAG architecture for indexing and querying data using LLMs

Created 3 years ago

Updated 1 month ago

Starred by

Clarence Chio

Clarence Chio(Cofounder of Coverbase, Unit21) and

Jasper Zhang

Jasper Zhang(Cofounder of Hyperbolic).

pdfGPT by bhaskatripathi

PDF chatbot for interacting with PDF content

Created 3 years ago

Updated 4 months ago

Starred by

Matei Zaharia

Matei Zaharia(Cofounder of Databricks),

Luis Capelo

Luis Capelo(Cofounder of Lightning AI), and

2 more.

LEANN by StarTrail-org

RAG on Everything with 97% storage savings

Created 1 year ago

Updated 1 week ago

Feedback? Help us improve.