history_rag  by wxywb

RAG for Chinese history Q&A

Created 1 year ago
1,029 stars

Top 36.5% on SourcePulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

This project provides a Chinese history question-answering application using Retrieval-Augmented Generation (RAG) with vector databases. It targets users seeking accurate historical information and aims to mitigate LLM hallucination by grounding responses in retrieved historical documents.

How It Works

The core approach leverages the LlamaIndex framework for RAG. It offers two primary deployment options: a local Milvus vector database setup using the BAAI/bge-base-zh-v1.5 embedding model, or a cloud-based Zilliz Cloud Pipelines service for document processing and retrieval. Both methods utilize OpenAI's GPT-4 for text generation. The system ingests historical texts, slices them, generates embeddings, and stores them in a vector index for efficient retrieval.

Quick Start & Requirements

Highlighted Details

  • Supports local LLM services (e.g., fastchat) and Gemini models via a proxy.
  • Includes a Gradio-based web UI for interactive use.
  • Allows customization of embedding and reranker models via cfgs/config.yaml.
  • Zilliz Cloud Pipelines offer scalability and managed RAG services.

Maintenance & Community

The project has seen recent updates (June 2024) to LlamaIndex and reranker functionality. Contributions are noted from users like darius-gs, BetterAndBetterII, leyiang, and taihaozesong.

Licensing & Compatibility

The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The Zilliz Cloud Pipelines solution currently only supports importing documents via URL. Customizing LLMs requires modifying executor.py in addition to configuration files. The project relies on OpenAI's GPT-4 by default, and alternative LLM integration may require code changes.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research) and Andre Zayarni Andre Zayarni(Cofounder of Qdrant).

kernel-memory by microsoft

0.2%
2k
RAG architecture for indexing and querying data using LLMs
Created 2 years ago
Updated 1 day ago
Starred by Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
2 more.

LightRAG by HKUDS

1.2%
21k
RAG framework for fast, simple retrieval-augmented generation
Created 11 months ago
Updated 2 days ago
Feedback? Help us improve.