history_rag  by wxywb

RAG for Chinese history Q&A

created 1 year ago
1,028 stars

Top 37.1% on sourcepulse

Project Summary

This project provides a Chinese history question-answering application using Retrieval-Augmented Generation (RAG) with vector databases. It targets users seeking accurate historical information and aims to mitigate LLM hallucination by grounding responses in retrieved historical documents.
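
As a rough illustration of what grounding a response in retrieved documents can look like, the sketch below assembles a prompt that restricts the model to the supplied passages. The function name `build_grounded_prompt` and the prompt wording are assumptions made for this example and are not taken from the history_rag code.

```python
def build_grounded_prompt(question: str, passages: list[str]) -> str:
    """Assemble a prompt that asks the model to answer only from the passages."""
    # Number each passage so an answer can point back to its source.
    context = "\n".join(f"[{i}] {p}" for i, p in enumerate(passages, start=1))
    return (
        "Answer the question using only the passages below. "
        "If they do not contain the answer, say so.\n\n"
        f"Passages:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical retrieved passage, used here only to show the prompt shape.
prompt = build_grounded_prompt(
    "Who founded the Han dynasty?",
    ["Liu Bang founded the Han dynasty in 202 BC."],
)
print(prompt)
```

The resulting string would then be sent to whichever LLM is configured. The instruction to admit when the passages lack an answer is one common way to discourage unsupported claims.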

How It Works

The project builds its RAG pipeline on the LlamaIndex framework and offers two deployment options: a local Milvus vector database paired with the BAAI/bge-base-zh-v1.5 embedding model, or the cloud-based Zilliz Cloud Pipelines service for document processing and retrieval. Both options use OpenAI's GPT-4 for text generation. In either case, the system ingests historical texts, splits them into chunks, generates an embedding for each chunk, and stores the embeddings in a vector index for retrieval at query time.
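
The ingest, split, embed, store, and retrieve steps can be sketched in a few lines of dependency-free Python. This is a toy stand-in under stated assumptions, not the project's implementation: a word-count vector replaces the BAAI/bge-base-zh-v1.5 embedding model, an in-memory list replaces Milvus, and the names `slice_text`, `embed`, and `ToyVectorIndex` are hypothetical.

```python
import math
import re
from collections import Counter


def slice_text(text: str) -> list[str]:
    """Split a document into sentence-level chunks (toy splitter)."""
    return [s.strip() for s in text.split(".") if s.strip()]


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts, standing in for a neural model."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class ToyVectorIndex:
    """In-memory list of (chunk, vector) pairs, standing in for a vector DB."""

    def __init__(self) -> None:
        self.entries: list[tuple[str, Counter]] = []

    def add(self, chunk: str) -> None:
        self.entries.append((chunk, embed(chunk)))

    def search(self, query: str, top_k: int = 1) -> list[str]:
        q = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[1]), reverse=True)
        return [chunk for chunk, _ in ranked[:top_k]]


corpus = (
    "Liu Bang founded the Han dynasty. "
    "Li Yuan founded the Tang dynasty. "
    "Zhao Kuangyin founded the Song dynasty."
)

index = ToyVectorIndex()
for chunk in slice_text(corpus):  # ingest and split
    index.add(chunk)              # embed and store

top = index.search("Who founded the Tang dynasty?", top_k=1)  # retrieve
print(top)
```

In a real deployment the brute-force sort in `search` would be replaced by the vector database's approximate nearest-neighbor search, and the retrieved chunks would be passed to the LLM as context.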

Quick Start & Requirements

Highlighted Details

  • Supports local LLM services (e.g., fastchat) and Gemini models via a proxy.
  • Includes a Gradio-based web UI for interactive use.
  • Allows customization of embedding and reranker models via cfgs/config.yaml.
  • Zilliz Cloud Pipelines offer scalability and managed RAG services.
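
To show where a reranker fits, the sketch below reorders first-stage candidates with a second relevance score. It is only an illustration of the retrieve-then-rerank pattern: the `rerank` and `term_coverage` functions are hypothetical, and the toy word-overlap scorer stands in for whatever reranker model is configured in cfgs/config.yaml.

```python
from typing import Callable


def rerank(
    query: str,
    candidates: list[str],
    score: Callable[[str, str], float],
    top_n: int = 2,
) -> list[str]:
    """Reorder first-stage candidates by a second, finer relevance score."""
    return sorted(candidates, key=lambda c: score(query, c), reverse=True)[:top_n]


def term_coverage(query: str, passage: str) -> float:
    """Toy scorer: share of query words that also appear in the passage."""
    q_terms = set(query.lower().split())
    p_terms = set(passage.lower().split())
    return len(q_terms & p_terms) / len(q_terms) if q_terms else 0.0


# Hypothetical candidates as they might come back from vector search.
candidates = [
    "the song dynasty began in 960",
    "the tang dynasty began in 618",
    "tang poetry flourished under emperor xuanzong",
]

best = rerank("when did the tang dynasty begin", candidates, term_coverage, top_n=1)
print(best)
```

Keeping the scorer as a parameter mirrors the idea of swapping reranker models through configuration without touching the retrieval code.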

Maintenance & Community

The latest updates noted (June 2024) concern LlamaIndex and reranker functionality. Contributors include darius-gs, BetterAndBetterII, leyiang, and taihaozesong.

Licensing & Compatibility

The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The Zilliz Cloud Pipelines option currently supports importing documents only via URL. The project uses OpenAI's GPT-4 by default; switching to another LLM requires modifying executor.py in addition to the configuration files.

Health Check

Last commit: 1 year ago
Responsiveness: 1 day
Pull Requests (30d): 0
Issues (30d): 0

Star History

30 stars in the last 90 days
