TrustRAG  by gomate-community

RAG framework for reliable input, trusted output

created 1 year ago
1,091 stars

Top 35.5% on sourcepulse

GitHubView on GitHub
Project Summary

TrustRAG is a configurable and modular Retrieval-Augmented Generation (RAG) framework designed for reliable input and trusted output in question-answering scenarios. It targets developers and researchers seeking to build robust RAG systems with flexible components.

How It Works

TrustRAG employs a "DeepResearch" framework for advanced information processing. This involves parsing user queries into sub-queries, performing recursive retrieval and reasoning, and making intelligent action decisions (answer, reflect, search, read, code). This layered, iterative approach allows for deeper understanding and more accurate responses, especially in complex information-seeking tasks.

Quick Start & Requirements

  • Installation: Install via pip (pip install trustrag) or from source (pip install -e .).
  • Prerequisites: Python 3.9+, LLM API keys (e.g., OpenAI), and potentially specific embedding models (e.g., BGE). Docker deployment is also supported.
  • Resources: Requires configuration of LLM API endpoints and potentially local model paths. Indexing and retrieval may require significant disk space and compute.
  • Docs: TrustRAG Documentation

Highlighted Details

  • Supports Deep Search for generating research reports.
  • Integrates multiple retrieval strategies (BM25, Dense, Hybrid) and reranking methods (BGE, LLM-based pointwise, pairwise, setwise).
  • Handles multimodal RAG with models like GLM-4V-Flash.
  • Includes modular document parsing for various formats (PDF, DOCX, HTML, etc.) and chunking.

Maintenance & Community

The project is developed by the GoMate team from the Key Laboratory of Network Data Science and Technology. Community interaction is encouraged via suggestions and PRs.

Licensing & Compatibility

The project is released under the Apache-2.0 license, permitting commercial use and integration with closed-source applications.

Limitations & Caveats

The README mentions "Waiting to implement" for ListWise-Rerank and TourRank reranking methods, indicating these features are not yet complete. Some advanced configurations may require specific hardware or model availability.

Health Check
Last commit

2 weeks ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
2
Star History
239 stars in the last 90 days

Explore Similar Projects

Starred by Jason Liu Jason Liu(Author of Instructor) and Ross Taylor Ross Taylor(Cofounder of General Reasoning; Creator of Papers with Code).

Search-R1 by PeterGriffinJin

1.3%
3k
RL framework for training LLMs to use search engines
created 5 months ago
updated 3 weeks ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind).

LightRAG by HKUDS

1.0%
19k
RAG framework for fast, simple retrieval-augmented generation
created 10 months ago
updated 21 hours ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Tim J. Baek Tim J. Baek(Founder of Open WebUI), and
2 more.

llmware by llmware-ai

0.2%
14k
Framework for enterprise RAG pipelines using small, specialized models
created 1 year ago
updated 1 week ago
Feedback? Help us improve.