PathRAG  by BUPT-GAMMA

Graph-based RAG with relational path pruning

Created 7 months ago
262 stars

Top 97.3% on SourcePulse

GitHubView on GitHub
Project Summary

PathRAG implements a novel retrieval-augmented generation (RAG) system that leverages graph-based relational paths to prune irrelevant information, enhancing the efficiency and accuracy of large language model responses. It is designed for researchers and developers working with complex, interconnected data who need to improve RAG performance.

How It Works

PathRAG constructs a graph from input documents, representing relationships between entities. It then utilizes these relational paths to intelligently prune the search space during retrieval, focusing on contextually relevant information. This graph-based pruning approach aims to reduce noise and improve the signal-to-noise ratio for the LLM.

Quick Start & Requirements

  • Install via pip install -e . after cloning the repository.
  • Requires an OpenAI API key for LLM interaction.
  • Example usage provided in v1_test.py.
  • Supports batch insertion of .txt files.

Highlighted Details

  • Graph-based pruning of retrieval context.
  • Focus on relational paths for improved relevance.
  • Designed to enhance RAG performance for complex data.

Maintenance & Community

  • The project is associated with the paper "PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational Paths" (arXiv:2502.14902).
  • No specific community channels or contributor details are provided in the README.

Licensing & Compatibility

  • The README does not explicitly state a license.
  • Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project is presented as code for a research paper, implying it may be experimental and not production-ready. Reliance on OpenAI models and API keys is a significant dependency. No information on supported platforms or detailed performance benchmarks is available.

Health Check
Last Commit

6 months ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
0
Star History
12 stars in the last 30 days

Explore Similar Projects

Starred by Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
2 more.

LightRAG by HKUDS

1.2%
21k
RAG framework for fast, simple retrieval-augmented generation
Created 11 months ago
Updated 2 days ago
Feedback? Help us improve.