MiniRAG by HKUDS

RAG framework for small language models

Created 1 year ago

1,723 stars

Top 24.3% on SourcePulse

Project Summary

MiniRAG is an open-source framework designed to simplify Retrieval-Augmented Generation (RAG) for Small Language Models (SLMs). It addresses the performance degradation often seen with SLMs in traditional RAG systems by employing a novel heterogeneous graph indexing and lightweight topology-enhanced retrieval approach. This makes RAG more accessible for resource-constrained environments and on-device applications.

How It Works

MiniRAG utilizes a two-pronged approach: a semantic-aware heterogeneous graph indexing mechanism that unifies text chunks and named entities, and a lightweight topology-enhanced retrieval method that leverages graph structures. This design reduces the reliance on complex semantic understanding, enabling SLMs to achieve strong RAG performance by efficiently discovering knowledge through graph relationships.

Quick Start & Requirements

Install via pip: pip install lightrag-hku or pip install -e . from source.
Requires Python.
The LiHua-World dataset is available in ./dataset/LiHua-World/data/.
Official documentation and reproduction scripts are available in the repository.

Highlighted Details

Achieves comparable performance to LLM-based methods with SLMs.
Requires only 25% of the storage space compared to other methods.
Supports 10+ heterogeneous graph databases, including Neo4j and PostgreSQL.
Offers API and Docker deployment options.

Maintenance & Community

The project is actively developed with recent updates in February 2025. It acknowledges foundational work from nano-graphrag and LightRAG.

Licensing & Compatibility

The project is released under the MIT license, permitting commercial use and integration with closed-source applications.

Limitations & Caveats

The performance table indicates that some methods struggle to generate effective responses for certain models and datasets, with '/' denoting such cases. The project is based on recent research, with a citation provided for the arXiv preprint.

Health Check

Last Commit

4 months ago

Responsiveness

1 week

Pull Requests (30d)

Issues (30d)

Star History

44 stars in the last 30 days