RAG framework for small language models
Top 31.8% on sourcepulse
MiniRAG is an open-source framework designed to simplify Retrieval-Augmented Generation (RAG) for Small Language Models (SLMs). It addresses the performance degradation often seen with SLMs in traditional RAG systems by employing a novel heterogeneous graph indexing and lightweight topology-enhanced retrieval approach. This makes RAG more accessible for resource-constrained environments and on-device applications.
How It Works
MiniRAG utilizes a two-pronged approach: a semantic-aware heterogeneous graph indexing mechanism that unifies text chunks and named entities, and a lightweight topology-enhanced retrieval method that leverages graph structures. This design reduces the reliance on complex semantic understanding, enabling SLMs to achieve strong RAG performance by efficiently discovering knowledge through graph relationships.
Quick Start & Requirements
pip install lightrag-hku
or pip install -e .
from source.LiHua-World
dataset is available in ./dataset/LiHua-World/data/
.Highlighted Details
Maintenance & Community
The project is actively developed with recent updates in February 2025. It acknowledges foundational work from nano-graphrag and LightRAG.
Licensing & Compatibility
The project is released under the MIT license, permitting commercial use and integration with closed-source applications.
Limitations & Caveats
The performance table indicates that some methods struggle to generate effective responses for certain models and datasets, with '/' denoting such cases. The project is based on recent research, with a citation provided for the arXiv preprint.
1 month ago
1 week