Discover and explore top open-source AI tools and projects—updated daily.
deepletheProduction-ready RAG with structure-aware reasoning
New!
Top 83.2% on SourcePulse
Summary
ForgeRAG provides a production-ready Retrieval-Augmented Generation (RAG) system, addressing naive approach limitations with structure-aware reasoning, knowledge graph (KG) multi-hop traversal, and LLM tree navigation. It delivers grounded answers with pixel-precise, verifiable citations, targeting engineers and researchers seeking advanced RAG capabilities and superior performance.
How It Works
ForgeRAG mimics domain expert reasoning: BM25/vector search retrieves candidates, a KG connects concepts, and LLM tree navigation pinpoints exact information. This fused approach handles multi-hop queries via KG path extraction and dual-level retrieval. It injects a distilled KG knowledge layer into the LLM prompt, grounding answers in original text and providing pixel-precise citations for verification, mitigating hallucination risks.
Quick Start & Requirements
Prerequisites: Python 3.10+, Node.js 18+ (frontend), LLM API key (LiteLLM compatible). Recommended: 4+ CPU, 8GB+ RAM (16GB+ for KG extraction). Local setup: clone, install dependencies (pip, npm), configure LLM keys (scripts/setup.py), run main.py. Docker: docker compose up -d. Web UI at http://localhost:8000. MinerU recommended for complex PDFs. Extensive documentation available.
Highlighted Details
Maintenance & Community
Roadmap includes expanded benchmarks, scaling to 1M+ documents, multi-language support, a Python SDK, and improved configuration diagnostics. A contributing guide is provided. Related projects like LightRAG, GraphRAG, and PageIndex are noted.
Licensing & Compatibility
Released under the MIT License, permitting commercial use and integration into closed-source projects.
Limitations & Caveats
The UltraDomain benchmark evaluates Comprehensiveness, Diversity, and Empowerment, but not factual accuracy. While ForgeRAG provides citations for verification, the benchmark's focus means factual correctness is not directly measured. KG extraction can be resource-intensive, requiring significant RAM for large documents.
7 hours ago
Inactive
Future-House
NirDiamant