Advanced RAG system for local document interaction
Top 54.0% on sourcepulse
This package provides an advanced Retrieval-Augmented Generation (RAG) system for querying local documents using Large Language Models (LLMs). It targets developers and researchers seeking a robust, configurable solution for document Q&A, offering enhanced features over basic RAG implementations. The system aims to improve document parsing, search relevance, and user interaction through a flexible YAML configuration.
How It Works
The system employs a hybrid search approach, combining dense embeddings (from models like Sentence-Transformers, Instructor, OpenAI) with sparse embeddings generated by SPLADE. This dual strategy aims to capture both semantic meaning and keyword relevance for more accurate retrieval. It supports incremental indexing, efficient parsing of various document formats (Markdown, PDF, DOCX), and integrates with LiteLLM and Ollama for broad LLM compatibility, including locally hosted models. Advanced features like HyDE, re-ranking, and multi-querying are included to further refine search quality.
Quick Start & Requirements
pip install pyllmsearch
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README notes that enabling HyDE can significantly alter result quality and advises users to consult the associated paper. Image parsing requires the Gemini API, which may incur costs.
2 weeks ago
1 day