RAG system for learning, usage, and extension
Top 62.8% on sourcepulse
Easy-RAG is an open-source Retrieval Augmented Generation (RAG) system designed for learning, usage, and extensibility, enabling AI-powered web searches. It caters to users who want to build and customize RAG pipelines, offering features for knowledge base management, multi-turn chat, and internet-based AI search.
How It Works
The system supports various data formats (txt, csv, pdf, docx, mp3, mp4, wav, excel) for knowledge base creation, updating, and deletion. It leverages vectorization with support for Chroma, FAISS, and Elasticsearch. For chat, it offers multi-turn conversations with LLMs and RAG-based Q&A using different retrieval methods including reranking with the BGE-reranker-large model. AI web search is integrated via SearxNG, allowing LLMs to query the internet. Audio/video processing uses funasr for speech-to-text.
Quick Start & Requirements
pip3 install -r requirements.txt
ollama run qwen2:7b
, ollama run mofanke/acge_text_embedding:latest
), SearxNG setup (Docker recommended), and configuration of vector database and reranker paths in Config/config.py
.python webui.py
Highlighted Details
Maintenance & Community
The project is actively updated, with recent additions including AI web search, Elasticsearch support, FAISS support, and reranking. Future plans include more vector database integrations and voice output.
Licensing & Compatibility
The repository does not explicitly state a license in the README. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The initial release focuses on specific vector databases and does not include all planned integrations. The README mentions that funasr model downloads might be slow on first startup. The licensing status requires clarification for commercial adoption.
11 months ago
Inactive