RAG pipeline tuning and evaluation tool
Top 97.5% on sourcepulse
RAGTune provides an automated framework for evaluating and optimizing Retrieval-Augmented Generation (RAG) pipelines. It targets developers and researchers seeking to systematically tune LLMs, embedding models, query transformations, and rerankers for improved RAG performance, leveraging Ragas metrics for evaluation.
How It Works
RAGTune orchestrates experiments by allowing users to select and configure various components of a RAG pipeline. It utilizes the Ragas library to evaluate the generated output against ground truth data using metrics like answer relevancy, context precision, and context recall. The tool supports a wide range of LLMs (OpenAI, Cohere, Anthropic) and embedding models, with extensibility for custom additions.
Quick Start & Requirements
pip install -r requirements.txt
libmagic-dev
, poppler-utils
, tesseract-ocr
, libreoffice
, pandoc
..env
file.streamlit run Home.py
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The README mentions that automated optimization is planned but not yet implemented. Users must manually configure and add their datasets with questions and ground truth answers before running evaluations, and skipping steps can lead to errors.
1 year ago
Inactive