fastRAG by IntelLabs

RAG research framework for efficient generative pipelines

Created 3 years ago

1,758 stars

Top 24.2% on SourcePulse

View on GitHub

6 Experts Love This Project

Cofounder of Cloudera

and 2 more!

Project Summary

fastRAG is a research framework for building efficient retrieval-augmented generation (RAG) pipelines, targeting researchers and developers. It aims to advance RAG by providing optimized components and state-of-the-art LLMs and information retrieval techniques, enabling greater compute efficiency.

How It Works

fastRAG leverages the Haystack and HuggingFace ecosystems, offering full compatibility with Haystack v2+. Its core advantage lies in its optimized components, including efficient bi-encoders, sparse cross-encoders, ColBERT for token-based late interaction, Fusion-in-Decoder (FiD), REPLUG, and the PLAID indexing engine. It also provides backend support for various LLM execution environments, including Intel Gaudi accelerators, ONNX Runtime, OpenVINO, and Llama-CPP.

Quick Start & Requirements

Install via pip: pip install fastrag
Additional packages for specific features: fastrag[intel], fastrag[openvino], fastrag[qdrant], fastrag[colbert], fastrag[faiss-cpu], fastrag[faiss-gpu].
Preliminary requirements: Python 3.8+, PyTorch 2.0+.
For bleeding-edge updates, clone the repository and install with pip install ..
See Examples for usage.

Highlighted Details

Optimized for Intel hardware using Intel extensions for PyTorch (IPEX) and Optimum Intel/Habana.
Supports multiple LLM backends: Gaudi2, ONNX Runtime, OpenVINO, Llama-CPP.
Features advanced RAG components like ColBERT, FiD, REPLUG, and PLAID.
Compatible with Haystack v2+, with recent updates including Gaudi2 and ONNX runtime support.

Maintenance & Community

This is a research framework from Intel Labs. Comments, suggestions, issues, and pull requests are welcomed.

Licensing & Compatibility

Licensed under the Apache 2.0 License. This is not an official Intel product.

Limitations & Caveats

The framework is research-oriented, and users should be aware of potential changes and the need to report issues, especially with the recent Haystack v2+ compatibility.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

11 stars in the last 30 days