fastRAG  by IntelLabs

RAG research framework for efficient generative pipelines

Created 2 years ago
1,695 stars

Top 25.0% on SourcePulse

GitHubView on GitHub
Project Summary

fastRAG is a research framework for building efficient retrieval-augmented generation (RAG) pipelines, targeting researchers and developers. It aims to advance RAG by providing optimized components and state-of-the-art LLMs and information retrieval techniques, enabling greater compute efficiency.

How It Works

fastRAG leverages the Haystack and HuggingFace ecosystems, offering full compatibility with Haystack v2+. Its core advantage lies in its optimized components, including efficient bi-encoders, sparse cross-encoders, ColBERT for token-based late interaction, Fusion-in-Decoder (FiD), REPLUG, and the PLAID indexing engine. It also provides backend support for various LLM execution environments, including Intel Gaudi accelerators, ONNX Runtime, OpenVINO, and Llama-CPP.

Quick Start & Requirements

  • Install via pip: pip install fastrag
  • Additional packages for specific features: fastrag[intel], fastrag[openvino], fastrag[qdrant], fastrag[colbert], fastrag[faiss-cpu], fastrag[faiss-gpu].
  • Preliminary requirements: Python 3.8+, PyTorch 2.0+.
  • For bleeding-edge updates, clone the repository and install with pip install ..
  • See Examples for usage.

Highlighted Details

  • Optimized for Intel hardware using Intel extensions for PyTorch (IPEX) and Optimum Intel/Habana.
  • Supports multiple LLM backends: Gaudi2, ONNX Runtime, OpenVINO, Llama-CPP.
  • Features advanced RAG components like ColBERT, FiD, REPLUG, and PLAID.
  • Compatible with Haystack v2+, with recent updates including Gaudi2 and ONNX runtime support.

Maintenance & Community

This is a research framework from Intel Labs. Comments, suggestions, issues, and pull requests are welcomed.

Licensing & Compatibility

Licensed under the Apache 2.0 License. This is not an official Intel product.

Limitations & Caveats

The framework is research-oriented, and users should be aware of potential changes and the need to report issues, especially with the recent Haystack v2+ compatibility.

Health Check
Last Commit

8 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
58 stars in the last 30 days

Explore Similar Projects

Starred by Li Jiang Li Jiang(Coauthor of AutoGen; Engineer at Microsoft), Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind), and
1 more.

AutoRAG by Marker-Inc-Korea

0.3%
4k
RAG AutoML tool for optimizing RAG pipelines
Created 1 year ago
Updated 1 day ago
Starred by Jason Knight Jason Knight(Director AI Compilers at NVIDIA; Cofounder of OctoML), Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), and
11 more.

mistral.rs by EricLBuehler

0.3%
6k
LLM inference engine for blazing fast performance
Created 1 year ago
Updated 22 hours ago
Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), and
36 more.

unsloth by unslothai

0.6%
46k
Finetuning tool for LLMs, targeting speed and memory efficiency
Created 1 year ago
Updated 12 hours ago
Feedback? Help us improve.