fastRAG  by IntelLabs

RAG research framework for efficient generative pipelines

created 2 years ago
1,620 stars

Top 26.5% on sourcepulse

GitHubView on GitHub
Project Summary

fastRAG is a research framework for building efficient retrieval-augmented generation (RAG) pipelines, targeting researchers and developers. It aims to advance RAG by providing optimized components and state-of-the-art LLMs and information retrieval techniques, enabling greater compute efficiency.

How It Works

fastRAG leverages the Haystack and HuggingFace ecosystems, offering full compatibility with Haystack v2+. Its core advantage lies in its optimized components, including efficient bi-encoders, sparse cross-encoders, ColBERT for token-based late interaction, Fusion-in-Decoder (FiD), REPLUG, and the PLAID indexing engine. It also provides backend support for various LLM execution environments, including Intel Gaudi accelerators, ONNX Runtime, OpenVINO, and Llama-CPP.

Quick Start & Requirements

  • Install via pip: pip install fastrag
  • Additional packages for specific features: fastrag[intel], fastrag[openvino], fastrag[qdrant], fastrag[colbert], fastrag[faiss-cpu], fastrag[faiss-gpu].
  • Preliminary requirements: Python 3.8+, PyTorch 2.0+.
  • For bleeding-edge updates, clone the repository and install with pip install ..
  • See Examples for usage.

Highlighted Details

  • Optimized for Intel hardware using Intel extensions for PyTorch (IPEX) and Optimum Intel/Habana.
  • Supports multiple LLM backends: Gaudi2, ONNX Runtime, OpenVINO, Llama-CPP.
  • Features advanced RAG components like ColBERT, FiD, REPLUG, and PLAID.
  • Compatible with Haystack v2+, with recent updates including Gaudi2 and ONNX runtime support.

Maintenance & Community

This is a research framework from Intel Labs. Comments, suggestions, issues, and pull requests are welcomed.

Licensing & Compatibility

Licensed under the Apache 2.0 License. This is not an official Intel product.

Limitations & Caveats

The framework is research-oriented, and users should be aware of potential changes and the need to report issues, especially with the recent Haystack v2+ compatibility.

Health Check
Last commit

6 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
101 stars in the last 90 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera) and Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

InternEvo by InternLM

1.0%
402
Lightweight training framework for model pre-training
created 1 year ago
updated 1 week ago
Starred by George Hotz George Hotz(Author of tinygrad; Founder of the tiny corp, comma.ai), Anton Bukov Anton Bukov(Cofounder of 1inch Network), and
16 more.

tinygrad by tinygrad

0.1%
30k
Minimalist deep learning framework for education and exploration
created 4 years ago
updated 23 hours ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Nat Friedman Nat Friedman(Former CEO of GitHub), and
32 more.

llama.cpp by ggml-org

0.4%
84k
C/C++ library for local LLM inference
created 2 years ago
updated 18 hours ago
Feedback? Help us improve.